TY - JOUR
T1 - Ultra-conserved sequences in the genomes of highly diverse Anopheles mosquitoes, with implications for malaria vector control
AU - O'Loughlin, Samantha M.
AU - Forster, Annie J.
AU - Fuchs, Silke
AU - Dottorini, Tania
AU - Nolan, Tony
AU - Crisanti, Andrea
AU - Burt, Austin
N1 - Publisher Copyright: © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America.
PY - 2021/6
Y1 - 2021/6
N2 - DNA sequences that are exactly conserved over long evolutionary time scales have been observed in a variety of taxa. Such sequences are likely under strong functional constraint and they have been useful in the field of comparative genomics for identifying genome regions with regulatory function. A potential new application for these ultra-conserved elements (UCEs) has emerged in the development of gene drives to control mosquito populations. Many gene drives work by recognizing and inserting at a specific target sequence in the genome, often imposing a reproductive load as a consequence. They can therefore select for target sequence variants that provide resistance to the drive. Focusing on highly conserved, highly constrained sequences lowers the probability that variant, gene drive-resistant alleles can be tolerated. Here, we search for conserved sequences of 18 bp and over in an alignment of 21 Anopheles genomes, spanning an evolutionary timescale of 100 million years, and characterize the resulting sequences according to their location and function. Over 8000 UCEs were found across the alignment, with a maximum length of 164 bp. Length-corrected gene ontology analysis revealed that genes containing Anopheles UCEs were over-represented in categories with structural or nucleotide-binding functions. Known insect transcription factor binding sites were found in 48% of intergenic Anopheles UCEs. When we looked at the genome sequences of 1142 wild-caught mosquitoes, we found that 15% of the Anopheles UCEs contained no polymorphisms. Our list of Anopheles UCEs should provide a valuable starting point for the selection and testing of new targets for gene-drive modification in the mosquitoes that transmit malaria.
KW - Anopheles
KW - Conserved
KW - Gene drive
KW - Malaria
UR - http://www.scopus.com/inward/record.url?scp=85111492938&partnerID=8YFLogxK
U2 - 10.1093/g3journal/jkab086
DO - 10.1093/g3journal/jkab086
M3 - Article
C2 - 33730159
AN - SCOPUS:85111492938
SN - 2160-1836
VL - 11
JO - G3: Genes, Genomes, Genetics
JF - G3: Genes, Genomes, Genetics
IS - 6
M1 - jkab086
ER -