Abstract
Comparisons of orthologous genomic DNA sequences can be used to characterize regions that have been subject to purifying selection and are enriched for functional elements. We here present the results of such an analysis on an alignment of sequences from 29 mammalian species. The alignment captures ∼3.9 neutral substitutions per site and spans ∼1.9 Mbp of the human genome. We identify constrained elements from 3 bp to over 1 kbp in length, covering ∼5.5% of the human locus. Our estimate for the total amount of nonexonic constraint experienced by this locus is roughly twice that for exonic constraint. Constrained elements tend to cluster, and we identify large constrained regions that correspond well with known functional elements. While constraint density inversely correlates with mobile element density, we also show the presence of unambiguously constrained elements overlapping mammalian ancestral repeats. In addition, we describe a number of elements in this region that have undergone intense purifying selection throughout mammalian evolution, and we show that these important elements are more numerous than previously thought. These results were obtained with Genomic Evolutionary Rate Profiling (GERP), a statistically rigorous and biologically transparent framework for constrained element identification. GERP identifies regions at high resolution that exhibit nucleotide substitution deficits, and measures these deficits as “rejected substitutions.” Rejected substitutions reflect the intensity of past purifying selection and are used to rank and characterize constrained elements. We anticipate that GERP and the types of analyses it facilitates will provide further insights and improved annotation for the human genome as mammalian genome sequence data become richer.
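As a rough illustration of the "rejected substitutions" idea described above, the sketch below scores each alignment column as the deficit between an expected (neutral) substitution count and an observed count, then groups consecutive deficit columns into candidate elements ranked by their summed score. This is not the GERP implementation: the function names, the run-based grouping rule, and the observed counts are hypothetical, and the value 3.9 is borrowed from the abstract only as a plausible neutral expectation per site.

```python
from typing import List, Tuple


def rejected_substitutions(expected: List[float], observed: List[float]) -> List[float]:
    """Per-column deficit: expected neutral substitutions minus observed ones.

    Positive values indicate fewer substitutions than neutrality predicts.
    """
    if len(expected) != len(observed):
        raise ValueError("expected and observed must have the same length")
    return [e - o for e, o in zip(expected, observed)]


def candidate_elements(rs: List[float]) -> List[Tuple[int, int, float]]:
    """Group maximal runs of columns with a positive deficit.

    Returns (start, end, summed_score) tuples with end exclusive.
    This grouping rule is a simplifying assumption for illustration.
    """
    elements = []
    start = None
    total = 0.0
    for i, score in enumerate(rs):
        if score > 0:
            if start is None:
                start, total = i, 0.0
            total += score
        elif start is not None:
            elements.append((start, i, total))
            start = None
    if start is not None:
        elements.append((start, len(rs), total))
    return elements


# Hypothetical data: six columns, each with a neutral expectation of 3.9.
expected = [3.9] * 6
observed = [4.0, 1.0, 0.5, 3.9, 5.0, 2.0]

rs = rejected_substitutions(expected, observed)
# Rank candidate elements by summed rejected substitutions, strongest first.
ranked = sorted(candidate_elements(rs), key=lambda el: el[2], reverse=True)
for start, end, score in ranked:
    print(f"columns {start}-{end - 1}: {score:.1f} rejected substitutions")
```

Summing the per-column deficits over an element gives a single score that grows with both the length of the element and the strength of the deficit, which is why it can serve as a ranking criterion.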
Dates
Type | When |
---|---|
Created | June 17, 2005, 8:34 p.m. |
Deposited | Nov. 18, 2021, 2:36 p.m. |
Indexed | Aug. 29, 2025, 5:50 a.m. |
Issued | June 17, 2005 |
Published | June 17, 2005 |
Published Online | June 17, 2005 |
Published Print | July 1, 2005 |
@article{Cooper_2005, title={Distribution and intensity of constraint in mammalian genomic sequence}, volume={15}, ISSN={1088-9051}, url={http://dx.doi.org/10.1101/gr.3577405}, DOI={10.1101/gr.3577405}, number={7}, journal={Genome Research}, publisher={Cold Spring Harbor Laboratory}, author={Cooper, Gregory M. and Stone, Eric A. and Asimenos, George and Green, Eric D. and Batzoglou, Serafim and Sidow, Arend}, year={2005}, month=jun, pages={901–913} }