Crossref journal-article
Cold Spring Harbor Laboratory
Genome Research (246)
Abstract

Comparisons of orthologous genomic DNA sequences can be used to characterize regions that have been subject to purifying selection and are enriched for functional elements. We here present the results of such an analysis on an alignment of sequences from 29 mammalian species. The alignment captures ∼3.9 neutral substitutions per site and spans ∼1.9 Mbp of the human genome. We identify constrained elements from 3 bp to over 1 kbp in length, covering ∼5.5% of the human locus. Our estimate for the total amount of nonexonic constraint experienced by this locus is roughly twice that for exonic constraint. Constrained elements tend to cluster, and we identify large constrained regions that correspond well with known functional elements. While constraint density inversely correlates with mobile element density, we also show the presence of unambiguously constrained elements overlapping mammalian ancestral repeats. In addition, we describe a number of elements in this region that have undergone intense purifying selection throughout mammalian evolution, and we show that these important elements are more numerous than previously thought. These results were obtained with Genomic Evolutionary Rate Profiling (GERP), a statistically rigorous and biologically transparent framework for constrained element identification. GERP identifies regions at high resolution that exhibit nucleotide substitution deficits, and measures these deficits as “rejected substitutions.” Rejected substitutions reflect the intensity of past purifying selection and are used to rank and characterize constrained elements. We anticipate that GERP and the types of analyses it facilitates will provide further insights and improved annotation for the human genome as mammalian genome sequence data become richer.

Bibliography

Cooper, G. M., Stone, E. A., Asimenos, G., Green, E. D., Batzoglou, S., & Sidow, A. (2005). Distribution and intensity of constraint in mammalian genomic sequence. Genome Research, 15(7), 901–913.

Authors 6
  1. Gregory M. Cooper (first)
  2. Eric A. Stone (additional)
  3. George Asimenos (additional)
  4. Eric D. Green (additional)
  5. Serafim Batzoglou (additional)
  6. Arend Sidow (additional)
References 69 Referenced 1,204
  1. 10.1126/science.1072104
  2. 10.1242/dev.124.10.1851 / Development (1997)
  3. 10.1126/science.1098119
  4. 10.1073/pnas.231608898
  5. 10.1101/gr.2648404
  6. 10.1126/science.1081331
  7. 10.1038/ng1090 / Nat. Genet. (2003)
  8. 10.1016/S0378-1119(97)00399-5
  9. 10.1101/gr.926603
  10. 10.1093/bioinformatics/btg1005
  11. 10.1101/gr.2067704
  12. 10.1242/dev.01390
  13. 10.1093/nar/19.13.3667
  14. 10.1038/nature01626
  15. 10.1016/j.gde.2003.10.001
  16. 10.1101/gr.1064503
  17. 10.1101/gr.2034704
  18. 10.1186/1471-2105-5-192
  19. 10.1038/nature01251
  20. 10.1126/science.1087047
  21. 10.1101/gr.142200
  22. 10.1016/j.gde.2003.10.008
  23. 10.1126/science.1105136
  24. 10.1038/16915
  25. 10.1089/10665270252935494
  26. 10.1101/gr.716103
  27. 10.1101/gr.45502
  28. {'key': '2021111811004616000_15.7.901.28', 'first-page': '4919', 'volume': '12', 'year': '1992', 'journal-title': 'Mol. Cell. Biol.'} / Mol. Cell. Biol. (1992)
  29. 10.1016/S0168-9525(00)02081-3
  30. 10.1101/gr.844103
  31. 10.1002/(SICI)1097-0169(1997)38:2<120::AID-CM2>3.0.CO;2-B
  32. 10.1007/BF02101694
  33. 10.1038/nature03154
  34. 10.1073/pnas.0404142101
  35. 10.1016/S0168-9525(02)00006-9
  36. 10.1371/journal.pbio.0030042
  37. 10.1101/gr.529803
  38. Kimura, M. 1983. The neutral theory of molecular evolution. Cambridge University Press, Cambridge, New York. (10.1017/CBO9780511623486)
  39. Li, W.-H. 1997. Molecular evolution. Sinauer Associates, Sunderland, MA.
  40. 10.1038/35054544
  41. 10.1101/gr.1602203
  42. 10.1073/pnas.012591199
  43. 10.1093/bioinformatics/16.11.1046
  44. 10.1038/nature01262
  45. 10.1038/35054550
  46. 10.1016/S0168-9525(01)02445-3
  47. 10.1126/science.1088328
  48. 10.1126/science.286.5439.458
  49. 10.1016/j.devcel.2004.09.004
  50. 10.1038/35052548
  51. 10.1126/science.1064852
  52. 10.1093/nar/29.1.137
  53. 10.1038/nature02426
  54. 10.1093/bioinformatics/btg459
  55. 10.1089/1066527041410472
  56. {'key': '2021111811004616000_15.7.901.56', 'first-page': '468', 'volume': '21', 'year': '2004', 'journal-title': 'Mol. Biol. Evol.'} / Mol. Biol. Evol. (2004)
  57. 10.1017/S0016672303006268
  58. 10.1101/gr.1208803
  59. 10.1006/geno.2000.6422
  60. 10.1038/nature01858
  61. 10.1371/journal.pbio.0030007 / PLoS Biol. (2004)
  62. {'key': '2021111811004616000_15.7.901.62', 'first-page': '555', 'volume': '13', 'year': '1997', 'journal-title': 'CABIOS'} / CABIOS (1997)
  63. 10.1101/gr.1984404
  64. http://blast.wustl.edu; WU-BLAST homepage.
  65. http://www.repeatmasker.org; RepeatMasker homepage.
  66. http://mendel.stanford.edu/sidowlab; Sidow Lab homepage.
  67. http://genome.ucsc.edu; UCSC Genome Browser homepage.
  68. http://www.nisc.nih.gov/data; NISC Comparative Sequencing Program homepage.
  69. http://www.genome.gov/10002154; NHGRI Genome Sequencing Proposals.
Dates
Type When
Created 20 years, 2 months ago (June 17, 2005, 8:34 p.m.)
Deposited 3 years, 9 months ago (Nov. 18, 2021, 2:36 p.m.)
Indexed 3 days, 7 hours ago (Aug. 29, 2025, 5:50 a.m.)
Issued 20 years, 2 months ago (June 17, 2005)
Published 20 years, 2 months ago (June 17, 2005)
Published Online 20 years, 2 months ago (June 17, 2005)
Published Print 20 years, 2 months ago (July 1, 2005)
Funders 0

None

@article{Cooper_2005, title={Distribution and intensity of constraint in mammalian genomic sequence}, volume={15}, ISSN={1088-9051}, url={http://dx.doi.org/10.1101/gr.3577405}, DOI={10.1101/gr.3577405}, number={7}, journal={Genome Research}, publisher={Cold Spring Harbor Laboratory}, author={Cooper, Gregory M. and Stone, Eric A. and Asimenos, George and Green, Eric D. and Batzoglou, Serafim and Sidow, Arend}, year={2005}, month=jun, pages={901–913} }