Abstract
To compare entire genomes from different species, biologists increasingly need alignment methods that are efficient enough to handle long sequences, and accurate enough to correctly align the conserved biological features between distant species. We present LAGAN, a system for rapid global alignment of two homologous genomic sequences, and Multi-LAGAN, a system for multiple global alignment of genomic sequences. We tested our systems on a data set consisting of greater than 12 Mb of high-quality sequence from 12 vertebrate species. All the sequence was derived from the genomic region orthologous to an ∼1.5-Mb region on human chromosome 7q31.3. We found that both LAGAN and Multi-LAGAN compare favorably with other leading alignment methods in correctly aligning protein-coding exons, especially between distant homologs such as human and chicken, or human and fugu. Multi-LAGAN produced the most accurate alignments, while requiring just 75 minutes on a personal computer to obtain the multiple alignment of all 12 sequences. Multi-LAGAN is a practical method for generating multiple alignments of long genomic sequences at any evolutionary distance. Our systems are publicly available athttp://lagan.stanford.edu.
References
37
Referenced
880
10.1006/jmbi.1990.9999
10.1093/nar/25.17.3389
10.1089/cmb.1997.4.369
/ J. Comp. Biol. / Re-Aligner: A program for refining DNA sequence multialignments. by Anson (1997)10.1016/0022-2836(87)90316-0
10.1101/gr.10.7.950
10.1016/S0304-3975(99)00324-2
10.1101/gr.789803
-
Brudno M. Morgenstern B. (2002) Fast and sensitive alignment of large genomic sequences. Proceeding of the IEEE Computer Society Bioinformatics Conference (CSB) ..
(
10.1186/1471-2105-4-66
) 10.1093/nar/27.11.2369
10.1093/nar/30.11.2478
10.1101/gr.142200
10.1145/146637.146656
10.1006/jmbi.1996.0679
10.1101/gr.45502
- Gusfield D. (1999) Algorithms on strings, trees, and sequences: Computer science and computational biology (Cambridge University Press, Cambridge, UK), pp 351â353.
10.3109/10425179309015629
/ DNA Seq. / Positive and negative regulatory elements of the rabbit ε-globin gene revealed by an improved multiple alignment program and functional analysis. by Hardison (1993)10.1006/geno.1994.1275
10.1073/pnas.89.22.10915
10.1093/bioinformatics/18.suppl_1.S312
/ Bioinformatics / Efficient multiple genome alignment. by Höhl (2002)10.1093/bioinformatics/17.9.803
10.1101/gr.229202. Article published online before March 2002
10.1101/gr.10.8.1115
10.1093/bioinformatics/16.11.1046
10.1093/bioinformatics/17.5.391
10.1093/bioinformatics/15.3.211
10.1093/bioinformatics/14.3.290
10.1016/0022-2836(70)90057-4
10.1101/gr.194201
10.1006/jmbi.2000.4042
10.1093/bioinformatics/15.11.909
10.1101/gr.10.4.577
10.1073/pnas.042692299
10.1016/0022-2836(81)90087-5
10.1006/jtbi.1996.0213
10.1007/BF02143508
10.1093/nar/22.22.4673
10.1089/cmb.1994.1.337
Dates
Type | When |
---|---|
Created | 22 years, 5 months ago (April 1, 2003, 10:44 p.m.) |
Deposited | 8 months, 3 weeks ago (Dec. 11, 2024, 8:40 p.m.) |
Indexed | 5 days, 1 hour ago (Aug. 30, 2025, 12:22 p.m.) |
Issued | 22 years, 5 months ago (March 12, 2003) |
Published | 22 years, 5 months ago (March 12, 2003) |
Published Online | 22 years, 5 months ago (March 12, 2003) |
Published Print | 22 years, 5 months ago (April 1, 2003) |
@article{Brudno_2003, title={LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNA}, volume={13}, ISSN={1549-5469}, url={http://dx.doi.org/10.1101/gr.926603}, DOI={10.1101/gr.926603}, number={4}, journal={Genome Research}, publisher={Cold Spring Harbor Laboratory}, author={Brudno, Michael and Do, Chuong B. and Cooper, Gregory M. and Kim, Michael F. and Davydov, Eugene and Program, NISC Comparative Sequencing and Green, Eric D. and Sidow, Arend and Batzoglou, Serafim}, year={2003}, month=mar, pages={721–731} }