Crossref journal-article
Springer Science and Business Media LLC
Genome Biology (297)
Abstract

AbstractBowtie is an ultrafast, memory-efficient alignment program for aligning short DNA sequence reads to large genomes. For the human genome, Burrows-Wheeler indexing allows Bowtie to align more than 25 million reads per CPU hour with a memory footprint of approximately 1.3 gigabytes. Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches. Multiple processor cores can be used simultaneously to achieve even greater alignment speeds. Bowtie is open source http://bowtie.cbcb.umd.edu.

Bibliography

Langmead, B., Trapnell, C., Pop, M., & Salzberg, S. L. (2009). Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology, 10(3).

Authors 4
  1. Ben Langmead (first)
  2. Cole Trapnell (additional)
  3. Mihai Pop (additional)
  4. Steven L Salzberg (additional)
References 30 Referenced 19,048
  1. Down TA, Rakyan VK, Turner DJ, Flicek P, Li H, Kulesha E, Graf S, Johnson N, Herrero J, Tomazou EM, Thorne NP, Backdahl L, Herberth M, Howe KL, Jackson DK, Miretti MM, Marioni JC, Birney E, Hubbard TJ, Durbin R, Tavare S, Beck S: A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis. Nat Biotechnol. 2008, 26: 779-785. 10.1038/nbt1414. (10.1038/nbt1414) / Nat Biotechnol by TA Down (2008)
  2. Johnson DS, Mortazavi A, Myers RM, Wold B: Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007, 316: 1497-1502. 10.1126/science.1141319. (10.1126/science.1141319) / Science by DS Johnson (2007)
  3. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008, 18: 1509-1517. 10.1101/gr.079558.108. (10.1101/gr.079558.108) / Genome Res by JC Marioni (2008)
  4. Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, Hall KP, Evers DJ, Barnes CL, Bignell HR, Boutell JM, Bryant J, Carter RJ, Keira Cheetham R, Cox AJ, Ellis DJ, Flatbush MR, Gormley NA, Humphray SJ, Irving LJ, Karbelashvili MS, Kirk SM, Li H, Liu X, Maisinger KS, Murray LJ, Obradovic B, Ost T, Parkinson ML, Pratt MR, et al: Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008, 456: 53-59. 10.1038/nature07517. (10.1038/nature07517) / Nature by DR Bentley (2008)
  5. Ley TJ, Mardis ER, Ding L, Fulton B, McLellan MD, Chen K, Dooling D, Dunford-Shore BH, McGrath S, Hickenbotham M, Cook L, Abbott R, Larson DE, Koboldt DC, Pohl C, Smith S, Hawkins A, Abbott S, Locke D, Hillier LW, Miner T, Fulton L, Magrini V, Wylie T, Glasscock J, Conyers J, Sander N, Shi X, Osborne JR, Minx P, et al: DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature. 2008, 456: 66-72. 10.1038/nature07485. (10.1038/nature07485) / Nature by TJ Ley (2008)
  6. Wang J, Wang W, Li R, Li Y, Tian G, Goodman L, Fan W, Zhang J, Li J, Zhang J, Guo Y, Feng B, Li H, Lu Y, Fang X, Liang H, Du Z, Li D, Zhao Y, Hu Y, Yang Z, Zheng H, Hellmann I, Inouye M, Pool J, Yi X, Zhao J, Duan J, Zhou Y, Qin J, et al: The diploid genome sequence of an Asian individual. Nature. 2008, 456: 60-65. 10.1038/nature07484. (10.1038/nature07484) / Nature by J Wang (2008)
  7. Li H, Ruan J, Durbin R: Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008, 18: 1851-1858. 10.1101/gr.078212.108. (10.1101/gr.078212.108) / Genome Res by H Li (2008)
  8. Li R, Li Y, Kristiansen K, Wang J: SOAP: short oligonucleotide alignment program. Bioinformatics. 2008, 24: 713-714. 10.1093/bioinformatics/btn025. (10.1093/bioinformatics/btn025) / Bioinformatics by R Li (2008)
  9. Kaiser J: DNA sequencing. A plan to capture human diversity in 1000 genomes. Science. 2008, 319: 395-10.1126/science.319.5862.395. (10.1126/science.319.5862.395) / Science by J Kaiser (2008)
  10. Smith AD, Xuan Z, Zhang MQ: Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics. 2008, 9: 128-10.1186/1471-2105-9-128. (10.1186/1471-2105-9-128) / BMC Bioinformatics by AD Smith (2008)
  11. Lin H, Zhang Z, Zhang MQ, Ma B, Li M: ZOOM! Zillions Of Oligos Mapped. Bioinformatics. 2008, 24: 2431-2437. 10.1093/bioinformatics/btn416. (10.1093/bioinformatics/btn416) / Bioinformatics by H Lin (2008)
  12. SHRiMP - SHort Read Mapping Package. [http://compbio.cs.toronto.edu/shrimp/]
  13. Baeza-Yates RA, Perleberg CH: Fast and practical approximate string matching. Inf Process Lett. 1996, 59: 21-27. 10.1016/0020-0190(96)00083-X. (10.1016/0020-0190(96)00083-X) / Inf Process Lett by RA Baeza-Yates (1996)
  14. Burkhardt S, Kärkkäinen J: Better Filtering with Gapped q-Grams. Fundam Inf. 2003, 56: 51-70. / Fundam Inf by S Burkhardt (2003)
  15. Ma B, Tromp J, Li M: PatternHunter: faster and more sensitive homology search. Bioinformatics. 2002, 18: 440-445. 10.1093/bioinformatics/18.3.440. (10.1093/bioinformatics/18.3.440) / Bioinformatics by B Ma (2002)
  16. Smith TF, Waterman MS: Identification of common molecular subsequences. J Mol Biol. 1981, 147: 195-197. 10.1016/0022-2836(81)90087-5. (10.1016/0022-2836(81)90087-5) / J Mol Biol by TF Smith (1981)
  17. Burrows M, Wheeler DJ: A Block Sorting Lossless Data Compression Algorithm. Technical Report 124. 1994, Palo Alto, CA: Digital Equipment Corporation / A Block Sorting Lossless Data Compression Algorithm. Technical Report 124 by M Burrows (1994)
  18. Ferragina P, Manzini G: Opportunistic data structures with applications. [http://web.unipmn.it/~manzini/papers/focs00draft.pdf]
  19. Ferragina P, Manzini G: An experimental study of an opportunistic index. Proceedings of the Twelfth Annual ACM-SIAM Symposium on Discrete algorithms. 2001, Washington, DC: Society for Industrial and Applied Mathematics, 269-278. / Proceedings of the Twelfth Annual ACM-SIAM Symposium on Discrete algorithms by P Ferragina (2001)
  20. Healy J, Thomas EE, Schwartz JT, Wigler M: Annotating large genomes with exact word matches. Genome Res. 2003, 13: 2306-2315. 10.1101/gr.1350803. (10.1101/gr.1350803) / Genome Res by J Healy (2003)
  21. Lippert RA: Space-efficient whole genome comparisons with Burrows-Wheeler transforms. J Comput Biol. 2005, 12: 407-415. 10.1089/cmb.2005.12.407. (10.1089/cmb.2005.12.407) / J Comput Biol by RA Lippert (2005)
  22. Graf S, Nielsen FG, Kurtz S, Huynen MA, Birney E, Stunnenberg H, Flicek P: Optimized design and assessment of whole genome tiling arrays. Bioinformatics. 2007, 23: i195-i204. 10.1093/bioinformatics/btm200. (10.1093/bioinformatics/btm200) / Bioinformatics by S Graf (2007)
  23. Lam TW, Sung WK, Tam SL, Wong CK, Yiu SM: Compressed indexing and local alignment of DNA. Bioinformatics. 2008, 24: 791-797. 10.1093/bioinformatics/btn032. (10.1093/bioinformatics/btn032) / Bioinformatics by TW Lam (2008)
  24. Ewing B, Green P: Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998, 8: 186-194. (10.1101/gr.8.3.186) / Genome Res by B Ewing (1998)
  25. Bowtie: An ultrafast memory-efficient short read aligner. [http://bowtie.cbcb.umd.edu/]
  26. Campbell PJ, Stephens PJ, Pleasance ED, O'Meara S, Li H, Santarius T, Stebbings LA, Leroy C, Edkins S, Hardy C, Teague JW, Menzies A, Goodhead I, Turner DJ, Clee CM, Quail MA, Cox A, Brown C, Durbin R, Hurles ME, Edwards PA, Bignell GR, Stratton MR, Futreal PA: Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing. Nat Genet. 2008, 40: 722-729. 10.1038/ng.128. (10.1038/ng.128) / Nat Genet by PJ Campbell (2008)
  27. Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, Goodhead I, Rance R, Baker S, Maskell DJ, Wain J, Dolecek C, Achtman M, Dougan G: High-throughput sequencing provides insights into genome variation and evolution in Salmonella typhi. Nat Genet. 2008, 40: 987-993. 10.1038/ng.195. (10.1038/ng.195) / Nat Genet by KE Holt (2008)
  28. Nagalakshmi U, Wang Z, Waern K, Shou C, Raha D, Gerstein M, Snyder M: The transcriptional landscape of the yeast genome defined by RNA sequencing. Science. 2008, 320: 1344-1349. 10.1126/science.1158441. (10.1126/science.1158441) / Science by U Nagalakshmi (2008)
  29. Kärkkäinen J: Fast BWT in small space by blockwise suffix sorting. Theor Comput Sci. 2007, 387: 249-257. (10.1016/j.tcs.2007.07.018) / Theor Comput Sci by J Kärkkäinen (2007)
  30. Doring A, Weese D, Rausch T, Reinert K: SeqAn an efficient, generic C++ library for sequence analysis. BMC Bioinformatics. 2008, 9: 11-10.1186/1471-2105-9-11. (10.1186/1471-2105-9-11) / BMC Bioinformatics by A Doring (2008)
Dates
Type When
Created 16 years, 6 months ago (March 4, 2009, 2:13 p.m.)
Deposited 10 months, 4 weeks ago (Oct. 8, 2024, 6:17 a.m.)
Indexed 37 minutes ago (Sept. 7, 2025, 9:43 a.m.)
Issued 16 years, 6 months ago (March 4, 2009)
Published 16 years, 6 months ago (March 4, 2009)
Published Online 16 years, 6 months ago (March 4, 2009)
Funders 0

None

@article{Langmead_2009, title={Ultrafast and memory-efficient alignment of short DNA sequences to the human genome}, volume={10}, ISSN={1474-760X}, url={http://dx.doi.org/10.1186/gb-2009-10-3-r25}, DOI={10.1186/gb-2009-10-3-r25}, number={3}, journal={Genome Biology}, publisher={Springer Science and Business Media LLC}, author={Langmead, Ben and Trapnell, Cole and Pop, Mihai and Salzberg, Steven L}, year={2009}, month=mar }