Abstract
Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
Bibliography
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., Paulovich, A., Pomeroy, S. L., Golub, T. R., Lander, E. S., & Mesirov, J. P. (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences, 102(43), 15545–15550.
Authors
11
- Aravind Subramanian (first)
- Pablo Tamayo (additional)
- Vamsi K. Mootha (additional)
- Sayan Mukherjee (additional)
- Benjamin L. Ebert (additional)
- Michael A. Gillette (additional)
- Amanda Paulovich (additional)
- Scott L. Pomeroy (additional)
- Todd R. Golub (additional)
- Eric S. Lander (additional)
- Jill P. Mesirov (additional)
References
34
Referenced
43,615
10.1126/science.270.5235.467
10.1038/nbt1296-1675
10.1126/science.1086384
10.1038/ng1180
10.1073/pnas.1032913100
10.1056/NEJMoa031314
- Hollander, M. & Wolfe, D. A. (1999) Nonparametric Statistical Methods (Wiley, New York).
10.1016/S0166-4328(01)00297-2
10.1093/bioinformatics/btf877
10.1016/S0092-8674(03)00570-1
10.1038/nature03441
10.1146/annurev.genet.36.042902.092433
10.1073/pnas.96.25.14440
10.1159/000071572
10.1002/humu.10081
10.1038/ng765
10.1073/pnas.94.13.6948
- Barbouti, A., Hoglund, M., Johansson, B., Lassen, C., Nilsson, P. G., Hagemeijer, A., Mitelman, F. & Fioretos, T. (2003) Cancer Res. 63, 1202–1206. 12649177
10.1038/sj.leu.2401482
10.1038/sj.onc.1205573
10.1016/S0268-960X(03)00040-7
10.1073/pnas.191502998
10.1038/nm733
10.1073/pnas.241500798
10.1038/ncb985
10.1007/s00109-002-0355-1
10.1128/MCB.22.15.5575-5584.2002
10.1158/1078-0432.CCR-0629-3
- Monti, S., Savage, K. J., Kutok, J. L., Feuerhake, F., Kurtin, P., Mihm, M., Wu, B., Pasqualucci, L., Neuberg, D., Aguiar, R. C., et al. (2004) Blood 105, 1851–1861. 15550490
10.1038/nm1052
10.1038/ng1490
10.1186/gb-2003-4-1-r7
10.2165/00822942-200403040-00009
10.1093/bioinformatics/btg363
Dates
Type | When |
---|---|
Created | Sept. 30, 2005, 8:33 p.m. |
Deposited | April 12, 2022, 2:03 p.m. |
Indexed | Sept. 4, 2025, 9:56 a.m. |
Issued | Sept. 30, 2005 |
Published | Sept. 30, 2005 |
Published Online | Sept. 30, 2005 |
Published Print | Oct. 25, 2005 |
@article{Subramanian_2005,
  title={Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles},
  volume={102},
  ISSN={1091-6490},
  url={http://dx.doi.org/10.1073/pnas.0506580102},
  DOI={10.1073/pnas.0506580102},
  number={43},
  journal={Proceedings of the National Academy of Sciences},
  publisher={Proceedings of the National Academy of Sciences},
  author={Subramanian, Aravind and Tamayo, Pablo and Mootha, Vamsi K. and Mukherjee, Sayan and Ebert, Benjamin L. and Gillette, Michael A. and Paulovich, Amanda and Pomeroy, Scott L. and Golub, Todd R. and Lander, Eric S. and Mesirov, Jill P.},
  year={2005},
  month=sep,
  pages={15545–15550}
}