Crossref journal-article
Oxford University Press (OUP)
Bioinformatics (286)
Abstract

Feature selection techniques have become an apparent need in many bioinformatics applications. In addition to the large pool of techniques that have already been developed in the machine learning and data mining fields, specific applications in bioinformatics have led to a wealth of newly proposed techniques.

In this article, we make the interested reader aware of the possibilities of feature selection, providing a basic taxonomy of feature selection techniques, and discussing their use, variety and potential in a number of both common as well as upcoming bioinformatics applications.

Contact: yvan.saeys@psb.ugent.be

Supplementary information: http://bioinformatics.psb.ugent.be/supplementary_data/yvsae/fsreview

Bibliography

Saeys, Y., Inza, I., & Larrañaga, P. (2007). A review of feature selection techniques in bioinformatics. Bioinformatics, 23(19), 2507–2517.

Authors 3
  1. Yvan Saeys (first)
  2. Iñaki Inza (additional)
  3. Pedro Larrañaga (additional)
References 137 Referenced 3,893
  1. 10.1073/pnas.96.12.6745 / Proc. Nat. Acad. Sci. USA / Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays by Alon (1999)
  2. 10.2165/00822942-200504030-00004 / Appl. Bioinformatics / Feature selection and the class imbalance problem in predicting protein function from sequence by Al-Shahib (2005)
  3. 10.1073/pnas.102102699 / Proc. Nat. Acad. Sci. USA / Selection bias in gene extraction on the basis of microarray gene-expression data by Ambroise (2002)
  4. 10.1093/bioinformatics/17.6.509 / Bioinformatics / A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes by Baldi (2001)
  5. 10.1093/bioinformatics/18.3.395 / Bioinformatics / An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers by Ball (2002)
  6. Handbook of Statistics II / Pattern recognition and reduction of dimensionality by Ben-Bassat (1982), p. 773
  7. 10.1089/106652700750050943 / J. Comput. Biol. / Tissue classification with gene expression profiles by Ben-Dor (2000)
  8. 10.1002/pmic.200500192 / Proteomics / A robust meta classification strategy for cancer detection from MS data by Bhanot (2006)
  9. 10.1142/S0218001404003800 / Int. J. Pattern Recognit. Artif. Intell. / Gene selection for cancer classification using wrapper approaches by Blanco (2004)
  10. 10.1186/gb-2002-3-4-research0017 / Genome Biol. / New feature subset selection procedures for classification of expression profiles by Bø (2002)
  11. 10.1093/bioinformatics/btg419 / Bioinformatics / Is cross-validation valid for small-sample microarray classification? by Braga-Neto (2004)
  12. 10.1016/j.febslet.2004.07.055 / FEBS Lett. / Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments by Breitling (2004)
  13. 10.1093/bioinformatics/bti760 / Bioinformatics / PCP: a program for supervised classification of gene expression profiles by Buturovic (2005)
  14. Perception Systèmes et Information / SVM and Kernel Methods Matlab Toolbox by Canu (2003)
  15. 10.1086/381000 / Am. J. Hum. Genet. / Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium by Carlson (2004)
  16. 10.1093/bioinformatics/14.2.139 / Bioinformatics / Feature selection for genetic sequence classification by Chuzhanova (1998)
  17. 10.1093/bib/6.1.57 / Brief. Bioinformatics / A survey of current work in biomedical text mining by Cohen (2005)
  18. Int. J. Inf. Technol / A comparative study on feature selection for E. coli promoter recognition by Conilione (2005), vol. 11, p. 54
  19. 10.1007/978-0-387-47509-7_4 / Fundamentals of Data Mining in Genomics and Proteomics / Pre-processing mass spectrometry data by Coombes (2007)
  20. Combined optimization of feature selection and algorithm parameter interaction in machine learning of language by Daelemans (2003), p. 84
  21. 10.1038/ng1001-229 / Nat. Genet. / High-resolution haplotype structure in the human genome by Daly (2001)
  22. 10.1186/1471-2105-6-173 / BMC Bioinformatics / Normal uniform mixture differential gene expression detection in cDNA microarrays by Dean (2005)
  23. 10.1093/bioinformatics/18.suppl_2.S75 / Bioinformatics / Feature subset selection for splice site prediction by Degroeve (2002)
  24. 10.1093/nar/27.23.4636 / Nucleic Acids Res. / Improved microbial gene identification with GLIMMER by Delcher (1999)
  25. 10.1186/1471-2105-7-3 / BMC Bioinformatics / Gene selection and classification of microarray data using random forest by Díaz-Uriarte (2006)
  26. Minimum redundancy feature selection from microarray gene expression data by Ding (2003), p. 523
  27. 10.1093/bioinformatics/btg1011 / Bioinformatics / Combining NLP and probabilistic categorisation for document and term selection for Swiss-Prot medical annotation by Dobrokhotov (2003)
  28. Pattern Classification by Duda (2001)
  29. 10.1198/016214502753479248 / J. Am. Stat. Assoc / Comparison of discriminant methods for the classification of tumors using gene expression data by Dudoit (2002)
  30. 10.1214/ss/1056397487 / Stat. Sci. / Multiple hypothesis testing in microarray experiments by Dudoit (2003)
  31. 10.1198/016214501753382129 / J. Am. Stat. Assoc. / Empirical Bayes analysis of a microarray experiment by Efron (2001)
  32. Lecture Notes in Artificial Intelligence / PubMiner: machine learning-based text mining for biomedical information analysis by Eom (2000), vol. 3192, p. 216
  33. Pattern Recognition in Practice IV, Multiple Paradigms, Comparative Studies and Hybrid Systems by Ferri (1994), p. 403
  34. J. Mach. Learn. Res. / An extensive empirical study of feature selection metrics for text classification by Forman (2003), vol. 3, p. 1289
  35. 10.1186/1471-2105-7-126 / BMC Bioinformatics / A two-sample Bayesian t-test for microarray data by Fox (2006)
  36. 10.1126/science.1069424 / Science / The structure of haplotype blocks in the human genome by Gabriel (2002)
  37. 10.1093/bioinformatics/bti494 / Bioinformatics / Proteomic mass spectra classification using decision tree based ensemble methods by Geurts (2005)
  38. 10.1093/bioinformatics/btl230 / Bioinformatics / Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks by Gevaert (2006)
  39. 10.1155/JBB.2005.147 / J. Biomed. Biotechnol. / Classification and selection of biomarkers in genomic data using LASSO by Ghosh (2005)
  40. 10.1126/science.286.5439.531 / Science / Molecular classification of cancer: class discovery and class prediction by gene expression monitoring by Golub (1999)
  41. Lecture Notes in Computer Science 3614 / Application of genetic algorithm-support vector machine hybrid for prediction of clinical phenotypes based on genome-wide SNP profiles of sib pairs by Gong (2005), p. 830
  42. 10.1093/bioinformatics/btl196 / Bioinformatics / Comparative gene marker selection suite by Gould (2006)
  43. J. Mach. Learn. Res. / An introduction to variable and feature selection by Guyon (2003), vol. 3, p. 1157
  44. 10.1023/A:1012487302797 / Mach. Learn. / Gene selection for cancer classification using support vector machines by Guyon (2002)
  45. PhD Thesis / Correlation-based feature selection for machine learning by Hall (1999)
  46. 10.1093/bioinformatics/bti1021 / Bioinformatics / Tag SNP selection in genotype data for maximizing SNP prediction accuracy by Halperin (2005)
  47. 10.1093/bioinformatics/btl350 / Bioinformatics / Substring selection for biomedical document classification by Han (2006)
  48. 10.1093/bioinformatics/btl420 / Bioinformatics / MLR-tagging: informative SNP selection for unphased genotypes based on multiple linear regression by He (2006)
  49. 10.1002/mas.20072 / Mass Spectrom. Rev. / Processing and classification of protein mass spectra by Hilario (2006)
  50. Adaptation in Natural and Artificial Systems by Holland (1975)
  51. 10.1016/S0004-3702(00)00052-7 / Artif. Intell. / Feature subset selection by Bayesian networks based optimization by Inza (2000)
  52. 10.1016/j.artmed.2004.01.007 / Artif. Intell. Med. / Filter versus wrapper gene selection approaches in DNA microarray domains by Inza (2004)
  53. 10.1186/1472-6947-6-27 / BMC Med. Inform. Decis. Mak. / An assessment of recently published gene expression data analyses: reporting experimental design and statistical factors by Jafari (2006)
  54. 10.1038/nrg1768 / Nat. Rev. Genet. / Literature mining for the biologist: from information retrieval to biological discovery by Jensen (2006)
  55. 10.1186/1471-2105-5-81 / BMC Bioinformatics / Joint analysis of two microarray gene-expression data sets to select lung adenocarcinoma marker genes by Jiang (2004)
  56. 10.1186/1471-2105-6-148 / BMC Bioinformatics / Feature selection and classification for microarray data analysis: evolutionary methods for identifying predictive genes by Jirapech-Umpai (2005)
  57. Feature selection in proteomic pattern data with support vector machines by Jong (2004), p. 41
  58. 10.1093/bioinformatics/18.9.1167 / Bioinformatics / Identification of regulatory elements using a feature selection method by Keles (2002)
  59. 10.1186/1471-2105-7-411 / BMC Bioinformatics / miTarget: microRNA target gene prediction using a support vector machine by Kim (2006)
  60. 10.1007/978-94-009-9941-1_3 / Pattern Recognition and Signal Processing, Chapter Feature Set Search Algorithms by Kittler (1978)
  61. Tools with Artificial Intelligence / Data mining using MLC++: a machine learning library in C++ by Kohavi (1996), p. 234
  62. Proceedings of the Thirteenth International Conference on Machine Learning / Toward optimal feature selection by Koller (1996), p. 284
  63. 10.1038/85776 / Nat. Genet / Variation in the spice of life by Kruglyak (2001)
  64. 10.1093/bioinformatics/btl233 / Bioinformatics / BNTagger: improved tagging SNP selection using Bayesian networks by Lee (2006)
  65. 10.1093/bioinformatics/btg458 / Bioinformatics / CHOISS for selection on single nucleotide polymorphism markers on interval regularity by Lee (2004)
  66. 10.1016/j.csda.2004.03.017 / Comput. Stat. and Data Anal. / An extensive comparison of recent classification tools applied to microarray data by Lee (2005)
  67. 10.1093/bioinformatics/19.1.90 / Bioinformatics / Gene selection: a Bayesian variable selection approach by Lee (2003)
  68. 10.1093/bioinformatics/btk005 / Bioinformatics / EDGE: extraction and analysis of differential gene expression by Leek (2006)
  69. 10.1186/1471-2105-6-68 / BMC Bioinformatics / Feature selection and nearest centroid classification for protein mass spectrometry by Levner (2005)
  70. 10.1093/bioinformatics/17.12.1131 / Bioinformatics / Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method by Li (2001)
  71. 10.1093/bioinformatics/bth098 / Bioinformatics / Applications of the GA/KNN method to SELDI proteomics data by Li (2004)
  72. 10.1093/bioinformatics/bth267 / Bioinformatics / A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression by Li (2004)
  73. How many genes are needed for a discriminant microarray data analysis? by Li, p. 137
  74. Lecture Notes in Computer Science 3614 / Large-scale ensemble decision analysis of sib-pair IBD profiles for identification of the relevant molecular signatures for alcoholism by Li (2005), p. 1184
  75. 10.1086/425587 / Am. J. Hum. Genet. / Finding haplotype tagging SNPs by use of principal components analysis by Lin (2004)
  76. 10.1007/978-1-4615-5689-3 / Feature Selection for Knowledge Discovery and Data Mining by Liu (1998)
  77. Genome Inform. / A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns by Liu (2002), vol. 13, p. 51
  78. 10.3233/ISB-00132 / In Silico Biol. / Using amino acid patterns to accurately predict translation initiation sites by Liu (2004)
  79. 10.1186/1471-2105-5-110 / BMC Bioinformatics / Tests for finding complex patterns of differential expression in cancers: towards individualized medicine by Lyons-Weiler (2004)
  80. 10.1016/j.patcog.2006.07.010 / Pattern Recognit. / Selecting features in microarray classification using ROC curves by Mamitsuka (2006)
  81. 10.1093/bioinformatics/bti724 / Bioinformatics / Regularized ROC method for disease classification and biomarker selection with microarray data by Ma (2005)
  82. 10.1093/bioinformatics/btl602 / Bioinformatics / Prophet, a web-based tool for class prediction using microarray data by Medina (2007)
  83. 10.1093/bioinformatics/bti499 / Bioinformatics / Prediction error estimation: a comparison of resampling methods by Molinaro (2005)
  84. 10.1089/106652701300099074 / J. Comput. Biol. / On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data by Newton (2001)
  85. 10.1093/bioinformatics/19.1.37 / Bioinformatics / Genetic algorithms applied to multi-class prediction for the analysis of gene expression data by Ooi (2003)
  86. 10.1093/bioinformatics/btg167 / Bioinformatics / On the use of permutation in and the performance of a class of nonparametric methods to detect differential gene expression by Pan (2003)
  87. Pac. Symp. on Biocompu / A nonparametric scoring algorithm for identifying informative genes from microarray data by Park (2001), vol. 6, p. 52
  88. 10.1186/1471-2105-7-345 / BMC Bioinformatics / Individualized markers optimize class prediction of microarray data by Pavlidis (2006)
  89. 10.1373/49.4.533 / Clin. Chem. / Mass spectrometry-based diagnostic: the upcoming revolution in disease detection by Petricoin (2003)
  90. 10.1016/S0140-6736(02)07746-2 / The Lancet / Use of proteomics patterns in serum to identify ovarian cancer by Petricoin (2002)
  91. 10.1093/bioinformatics/btk013 / Bioinformatics / Multidimensional local false discovery rate for microarray studies by Ploner (2006)
  92. 10.1093/bioinformatics/bth160 / Bioinformatics / Improving false discovery rate estimation by Pounds (2004)
  93. 10.1002/pmic.200400857 / Proteomics / Mining mass-spectra for diagnosis and biomarker discovery of cerebral accidents by Prados (2004)
  94. 10.1093/bioinformatics/bti670 / Bioinformatics / Analysis of mass spectral serum profiles for biomarker selection by Ressom (2005)
  95. 10.1093/bioinformatics/btl678 / Bioinformatics / Peak selection from MALDI-TOF mass spectra using ant colony optimization by Ressom (2007)
  96. 10.1038/73432 / Nat. Genet. / Systematic variation in gene expression patterns in human cancer cell lines by Ross (2000)
  97. 10.1016/j.patcog.2005.11.001 / Pattern Recognit. / Incremental wrapper-based gene selection from microarray data for cancer classification by Ruiz (2006)
  98. 10.1186/1471-2105-5-64 / BMC Bioinformatics / Feature selection for splice site prediction: a new method using EDA-based feature ranking by Saeys (2004)
  99. 10.1093/bioinformatics/btl639 / Bioinformatics / In search of the small ones: improved prediction of short exons in vertebrates, plants, fungi, and protists by Saeys (2007)
  100. 10.1093/nar/26.2.544 / Nucleic Acids Res. / Microbial gene identification using interpolated markov models by Salzberg (1998)
  101. 10.1093/bioinformatics/bti436 / Bioinformatics / twilight; a Bioconductor package for estimating the local false discovery rate by Scheid (2005)
  102. 10.1016/j.artmed.2004.04.002 / Artif. Intell. Med. / Data mining and genetic algorithm based gene/SNP selection by Shah (2004)
  103. 10.1093/bioinformatics/btl532 / Bioinformatics / Combining functional and linkage disequilibrium information in the selection of tag snps by Sham (2007)
  104. 10.1016/j.jbi.2005.04.002 / J. Biomed. Inform. / A machine learning perspective on the development of clinical decision support systems utilizing mass spectra of blood samples by Shin (2006)
  105. 10.1142/S0218001488000145 / Int. J. Pattern Recogni. / On automatic feature selection by Siedelecky (1998)
  106. 10.1093/bioinformatics/btl407 / Bioinformatics / What should be expected from feature selection in small-sample settings by Sima (2006)
  107. 10.1093/bioinformatics/bti081 / Bioinformatics / Superior feature-set ranking for small samples using bolstered error estimation by Sima (2005)
  108. 10.1089/10665270360688219 / J. Comput. Biol. / Discriminative motifs by Sinha (2003)
  109. Prototype and feature selection by sampling and random mutation hill climbing algorithms by Skalak (1994), p. 293
  110. 10.2202/1544-6115.1027 / Stat. Appl. in Genet. and Mol. Biol. / Linear models and empirical Bayes methods for assessing differential expression in microarray experiments by Smyth (2004)
  111. 10.1093/bioinformatics/btg182 / Bioinformatics / Class prediction and discovery using gene microarray and proteomics mass spectroscopy data: curses, caveats, cautions by Somorjai (2003)
  112. 10.1093/bioinformatics/bti033 / Bioinformatics / A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis by Statnikov (2005)
  113. 10.1111/1467-9868.00346 / J. R. Stat. Soc. Ser. B / A direct approach to false discovery rates by Storey (2002)
  114. 10.1093/bioinformatics/btg179 / Bioinformatics / RankGene: identification of diagnostic genes based on expression data by Su (2003)
  115. 10.1093/bioinformatics/bth282 / Bioinformatics / Identification of DNA regulatory motifs using Bayesian variable selection by Tadesse (2004)
  116. 10.1101/gr.165101 / Genome Res. / An efficient and robust statistical modeling approach to discover differentially expressed genes using genomic expression profiles by Thomas (2001)
  117. 10.1093/bioinformatics/bth357 / Bioinformatics / Sample classification from protein mass spectrometry, by ‘peak probability contrast’ by Tibshirani (2004)
  118. 10.1093/bioinformatics/btl074 / Bioinformatics / GALGO: an R package for multivariate variable selection using genetic algorithms by Trevino (2006)
  119. 10.1093/bioinformatics/18.11.1454 / Bioinformatics / Nonparametric methods for identifying differentially expressed genes in microarray data by Troyanskaya (2002)
  120. 10.1073/pnas.091062498 / Proceedings of the National Academy of Sciences / Significance analysis of microarrays applied to ionizing radiation response by Tusher (2001)
  121. 10.1093/bioinformatics/btl214 / Bioinformatics / Novel unsupervised feature filtering of biological data by Varshavsky (2006)
  122. 10.1016/j.compbiolchem.2004.11.001 / Comput. Biol. Chem. / Gene selection from microarray data for cancer classification–a machine learning approach by Wang (2005)
  123. Oncol. Rep. / Tumor classification based on DNA copy number aberrations determined using SNP arrays by Wang (2006), vol. 5, p. 1057
  124. J. Mach. Learn. Res. / Use of the zero-norm with linear models and kernel methods by Weston (2003), vol. 3, p. 1439
  125. Data Mining: Practical Machine Learning Tools and Techniques by Witten (2005)
  126. 10.1093/bioinformatics/btg210 / Bioinformatics / Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data by Wu (2003)
  127. Feature selection for high-dimensional genomic microarray data by Xing (2001), p. 601
  128. 10.1101/gr.190001 / Genome Res. / Biomarker identification by feature wrappers by Xiong (2001)
  129. 10.1093/bioinformatics/bti108 / Bioinformatics / Identifying differentially expressed genes from microarray experiments via statistic synthesis by Yang (2005)
  130. 10.1016/S1535-6108(02)00032-6 / Cancer Cell / Classification, subtype discovery, and prediction of outcome in pediatric lymphoblastic leukemia by gene expression profiling by Yeoh (2002)
  131. 10.1186/gb-2003-4-12-r83 / Genome Biol. / Multiclass classification of microarray data with repeated measurements: application to cancer by Yeung (2003)
  132. 10.1093/bioinformatics/bti319 / Bioinformatics / Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data by Yeung (2005)
  133. 10.1093/bioinformatics/bti1030 / Bioinformatics / Bayesian neural network approaches to ovarian cancer identification from high-resolution mass spectrometry data by Yu (2005)
  134. 10.1093/bioinformatics/bti370 / Bioinformatics / Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data by Yu (2005)
  135. J. Mach. Learn. Res. / Efficient feature selection via analysis of relevance and redundancy by Yu (2004), vol. 5, p. 1205
  136. 10.1093/bioinformatics/18.5.689 / Bioinformatics / Support vector machines with selective kernel scaling for protein classification and identification of key amino acid positions by Zavaljevsky (2002)
  137. 10.1186/1471-2105-7-197 / BMC Bioinformatics / Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data by Zhang (2006)
Dates
Type When
Created Aug. 24, 2007, 8:30 p.m.
Deposited Jan. 20, 2025, 11:58 a.m.
Indexed Aug. 21, 2025, 2:16 p.m.
Issued Aug. 24, 2007
Published Aug. 24, 2007
Published Online Aug. 24, 2007
Published Print Oct. 1, 2007
Funders 0

None

@article{Saeys_2007, title={A review of feature selection techniques in bioinformatics}, volume={23}, ISSN={1367-4803}, url={http://dx.doi.org/10.1093/bioinformatics/btm344}, DOI={10.1093/bioinformatics/btm344}, number={19}, journal={Bioinformatics}, publisher={Oxford University Press (OUP)}, author={Saeys, Yvan and Inza, Iñaki and Larrañaga, Pedro}, year={2007}, month=aug, pages={2507–2517} }