Crossref journal-article
Oxford University Press (OUP)
Genomics, Proteomics & Bioinformatics (286)
Abstract

Abstract The ability to predict the subcellular localization of a protein from its sequence is of great importance, as it provides information about the protein’s function. We present a computational tool, PredSL, which utilizes neural networks, Markov chains, profile hidden Markov models, and scoring matrices for the prediction of the subcellular localization of proteins in eukaryotic cells from the N-terminal amino acid sequence. It aims to classify proteins into five groups: chloroplast, thylakoid, mitochondrion, secretory pathway, and “other”. When tested in a five-fold cross-validation procedure, PredSL demonstrates 86.7% and 87.1% overall accuracy for the plant and non-plant datasets, respectively. Compared with TargetP, which is the most widely used method to date, and LumenP, the results of PredSL are comparable in most cases. When tested on the experimentally verified proteins of the Saccharomyces cerevisiae genome, PredSL performs comparably if not better than any available algorithm for the same task. Furthermore, PredSL is the only method capable for the prediction of these subcellular localizations that is available as a stand-alone application through the URL: http://bioinformatics.biol.uoa.gr/PredSL/.

Bibliography

Petsalaki, E. I., Bagos, P. G., Litou, Z. I., & Hamodrakas, S. J. (2006). PredSL: A Tool for the N-Terminal Sequence-Based Prediction of Protein Subcellular Localization. Genomics, Proteomics & Bioinformatics, 4(1), 48–55.

Authors 4
  1. Evangelia I. Petsalaki (first)
  2. Pantelis G. Bagos (additional)
  3. Zoi I. Litou (additional)
  4. Stavros J. Hamodrakas (additional)
References 30 Referenced 125
  1. 10.1002/1439-7633(20000818)1:2<86::AID-CBIC86>3.0.CO;2-A / Chembiochem / Protein targeting (Nobel lecture) by Blobel (2000)
  2. {'key': '2024051008221587200_bib2', 'first-page': '291', 'article-title': 'An overview on predicting the subcellular location of a protein', 'volume': '2', 'author': 'Feng', 'year': '2002', 'journal-title': 'In Silico Biol.'} / In Silico Biol. / An overview on predicting the subcellular location of a protein by Feng (2002)
  3. 10.1101/gr.96802 / Genome Res. / Predicting protein cellular localization using a domain projection method by Mott (2002)
  4. 10.1006/jmbi.1996.0804 / J. Mol. Biol. / Relation between amino acid composition and cellular location of proteins by Cedano (1997)
  5. 10.1006/bbrc.2000.3815 / Biochem. Biophys. Res. Commun. / Prediction of protein subcellular locations by incorporating quasi-sequence-order effect by Chou (2000)
  6. 10.1002/prot.10507 / Proteins / Better prediction of subcellular localization by combining evolutionary and structural information by Nair (2003)
  7. 10.1006/jmbi.2000.3903 / J. Mol. Biol. / Predicting subcellular localization of proteins based on their N-terminal amino acid sequence by Emanuelsson (2000)
  8. 10.1002/(SICI)1097-0134(19990101)34:1<137::AID-PROT11>3.0.CO;2-O / Proteins / Prediction of membrane protein types and subcellular locations by Chou (1999)
  9. 10.1093/nar/gkh378 / Nucleic Acids Res. / PrediSi: prediction of signal peptides and their cleavage positions by Hiller (2004)
  10. 10.1093/nar/gkh485 / Nucleic Acids Res. / Proteome Analyst: custom predictions with explanations in a web-based tool for high-throughput proteome annotations by Szafron (2004)
  11. 10.1093/bioinformatics/btg222 / Bioinformatics / Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs by Park (2003)
  12. 10.1093/bioinformatics/19.2.307 / Bioinformatics / A profile hidden Markov model for signal peptides generated by HMMER by Zhang (2003)
  13. 10.1093/bioinformatics/bth171 / Bioinformatics / MITOPRED: a genome-scale method for prediction of nucleus-encoded mitochondrial proteins by Guda (2004)
  14. {'key': '2024051008221587200_bib14', 'first-page': '122', 'article-title': 'Prediction of signal peptides and signal anchors by a hidden Markov model', 'volume': '6', 'author': 'Nielsen', 'year': '1998', 'journal-title': 'Proc. Int. Conf. Intell. Syst. Mol. Biol.'} / Proc. Int. Conf. Intell. Syst. Mol. Biol. / Prediction of signal peptides and signal anchors by a hidden Markov model by Nielsen (1998)
  15. 10.1093/bioinformatics/bti372 / Bioinformatics / Prediction of subcellular localization using sequence-biased recurrent networks by Boden (2005)
  16. 10.1142/S0129065797000537 / Int. J. Neural Syst. / A neural network method for identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites by Nielsen (1997)
  17. 10.1110/ps.8.5.978 / Protein Sci. / ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites by Emanuelsson (1999)
  18. 10.1110/ps.0306003 / Protein Sci. / LumenP-a neural network predictor for protein localization in the thylakoid lumen by Westerlund (2003)
  19. 10.1093/bioinformatics/18.1.77 / Bioinformatics / Tolerating some redundancy significantly speeds up clustering of large protein databases by Li (2002)
  20. 10.1038/nature02026 / Nature / Global analysis of protein localization in budding yeast by Huh (2003)
  21. 10.1093/bioinformatics/18.2.298 / Bioinformatics / Extensive feature detection of N-terminal protein sorting signals by Bannai (2002)
  22. 10.1101/gad.970902 / Genes Dev. / Subcellular localization of the yeast proteome by Kumar (2002)
  23. {'key': '2024051008221587200_bib23', 'article-title': 'Prediction of mitochondrial proteins using support vector machine and hidden Markov model', 'author': 'Kumar', 'year': '2005', 'journal-title': 'J. Biol. Chem.'} / J. Biol. Chem. / Prediction of mitochondrial proteins using support vector machine and hidden Markov model by Kumar (2005)
  24. 10.1104/pp.104.040717 / Plant Physiol. / Analysis of curated and predicted plastid subproteomes of Arabidopsis. Subcellular compartmentalization leads to distinctive proteome properties by Sun (2004)
  25. 10.1186/1471-2164-5-40 / BMC Genomics / A chloroplast-localized vesicular transport system: a bioinformatics approach by Andersson (2004)
  26. 10.1093/nar/gki070 / Nucleic Acids Res. / The universal protein resource (UniProt) by Bairoch (2005)
  27. {'article-title': 'Faster-learning variations on back-propagation: an empirical study', 'year': '1988', 'author': 'Fahlman', 'key': '2024051008221587200_bib27'} / Faster-learning variations on back-propagation: an empirical study by Fahlman (1988)
  28. 10.1017/CBO9780511790492 / Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids by Durbin (1998)
  29. 10.1093/bioinformatics/14.9.755 / Bioinformatics / Profile hidden Markov models by Eddy (1998)
  30. 10.1093/nar/22.22.4673 / Nucleic Acids Res. / CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice by Thompson (1994)
Dates
Type When
Created 19 years, 4 months ago (April 19, 2006, 1:45 p.m.)
Deposited 1 year, 3 months ago (May 10, 2024, 4:22 a.m.)
Indexed 2 months, 2 weeks ago (June 9, 2025, 4:44 p.m.)
Issued 19 years, 5 months ago (March 1, 2006)
Published 19 years, 5 months ago (March 1, 2006)
Published Online 19 years, 4 months ago (April 18, 2006)
Published Print 19 years, 5 months ago (March 1, 2006)
Funders 1
  1. thylakoid targeting peptide prediction

@article{Petsalaki_2006, title={PredSL: A Tool for the N-Terminal Sequence-Based Prediction of Protein Subcellular Localization}, volume={4}, ISSN={2210-3244}, url={http://dx.doi.org/10.1016/s1672-0229(06)60016-8}, DOI={10.1016/s1672-0229(06)60016-8}, number={1}, journal={Genomics, Proteomics &amp; Bioinformatics}, publisher={Oxford University Press (OUP)}, author={Petsalaki, Evangelia I. and Bagos, Pantelis G. and Litou, Zoi I. and Hamodrakas, Stavros J.}, year={2006}, month=mar, pages={48–55} }