Abstract
A variety of nuclear localization signals (NLSs) are experimentally known although only one motif was available for database searches through PROSITE. We initially collected a set of 91 experimentally verified NLSs from the literature. Through iterated ‘in silico mutagenesis’ we then extended the set to 214 potential NLSs. This final set matched in 43% of all known nuclear proteins and in no known non‐nuclear protein. We estimated that >17% of all eukaryotic proteins may be imported into the nucleus. Finally, we found an overlap between the NLS and DNA‐binding region for 90% of the proteins for which both the NLS and DNA‐binding regions were known. Thus, evolution seemed to have used part of the existing DNA‐binding mechanism when compartmentalizing DNA‐binding proteins into the nucleus. However, only 56 of our 214 NLS motifs overlapped with DNA‐binding regions. These 56 NLSs enabled a de novo prediction of partial DNA‐binding regions for ∼800 proteins in human, fly, worm and yeast.
References
24
Referenced
563
10.1093/nar/27.1.49
10.1093/nar/28.1.235
10.1073/pnas.94.10.5055
{'key': 'e_1_2_5_5_1', 'first-page': '193', 'article-title': 'Nuclear localization signals (NLS)', 'volume': '3', 'author': 'Boulikas T.', 'year': '1993', 'journal-title': 'Crit. Rev. Eukaryot. Gene Expr.'}
/ Crit. Rev. Eukaryot. Gene Expr. / Nuclear localization signals (NLS) by Boulikas T. (1993)10.1002/jcb.240550106
10.1038/32100
10.1016/S0092-8674(00)81419-1
10.1002/1097-0134(20001001)41:1<98::AID-PROT120>3.0.CO;2-S
10.1093/bioinformatics/15.7.563
10.1093/nar/27.1.215
10.1002/(SICI)1097-4644(19980701)70:1<94::AID-JCB10>3.0.CO;2-B
10.1074/jbc.275.4.2647
{'key': 'e_1_2_5_14_1', 'first-page': '509', 'article-title': 'Efficient discovery of conserved patterns using a pattern graph', 'volume': '13', 'author': 'Jonassen I.', 'year': '1997', 'journal-title': 'Comp. Appl. Biol. Sci.'}
/ Comp. Appl. Biol. Sci. / Efficient discovery of conserved patterns using a pattern graph by Jonassen I. (1997)10.1093/nar/23.10.1647
- Liu J.andRost B.(2000)Analysing all proteins in entire genomes. CUBIC Columbia University Department of Biochemistry and Molecular Biophysics http://cubic.bioc.columbia.edu/genomes
10.1146/annurev.biochem.67.1.265
10.1038/380730a0
10.1016/S0014-5793(99)01446-5
10.1016/S0076-6879(96)66033-9
10.1093/protein/12.2.85
10.1016/S0968-0004(00)89080-5
10.1073/pnas.89.16.7442
10.1128/MCB.19.2.1210
10.1016/S0968-0004(98)01204-3
Dates
Type | When |
---|---|
Created | 23 years, 1 month ago (July 26, 2002, 6:48 p.m.) |
Deposited | 1 year, 8 months ago (Dec. 18, 2023, 3:56 p.m.) |
Indexed | 3 weeks, 3 days ago (Aug. 5, 2025, 9:11 a.m.) |
Issued | 24 years, 9 months ago (Nov. 1, 2000) |
Published | 24 years, 9 months ago (Nov. 1, 2000) |
Published Online | 24 years, 9 months ago (Nov. 1, 2000) |
Published Print | 24 years, 9 months ago (Nov. 1, 2000) |
@article{Cokol_2000, title={Finding nuclear localization signals}, volume={1}, ISSN={1469-3178}, url={http://dx.doi.org/10.1093/embo-reports/kvd092}, DOI={10.1093/embo-reports/kvd092}, number={5}, journal={EMBO reports}, publisher={Springer Science and Business Media LLC}, author={Cokol, Murat and Nair, Rajesh and Rost, Burkhard}, year={2000}, month=nov, pages={411–415} }