Abstract
Structural genomics has as its goal the provision of structural information for all possible ORF sequences through a combination of experimental and computational approaches. The access to genome sequences and cloning resources from an ever-widening array of organisms is driving high-throughput structural studies by the New York Structural Genomics Research Consortium. In this report, we outline the progress of the Consortium in establishing its pipeline for structural genomics, and some of the experimental and bioinformatics efforts leading to structural annotation of proteins. The Consortium has established a pipeline for structural biology studies, automated modeling of ORF sequences using solved (template) structures, and a novel high-throughput approach (metallomics) to examining the metal binding to purified protein targets. The Consortium has so far produced 493 purified proteins from >1077 expression vectors. A total of 95 have resulted in crystal structures, and 81 are deposited in the Protein Data Bank (PDB). Comparative modeling of these structures has generated >40,000 structural models. We also initiated a high-throughput metal analysis of the purified proteins; this has determined that 10%-15% of the targets contain a stoichiometric structural or catalytic transition metal atom. The progress of the structural genomics centers in the U.S. and around the world suggests that the goal of providing useful structural information on most all ORF domains will be realized. This projected resource will provide structural biology information important to understanding the function of most proteins of the cell.
References
58
Referenced
42
10.1093/nar/25.17.3389
10.1093/nar/gkh039
10.1126/science.1065659
10.1093/nar/30.1.17
10.1038/417141a
10.1093/nar/28.1.235
10.1093/nar/gkg095
{'key': '2021111810594756000_14.10b.2145.8', 'first-page': '591', 'volume': '44', 'year': '2003', 'journal-title': 'Methods Biochem. Anal.'}
/ Methods Biochem. Anal. (2003)10.1038/13783
10.1073/pnas.89.21.10041
10.1021/bi9605503
10.1110/ps.4570102
10.1038/nsmb0304-201
10.1093/nar/gkg543
-
Fiser, A., Sanchez, R., Melo, F., and Sali, A. 2001. Comparative protein structure modeling. In Computational biochemistry and biophysics (eds. M. Watanabe et al.), pp. 275-312. Marcel Decker, NY.
(
10.1201/9780203903827.pt3
) 10.1038/415141a
{'key': '2021111810594756000_14.10b.2145.17', 'first-page': '1663', 'volume': '299', 'year': '2003', 'journal-title': 'Science'}
/ Science (2003)10.1016/S0301-4622(03)00101-7
10.1021/ar0302235
/ Acct. Chem. Res. (2004)10.1107/S0909049503024166
10.1126/science.1925561
10.1016/S0968-0004(00)89105-7
10.1126/science.273.5275.595
10.1093/nar/gkg460
10.1073/pnas.142413399
10.1110/ps.10101
10.1074/jbc.270.23.13807
10.1006/jmbi.1995.0159
10.1016/S0969-2126(97)00260-8
10.1093/nar/30.1.255
10.1093/nar/gkh095
{'key': '2021111810594756000_14.10b.2145.32', 'first-page': '2', 'volume': '2002', 'year': '2001', 'journal-title': 'NSLS Activity Report'}
/ NSLS Activity Report (2001)10.1038/ng1140
10.1016/S1357-4310(95)91170-7
-
____. 100,000 protein structures for the biologist. Nat. Struct. Biol. 5: 1029-1032.
(
10.1038/4136
) {'key': '2021111810594756000_14.10b.2145.36', 'first-page': '216', 'volume': '422', 'year': '2003', 'journal-title': 'Nature Insight'}
/ Nature Insight (2003)10.1073/pnas.95.23.13597
10.1093/nar/28.1.250
10.1093/nar/29.14.2994
-
Shi, W., Ostrov, D., Gerchman, S., Kycia, H., Studier, W., Edstrom, W., Bresnick, A.R., Ehrlich, J., Blanchard, J., Almo, S.C., et al. 2003. High-throughput structural biology and proteomics. In Protein chips, biochips, and proteomics: The next phase of genomics discovery, Chapter 12, pp. 299-324. Marcel Decker, NY.
(
10.1201/9780203911129.ch12
) 10.1371/journal.pbio.0000045
10.1002/pro.5560010502
10.1007/s00216-003-2333-z
10.1016/S1472-9792(03)00051-9
/ Tuberculosis (Edinb) (2003)10.1016/S0968-0004(02)02169-2
10.1126/science.1091317
10.1038/88640
10.1038/nature01262
10.1093/nar/30.1.245
10.1093/nar/gkg068
10.1093/nar/28.1.10
10.1016/S1367-5931(02)00015-7
- www.nigms.nih.gov/psi; NIH Web site providing information and relevant links for the Protein Structure Initiative.
- http://targetdb.pdb.org; Web site operated by the Protein Databank to allow searching of targets from the structural genomics centers.
- www.nysgxrc.org; Web site operated by the NYSGRC. Its functions are to provide a public target list and progress as well as to allow consortium members to enter target data.
- http://salilab.org/modbase; MODBASE, a comprehensive database of comparative protein structure models.
- www-archbac.u-psud.fr/genomics/COG_Guess.html; Clusters of Orthologous Groups Database Query Page to perform similarity search in COG database. This provides a function and COG category guess for input sequence.
- http://salilab.org/modbase/models_nysgxrc.html; Summary and statistics of homology modeling results using the NYSGXRC PDB structures as templates.
Dates
Type | When |
---|---|
Created | 20 years, 10 months ago (Oct. 15, 2004, 1:42 p.m.) |
Deposited | 3 years, 9 months ago (Nov. 18, 2021, 2:31 p.m.) |
Indexed | 1 year, 2 months ago (June 6, 2024, 6:40 p.m.) |
Issued | 20 years, 10 months ago (Oct. 15, 2004) |
Published | 20 years, 10 months ago (Oct. 15, 2004) |
Published Online | 20 years, 10 months ago (Oct. 15, 2004) |
Published Print | 20 years, 10 months ago (Oct. 15, 2004) |
@article{Chance_2004, title={High-Throughput Computational and Experimental Techniques in Structural Genomics}, volume={14}, ISSN={1088-9051}, url={http://dx.doi.org/10.1101/gr.2537904}, DOI={10.1101/gr.2537904}, number={10b}, journal={Genome Research}, publisher={Cold Spring Harbor Laboratory}, author={Chance, Mark R. and Fiser, Andras and Sali, Andrej and Pieper, Ursula and Eswar, Narayanan and Xu, Guiping and Fajardo, J. Eduardo and Radhakannan, Thirumuruhan and Marinkovic, Nebojsa}, year={2004}, month=oct, pages={2145–2154} }