Abstract
AbstractA major bottleneck in comparative modeling is the alignment quality; this is especially true for proteins whose distant relationships could be reliably recognized only by recent advances in fold recognition. The best algorithms excel in recognizing distant homologs but often produce incorrect alignments for over 50% of protein pairs in large fold‐prediction benchmarks. The alignments obtained by sequence–sequence or sequence–structure matching algorithms differ significantly from the structural alignments. To study this problem, we developed a simplified method to explicitly enumerate all possible alignments for a pair of proteins. This allowed us to estimate the number ofsignificantly differentalignments for a given scoring method that score better than the structural alignment. Using several examples of distantly related proteins, we show that for standard sequence–sequence alignment methods, the number of significantly different alignments is usually large, often about 1010alternatives. This distance decreases when the alignment method is improved, but the number is still too large for the brute force enumeration approach. More effective strategies were needed, so we evaluated and compared two well‐known approaches for searching the space of suboptimal alignments. We combined their best features and produced a hybrid method, which yielded alignments that surpassed the original alignments for about 50% of protein pairs with minimal computational effort.
References
43
Referenced
68
10.1093/nar/25.17.3389
10.1038/80734
10.1038/13783
{'key': 'e_1_2_7_5_1', 'volume-title': 'Fourth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction.', 'author': 'CASP4.', 'year': '2000'}
/ Fourth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction. by CASP4. (2000)10.1073/pnas.87.8.3240
10.1016/S0076-6879(97)77022-8
10.1002/pro.5560050711
10.1016/0022-2836(92)90693-E
10.1093/protein/6.8.801
10.1093/protein/12.1.31
10.1002/pro.5560011217
10.1007/s008940050087
10.1002/pro.5560070620
10.1110/ps.9.8.1487
10.1006/jmbi.1999.2583
10.1002/(SICI)1097-0134(1999)37:3 <30::AID-PROT6>3.0.CO;2-S
10.1107/S0567739478001680
10.1093/oxfordjournals.jbchem.a124869
10.1093/protein/9.2.127
10.1016/S0022-2836(05)80134-2
10.1089/cmb.1994.1.349
10.1016/0022-2836(70)90057-4
10.1016/S0969-2126(97)00260-8
10.1093/protein/5.2.121
{'key': 'e_1_2_7_26_1', 'first-page': '328', 'article-title': 'Multiple model approach—Dealing with alignment ambiguities in protein modeling', 'author': 'Pawlowski K.', 'year': '1997', 'journal-title': 'modeling'}
/ modeling / Multiple model approach—Dealing with alignment ambiguities in protein modeling by Pawlowski K. (1997)10.1006/jsbi.2001.4394
10.1073/pnas.85.8.2444
10.1110/ps.9.11.2181
10.1110/ps.9.2.232
/ Protein Sci. / Comparison of sequence profiles. Structural predictions with no structure information by Rychlewski L. (2000)10.1016/S0959-440X(97)80027-9
10.1016/0022-2836(91)90667-U
10.1093/protein/5.4.305
10.1093/protein/11.9.739
10.1002/prot.10006
10.1016/0022-2836(81)90087-5
10.1093/protein/9.1.27
10.1002/prot.10015
10.1093/protein/3.7.565
10.1006/jmbi.1995.0340
10.1007/978-1-4899-6846-3
10.1073/pnas.89.13.6090
10.1002/(SICI)1097-0134(1999)37:3 <22::AID-PROT5>3.0.CO;2-W
10.1016/0022-2836(91)80062-Y
Dates
Type | When |
---|---|
Created | 23 years, 1 month ago (July 28, 2002, 6:41 p.m.) |
Deposited | 1 year, 7 months ago (Jan. 6, 2024, 7:08 a.m.) |
Indexed | 1 month, 3 weeks ago (July 7, 2025, 8:04 a.m.) |
Issued | 23 years, 1 month ago (July 1, 2002) |
Published | 23 years, 1 month ago (July 1, 2002) |
Published Online | 16 years, 4 months ago (April 13, 2009) |
Published Print | 23 years, 1 month ago (July 1, 2002) |
@article{Jaroszewski_2002, title={In search for more accurate alignments in the twilight zone}, volume={11}, ISSN={1469-896X}, url={http://dx.doi.org/10.1110/ps.4820102}, DOI={10.1110/ps.4820102}, number={7}, journal={Protein Science}, publisher={Wiley}, author={Jaroszewski, Lukasz and Li, Weizhong and Godzik, Adam}, year={2002}, month=jul, pages={1702–1713} }