Abstract
A wealth of protein and DNA sequence data is being generated by genome projects and other sequencing efforts. A crucial barrier to deciphering these sequences and understanding the relations among them is the difficulty of detecting subtle local residue patterns common to multiple sequences. Such patterns frequently reflect similar molecular structures and biological properties. A mathematical definition of this "local multiple alignment" problem suitable for full computer automation has been used to develop a new and sensitive algorithm, based on the statistical method of iterative sampling. This algorithm finds an optimized local alignment model forNsequences inN-linear time, requiring only seconds on current workstations, and allows the simultaneous detection and optimization of multiple patterns and pattern repeats. The method is illustrated as applied to helix-turn-helix proteins, lipocalins, and prenyltransferases.
References
116
Referenced
1,171
- Aitchison J. Statistical Prediction Analysis (1972).
-
AKAIKE, H, NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION, IEEE TRANSACTIONS ON AUTOMATIC CONTROL 19: 716 (1974).
(
10.1109/TAC.1974.1100705
) / IEEE TRANSACTIONS ON AUTOMATIC CONTROL (1974) - AKRIGG, D, SERPENT - AN INFORMATION-STORAGE AND ANALYSIS RESOURCE FOR PROTEIN SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 295 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
- ALEXANDROV, N.N., LOCAL MULTIPLE ALIGNMENT BY CONSENSUS MATRIX, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 339 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
-
ALTSCHUL, S.F., WEIGHTS FOR DATA RELATED BY A TREE, JOURNAL OF MOLECULAR BIOLOGY 207: 647 (1989).
(
10.1016/0022-2836(89)90234-9
) / JOURNAL OF MOLECULAR BIOLOGY (1989) -
ALTSCHUL, S.F., AMINO-ACID SUBSTITUTION MATRICES FROM AN INFORMATION THEORETIC PERSPECTIVE, JOURNAL OF MOLECULAR BIOLOGY 219: 555 (1991).
(
10.1016/0022-2836(91)90193-A
) / JOURNAL OF MOLECULAR BIOLOGY (1991) - ALTSCHUL, S.F., SIGNIFICANCE OF NUCLEOTIDE-SEQUENCE ALIGNMENTS - A METHOD FOR RANDOM SEQUENCE PERMUTATION THAT PRESERVES DINUCLEOTIDE AND CODON USAGE, MOLECULAR BIOLOGY AND EVOLUTION 2: 526 (1985). / MOLECULAR BIOLOGY AND EVOLUTION (1985)
-
ALTSCHUL, S.F., TREES, STARS, AND MULTIPLE BIOLOGICAL SEQUENCE ALIGNMENT, SIAM JOURNAL ON APPLIED MATHEMATICS 49: 197 (1989).
(
10.1137/0149012
) / SIAM JOURNAL ON APPLIED MATHEMATICS (1989) -
BACON, D.J., MULTIPLE SEQUENCE ALIGNMENT, JOURNAL OF MOLECULAR BIOLOGY 191: 153 (1986).
(
10.1016/0022-2836(86)90252-4
) / JOURNAL OF MOLECULAR BIOLOGY (1986) -
BAINS, W, MULTAN - A PROGRAM TO ALIGN MULTIPLE DNA-SEQUENCES, NUCLEIC ACIDS RESEARCH 14: 159 (1986).
(
10.1093/nar/14.1.159
) / NUCLEIC ACIDS RESEARCH (1986) -
BARBIER, C.S., AMINO-ACID SUBSTITUTIONS IN THE CYTR REPRESSOR WHICH ALTER ITS CAPACITY TO REGULATE GENE-EXPRESSION, JOURNAL OF BACTERIOLOGY 174: 2881 (1992).
(
10.1128/jb.174.9.2881-2890.1992
) / JOURNAL OF BACTERIOLOGY (1992) -
BARTON, G.J., A STRATEGY FOR THE RAPID MULTIPLE ALIGNMENT OF PROTEIN SEQUENCES - CONFIDENCE LEVELS FROM TERTIARY STRUCTURE COMPARISONS, JOURNAL OF MOLECULAR BIOLOGY 198: 327 (1987).
(
10.1016/0022-2836(87)90316-0
) / JOURNAL OF MOLECULAR BIOLOGY (1987) -
BARTON, G.J., FLEXIBLE PROTEIN-SEQUENCE PATTERNS - A SENSITIVE METHOD TO DETECT WEAK STRUCTURAL SIMILARITIES, JOURNAL OF MOLECULAR BIOLOGY 212: 389 (1990).
(
10.1016/0022-2836(90)90133-7
) / JOURNAL OF MOLECULAR BIOLOGY (1990) -
BERENS, C, THE ROLE OF THE N-TERMINUS IN TET REPRESSOR FOR TET OPERATOR BINDING DETERMINED BY A MUTATIONAL ANALYSIS, JOURNAL OF BIOLOGICAL CHEMISTRY 267: 1945 (1992).
(
10.1016/S0021-9258(18)46038-3
) / JOURNAL OF BIOLOGICAL CHEMISTRY (1992) -
BERG, O.G., SELECTION OF DNA-BINDING SITES BY REGULATORY PROTEINS - STATISTICAL-MECHANICAL THEORY AND APPLICATION TO OPERATORS AND PROMOTERS, JOURNAL OF MOLECULAR BIOLOGY 193: 723 (1987).
(
10.1016/0022-2836(87)90354-8
) / JOURNAL OF MOLECULAR BIOLOGY (1987) - BERGER, M.P., A NOVEL RANDOMIZED ITERATIVE STRATEGY FOR ALIGNING MULTIPLE PROTEIN SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 7: 479 (1991). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1991)
-
BOCSKEI, Z, PHEROMONE BINDING TO 2 RODENT URINARY PROTEINS REVEALED BY X-RAY CRYSTALLOGRAPHY, NATURE 360: 186 (1992).
(
10.1038/360186a0
) / NATURE (1992) - BOGUSKI, M.S., ANALYSIS OF CONSERVED DOMAINS AND SEQUENCE MOTIFS IN CELLULAR REGULATORY PROTEINS AND LOCUS-CONTROL REGIONS USING NEW SOFTWARE TOOLS FOR MULTIPLE ALIGNMENT AND VISUALIZATION, NEW BIOLOGIST 4: 247 (1992). / NEW BIOLOGIST (1992)
- BOGUSKI, M.S., NOVEL REPETITIVE SEQUENCE MOTIFS IN THE ALPHA AND BETA SUBUNITS OF PRENYL-PROTEIN TRANSFERASES AND HOMOLOGY OF THE ALPHA SUBUNIT TO THE MAD2 GENE-PRODUCT OF YEAST, NEW BIOLOGIST 4: 408 (1992). / NEW BIOLOGIST (1992)
- Boguski, M. S., Protein Engineering: A Practical Approach: 57 (1992). / Protein Engineering: A Practical Approach (1992)
- BORK, P, AN ATPASE DOMAIN COMMON TO PROKARYOTIC CELL-CYCLE PROTEINS, SUGAR KINASES, ACTIN, AND HSP70 HEAT-SHOCK PROTEINS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 89: 7290 (1992). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1992)
-
BRENNAN, R.G., THE HELIX-TURN-HELIX DNA-BINDING MOTIF, JOURNAL OF BIOLOGICAL CHEMISTRY 264: 1903 (1989).
(
10.1016/S0021-9258(18)94115-3
) / JOURNAL OF BIOLOGICAL CHEMISTRY (1989) - Brown, .M, Proceedings of the First International Conference on Intelligent Systems for Molecular Biology: 47 (1993). / Proceedings of the First International Conference on Intelligent Systems for Molecular Biology (1993)
-
BRUNELLE, A, DETERMINING RESIDUE-BASE INTERACTIONS BETWEEN ARAC-PROTEIN AND ARAI DNA, JOURNAL OF MOLECULAR BIOLOGY 209: 607 (1989).
(
10.1016/0022-2836(89)90598-6
) / JOURNAL OF MOLECULAR BIOLOGY (1989) -
BRYANT, S.H., AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF, PROTEINS-STRUCTURE FUNCTION AND GENETICS 16: 92 (1993).
(
10.1002/prot.340160110
) / PROTEINS-STRUCTURE FUNCTION AND GENETICS (1993) -
CARDON, L.R., EXPECTATION MAXIMIZATION ALGORITHM FOR IDENTIFYING PROTEIN-BINDING SITES WITH VARIABLE LENGTHS FROM UNALIGNED DNA FRAGMENTS, JOURNAL OF MOLECULAR BIOLOGY 223: 159 (1992).
(
10.1016/0022-2836(92)90723-W
) / JOURNAL OF MOLECULAR BIOLOGY (1992) -
CARRILLO, H, THE MULTIPLE SEQUENCE ALIGNMENT PROBLEM IN BIOLOGY, SIAM JOURNAL ON APPLIED MATHEMATICS 48: 1073 (1988).
(
10.1137/0148063
) / SIAM JOURNAL ON APPLIED MATHEMATICS (1988) - CHAPPEY, C, MASH - AN INTERACTIVE PROGRAM FOR MULTIPLE ALIGNMENT AND CONSENSUS SEQUENCE CONSTRUCTION FOR BIOLOGICAL SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 7: 195 (1991). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1991)
-
CHOTHIA, C, PRINCIPLES THAT DETERMINE THE STRUCTURE OF PROTEINS, ANNUAL REVIEW OF BIOCHEMISTRY 53: 537 (1984).
(
10.1146/annurev.bi.53.070184.002541
) / ANNUAL REVIEW OF BIOCHEMISTRY (1984) -
CLARKE, S, PROTEIN ISOPRENYLATION AND METHYLATION AT CARBOXYL-TERMINAL CYSTEINE RESIDUES, ANNUAL REVIEW OF BIOCHEMISTRY 61: 355 (1992).
(
10.1146/annurev.bi.61.070192.002035
) / ANNUAL REVIEW OF BIOCHEMISTRY (1992) -
CONTRERAS, A, THE EFFECT ON THE FUNCTION OF THE TRANSCRIPTIONAL ACTIVATOR NTRC FROM KLEBSIELLA-PNEUMONIAE OF MUTATIONS IN THE DNA-RECOGNITION HELIX, NUCLEIC ACIDS RESEARCH 16: 4025 (1988).
(
10.1093/nar/16.9.4025
) / NUCLEIC ACIDS RESEARCH (1988) -
COWAN, S.W., CRYSTALLOGRAPHIC REFINEMENT OF HUMAN SERUM RETINOL BINDING-PROTEIN AT 2A RESOLUTION, PROTEINS-STRUCTURE FUNCTION AND GENETICS 8: 44 (1990).
(
10.1002/prot.340080108
) / PROTEINS-STRUCTURE FUNCTION AND GENETICS (1990) - Dayhoff, M. O., Atlas of Protein Sequence and Structure 5 3: 345 (1978). / Atlas of Protein Sequence and Structure (1978)
-
DEMPSTER, A. P., JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL 39: 1 (1977).
(
10.1111/j.2517-6161.1977.tb01600.x
) / JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL (1977) - DEPIEREUX, E, MATCH-BOX - A FUNDAMENTALLY NEW ALGORITHM FOR THE SIMULTANEOUS ALIGNMENT OF SEVERAL PROTEIN SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 501 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
-
DODD, I.B., IMPROVED DETECTION OF HELIX-TURN-HELIX DNA-BINDING MOTIFS IN PROTEIN SEQUENCES, NUCLEIC ACIDS RESEARCH 18: 5019 (1990).
(
10.1093/nar/18.17.5019
) / NUCLEIC ACIDS RESEARCH (1990) -
DODD, I.B., THE PREDICTION OF HELIX-TURN-HELIX DNA-BINDING REGIONS IN PROTEINS - A REPLY, PROTEIN ENGINEERING 2: 174 (1988).
(
10.1093/protein/2.3.174
) / PROTEIN ENGINEERING (1988) -
DOOLITTLE, R.F., RECONSTRUCTING HISTORY WITH AMINO-ACID-SEQUENCES, PROTEIN SCIENCE 1: 191 (1992).
(
10.1002/pro.5560010201
) / PROTEIN SCIENCE (1992) -
DRUMMOND, M, SEQUENCE AND DOMAIN RELATIONSHIPS OF NTRC AND NIFA FROM KLEBSIELLA-PNEUMONIAE - HOMOLOGIES TO OTHER REGULATORY PROTEINS, EMBO JOURNAL 5: 441 (1986).
(
10.1002/j.1460-2075.1986.tb04230.x
) / EMBO JOURNAL (1986) -
DRUMMOND, M.H., THE FUNCTION OF ISOLATED DOMAINS AND CHIMAERIC PROTEINS CONSTRUCTED FROM THE TRANSCRIPTIONAL ACTIVATORS NIFA AND NTRC OF KLEBSIELLA-PNEUMONIAE, MOLECULAR MICROBIOLOGY 4: 29 (1990).
(
10.1111/j.1365-2958.1990.tb02012.x
) / MOLECULAR MICROBIOLOGY (1990) -
FENG, D.F., ALIGNING AMINO-ACID SEQUENCES - COMPARISON OF COMMONLY USED METHODS, JOURNAL OF MOLECULAR EVOLUTION 21: 112 (1985).
(
10.1007/BF02100085
) / JOURNAL OF MOLECULAR EVOLUTION (1985) -
FENG, D.F., PROGRESSIVE SEQUENCE ALIGNMENT AS A PREREQUISITE TO CORRECT PHYLOGENETIC TREES, JOURNAL OF MOLECULAR EVOLUTION 25: 351 (1987).
(
10.1007/BF02603120
) / JOURNAL OF MOLECULAR EVOLUTION (1987) -
FLOWER, D.R., STRUCTURE AND SEQUENCE RELATIONSHIPS IN THE LIPOCALINS AND RELATED PROTEINS, PROTEIN SCIENCE 2: 753 (1993).
(
10.1002/pro.5560020507
) / PROTEIN SCIENCE (1993) -
GELFAND, A. E., JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION 85: 398 (1990).
(
10.1080/01621459.1990.10476213
) / JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (1990) - GEMAN, S, STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES, IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 6: 721 (1984). / IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (1984)
-
GOODMAN, L.A., EXPLORATORY LATENT STRUCTURE-ANALYSIS USING BOTH IDENTIFIABLE AND UNIDENTIFIABLE MODELS, BIOMETRIKA 61: 215 (1974).
(
10.1093/biomet/61.2.215
) / BIOMETRIKA (1974) - GRIBSKOV, M, PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 84: 4355 (1987). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1987)
-
GUSFIELD, D, EFFICIENT METHODS FOR MULTIPLE SEQUENCE ALIGNMENT WITH GUARANTEED ERROR-BOUNDS, BULLETIN OF MATHEMATICAL BIOLOGY 55: 141 (1993).
(
10.1007/BF02460299
) / BULLETIN OF MATHEMATICAL BIOLOGY (1993) -
HALL, P, JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL 51: 459 (1989).
(
10.1111/j.2517-6161.1989.tb01440.x
) / JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL (1989) -
Haussler D. CIS Technical Report UCSC-CRL-92-23 (1992).
(
10.1016/0378-1097(92)90536-W
) - HENIKOFF, S, PLAYING WITH BLOCKS - SOME PITFALLS OF FORCING MULTIPLE ALIGNMENTS, NEW BIOLOGIST 3: 1148 (1991). / NEW BIOLOGIST (1991)
-
HENIKOFF, S, AUTOMATED ASSEMBLY OF PROTEIN BLOCKS FOR DATABASE SEARCHING, NUCLEIC ACIDS RESEARCH 19: 6565 (1991).
(
10.1093/nar/19.23.6565
) / NUCLEIC ACIDS RESEARCH (1991) - HENIKOFF, S, AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 89: 10915 (1992). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1992)
- HENNEKE, C.M., A MULTIPLE SEQUENCE ALIGNMENT ALGORITHM FOR HOMOLOGOUS PROTEINS USING SECONDARY STRUCTURE INFORMATION AND OPTIONALLY KEYING ALIGNMENTS TO FUNCTIONALLY IMPORTANT SITES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 5: 141 (1989). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1989)
- HERTZ, G.Z., IDENTIFICATION OF CONSENSUS PATTERNS IN UNALIGNED DNA-SEQUENCES KNOWN TO BE FUNCTIONALLY RELATED, COMPUTER APPLICATIONS IN THE BIOSCIENCES 6: 81 (1990). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1990)
- HIGGINS, D.G., CLUSTAL-V - IMPROVED SOFTWARE FOR MULTIPLE SEQUENCE ALIGNMENT, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 189 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
-
HOLDEN, H.M., THE MOLECULAR-STRUCTURE OF INSECTICYANIN FROM THE TOBACCO HORNWORM MANDUCA-SEXTA L AT 2.6 A RESOLUTION, EMBO JOURNAL 6: 1565 (1987).
(
10.1002/j.1460-2075.1987.tb02401.x
) / EMBO JOURNAL (1987) -
HUBER, R, MOLECULAR-STRUCTURE OF THE BILIN BINDING-PROTEIN (BBP) FROM PIERIS-BRASSICAE AFTER REFINEMENT AT 2.0-A RESOLUTION, JOURNAL OF MOLECULAR BIOLOGY 198: 499 (1987).
(
10.1016/0022-2836(87)90296-8
) / JOURNAL OF MOLECULAR BIOLOGY (1987) -
JOHNSON, M.S., A METHOD FOR THE SIMULTANEOUS ALIGNMENT OF 3 OR MORE AMINO-ACID-SEQUENCES, JOURNAL OF MOLECULAR EVOLUTION 23: 267 (1986).
(
10.1007/BF02115583
) / JOURNAL OF MOLECULAR EVOLUTION (1986) 10.1126/science.220.4598.671
-
KOSTREWA, D, CRYSTAL-STRUCTURE OF THE FACTOR FOR INVERSION STIMULATION FIS AT 2.0 ANGSTROM RESOLUTION, JOURNAL OF MOLECULAR BIOLOGY 226: 209 (1992).
(
10.1016/0022-2836(92)90134-6
) / JOURNAL OF MOLECULAR BIOLOGY (1992) - LAMERICHS, RMJN, THE AMINO-TERMINAL DOMAIN OF LEXA REPRESSOR IS ALPHA-HELICAL BUT DIFFERS FROM CANONICAL HELIX-TURN-HELIX PROTEINS - A TWO-DIMENSIONAL H-1-NMR STUDY, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 86: 6863 (1989). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1989)
-
LAWRENCE, C.E., AN EXPECTATION MAXIMIZATION (EM) ALGORITHM FOR THE IDENTIFICATION AND CHARACTERIZATION OF COMMON SITES IN UNALIGNED BIOPOLYMER SEQUENCES, PROTEINS-STRUCTURE FUNCTION AND GENETICS 7: 41 (1990).
(
10.1002/prot.340070105
) / PROTEINS-STRUCTURE FUNCTION AND GENETICS (1990) -
LESK, A.M., HOW DIFFERENT AMINO-ACID-SEQUENCES DETERMINE SIMILAR PROTEIN STRUCTURES - STRUCTURE AND EVOLUTIONARY DYNAMICS OF THE GLOBINS, JOURNAL OF MOLECULAR BIOLOGY 136: 225 (1980).
(
10.1016/0022-2836(80)90373-3
) / JOURNAL OF MOLECULAR BIOLOGY (1980) -
LEUNG, M.Y., AN EFFICIENT ALGORITHM FOR IDENTIFYING MATCHES WITH ERRORS IN MULTIPLE LONG MOLECULAR SEQUENCES, JOURNAL OF MOLECULAR BIOLOGY 221: 1367 (1991).
(
10.1016/0022-2836(91)90938-3
) / JOURNAL OF MOLECULAR BIOLOGY (1991) - LIPMAN, D.J., A TOOL FOR MULTIPLE SEQUENCE ALIGNMENT, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 86: 4412 (1989). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1989)
- Little R. J. A. Statistical Analysis with Missing Data (1987).
- Liu J. Harvard University Department of Statistics Research Report R-426 (1992).
- LIU J.S. unpublished data.
-
METROPOLIS, N, EQUATION OF STATE CALCULATIONS BY FAST COMPUTING MACHINES, JOURNAL OF CHEMICAL PHYSICS 21: 1087 (1953).
(
10.1063/1.1699114
) / JOURNAL OF CHEMICAL PHYSICS (1953) -
MONACO, H.L., 3-DIMENSIONAL STRUCTURE AND ACTIVE-SITE OF 3 HYDROPHOBIC MOLECULE-BINDING PROTEINS WITH SIGNIFICANT AMINO-ACID-SEQUENCE SIMILARITY, BIOPOLYMERS 32: 457 (1992).
(
10.1002/bip.360320425
) / BIOPOLYMERS (1992) - MURATA, M, SIMULTANEOUS COMPARISON OF 3 PROTEIN SEQUENCES, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 82: 3073 (1985). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1985)
- NAGATA, A, HUMAN BRAIN PROSTAGLANDIN-D SYNTHASE HAS BEEN EVOLUTIONARILY DIFFERENTIATED FROM LIPOPHILIC-LIGAND CARRIER PROTEINS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 88: 4020 (1991). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1991)
-
NORTH, ACT, 3-DIMENSIONAL ARRANGEMENT OF CONSERVED AMINO-ACID RESIDUES IN A SUPERFAMILY OF SPECIFIC LIGAND-BINDING PROTEINS, INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES 11: 56 (1989).
(
10.1016/0141-8130(89)90041-X
) / INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES (1989) - Orchard, T., Proceedings of the Sixth Berkeley Symposium on Mathematics, Statistics and Probability 1: 697 (1972). / Proceedings of the Sixth Berkeley Symposium on Mathematics, Statistics and Probability (1972)
-
PABO, C.O., TRANSCRIPTION FACTORS - STRUCTURAL FAMILIES AND PRINCIPLES OF DNA RECOGNITION, ANNUAL REVIEW OF BIOCHEMISTRY 61: 1053 (1992).
(
10.1146/annurev.bi.61.070192.005201
) / ANNUAL REVIEW OF BIOCHEMISTRY (1992) -
PAPIZ, M.Z., THE STRUCTURE OF BETA-LACTOGLOBULIN AND ITS SIMILARITY TO PLASMA RETINOL-BINDING PROTEIN, NATURE 324: 383 (1986).
(
10.1038/324383a0
) / NATURE (1986) - PARRYSMITH, D.J., SOMAP - A NOVEL INTERACTIVE APPROACH TO MULTIPLE PROTEIN SEQUENCES ALIGNMENT, COMPUTER APPLICATIONS IN THE BIOSCIENCES 7: 233 (1991). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1991)
- PARRYSMITH, D.J., ADSP - A NEW PACKAGE FOR COMPUTATIONAL SEQUENCE-ANALYSIS, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 451 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
- PEITSCH, M.C., NEW BIOL 2: 197 (1990). / NEW BIOL (1990)
- POHL, F. M., NATURE-NEW BIOLOGY 234: 277 (1971). / NATURE-NEW BIOLOGY (1971)
-
POSFAI, J, PREDICTIVE MOTIFS DERIVED FROM CYTOSINE METHYLTRANSFERASES, NUCLEIC ACIDS RESEARCH 17: 2421 (1989).
(
10.1093/nar/17.7.2421
) / NUCLEIC ACIDS RESEARCH (1989) -
QUEEN, C, IMPROVEMENTS TO A PROGRAM FOR DNA ANALYSIS - A PROCEDURE TO FIND HOMOLOGIES AMONG MANY SEQUENCES, NUCLEIC ACIDS RESEARCH 10: 449 (1982).
(
10.1093/nar/10.1.449
) / NUCLEIC ACIDS RESEARCH (1982) 10.1016/S0065-3233(08)60520-3
-
ROLFES, R.J., ESCHERICHIA-COLI GENE PURR ENCODING A REPRESSOR PROTEIN FOR PURINE NUCLEOTIDE SYNTHESIS - CLONING, NUCLEOTIDE-SEQUENCE, AND INTERACTION WITH THE PURF OPERATOR, JOURNAL OF BIOLOGICAL CHEMISTRY 263: 19653 (1988).
(
10.1016/S0021-9258(19)77686-8
) / JOURNAL OF BIOLOGICAL CHEMISTRY (1988) - ROYTBERG, M.A., A SEARCH FOR COMMON PATTERNS IN MANY SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 8: 57 (1992). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1992)
-
RUSSELL, R.B., MULTIPLE PROTEIN-SEQUENCE ALIGNMENT FROM TERTIARY STRUCTURE COMPARISON - ASSIGNMENT OF GLOBAL AND RESIDUE CONFIDENCE LEVELS, PROTEINS-STRUCTURE FUNCTION AND GENETICS 14: 309 (1992).
(
10.1002/prot.340140216
) / PROTEINS-STRUCTURE FUNCTION AND GENETICS (1992) -
SALI, A, DEFINITION OF GENERAL TOPOLOGICAL EQUIVALENCE IN PROTEIN STRUCTURES - A PROCEDURE INVOLVING COMPARISON OF PROPERTIES AND RELATIONSHIPS THROUGH SIMULATED ANNEALING AND DYNAMIC-PROGRAMMING, JOURNAL OF MOLECULAR BIOLOGY 212: 403 (1990).
(
10.1016/0022-2836(90)90134-8
) / JOURNAL OF MOLECULAR BIOLOGY (1990) -
SANKOFF, D, MINIMAL MUTATION TREES OF SEQUENCES, SIAM JOURNAL ON APPLIED MATHEMATICS 28: 35 (1975).
(
10.1137/0128004
) / SIAM JOURNAL ON APPLIED MATHEMATICS (1975) -
SAWYER, L, PROTEIN-STRUCTURE - ONE FOLD AMONG MANY, NATURE 327: 659 (1987).
(
10.1038/327659a0
) / NATURE (1987) -
Schafer, W. R., Annual Review of Genetics 26: 209 (1992).
(
10.1146/annurev.ge.26.120192.001233
) / Annual Review of Genetics (1992) -
SCHELL, M.A., USE OF SATURATION MUTAGENESIS TO LOCALIZE PROBABLE FUNCTIONAL DOMAINS IN THE NAHR PROTEIN, A LYSR-TYPE TRANSCRIPTION ACTIVATOR, JOURNAL OF BIOLOGICAL CHEMISTRY 265: 3844 (1990).
(
10.1016/S0021-9258(19)39671-1
) / JOURNAL OF BIOLOGICAL CHEMISTRY (1990) -
SCHULER, G.D., A WORKBENCH FOR MULTIPLE ALIGNMENT CONSTRUCTION AND ANALYSIS, PROTEINS-STRUCTURE FUNCTION AND GENETICS 9: 180 (1991).
(
10.1002/prot.340090304
) / PROTEINS-STRUCTURE FUNCTION AND GENETICS (1991) - Schwartz, R. M., Atlas of Protein Sequence and Structure 5 3: 353 (1978). / Atlas of Protein Sequence and Structure (1978)
- SCHWARZ, G, ESTIMATING DIMENSION OF A MODEL, ANNALS OF STATISTICS 6: 461 (1978). / ANNALS OF STATISTICS (1978)
-
SHEWCHUK, L.M., TRANSCRIPTIONAL SWITCHING BY THE MERR PROTEIN - ACTIVATION AND REPRESSION MUTANTS IMPLICATE DISTINCT DNA AND MERCURY(II) BINDING DOMAINS, BIOCHEMISTRY 28: 2340 (1989).
(
10.1021/bi00431a053
) / BIOCHEMISTRY (1989) - SMITH, H.O., FINDING SEQUENCE MOTIFS IN GROUPS OF FUNCTIONALLY RELATED PROTEINS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 87: 826 (1990). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1990)
- SMITH, R.F., AUTOMATIC-GENERATION OF PRIMARY SEQUENCE PATTERNS FROM SETS OF RELATED PROTEIN SEQUENCES, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 87: 118 (1990). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1990)
-
SOBEL, E, A MULTIPLE SEQUENCE ALIGNMENT PROGRAM, NUCLEIC ACIDS RESEARCH 14: 363 (1986).
(
10.1093/nar/14.1.363
) / NUCLEIC ACIDS RESEARCH (1986) -
SPIRO, S, INTERCONVERSION OF THE DNA-BINDING SPECIFICITIES OF 2 RELATED TRANSCRIPTION REGULATORS, CRP AND FNR, MOLECULAR MICROBIOLOGY 4: 1831 (1990).
(
10.1111/j.1365-2958.1990.tb02031.x
) / MOLECULAR MICROBIOLOGY (1990) - STADEN, R, METHODS FOR DISCOVERING NOVEL MOTIFS IN NUCLEIC-ACID SEQUENCES, COMPUTER APPLICATIONS IN THE BIOSCIENCES 5: 293 (1989). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1989)
- States, D. J., Sequence Analysis Primer: 141 (1991). / Sequence Analysis Primer (1991)
- STORMO, G.D., IDENTIFYING PROTEIN-BINDING SITES FROM UNALIGNED DNA FRAGMENTS, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 86: 1183 (1989). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1989)
-
SUBBIAH, S, A METHOD FOR MULTIPLE SEQUENCE ALIGNMENT WITH GAPS, JOURNAL OF MOLECULAR BIOLOGY 209: 539 (1989).
(
10.1016/0022-2836(89)90592-5
) / JOURNAL OF MOLECULAR BIOLOGY (1989) -
TANNER, M.A., THE CALCULATION OF POSTERIOR DISTRIBUTIONS BY DATA AUGMENTATION, JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION 82: 528 (1987).
(
10.1080/01621459.1987.10478458
) / JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (1987) -
TAYLOR, W.R., HIERARCHICAL METHOD TO ALIGN LARGE NUMBERS OF BIOLOGICAL SEQUENCES, METHODS IN ENZYMOLOGY 183: 456 (1990).
(
10.1016/0076-6879(90)83031-4
) / METHODS IN ENZYMOLOGY (1990) -
TREISMAN, J, THE HOMEODOMAIN - A NEW FACE FOR THE HELIX-TURN-HELIX, BIOESSAYS 14: 145 (1992).
(
10.1002/bies.950140302
) / BIOESSAYS (1992) - VINGRON, M, A FAST AND SENSITIVE MULTIPLE SEQUENCE ALIGNMENT ALGORITHM, COMPUTER APPLICATIONS IN THE BIOSCIENCES 5: 115 (1989). / COMPUTER APPLICATIONS IN THE BIOSCIENCES (1989)
-
VINGRON, M, MOTIF RECOGNITION AND ALIGNMENT FOR MANY SEQUENCES BY COMPARISON OF DOT-MATRICES, JOURNAL OF MOLECULAR BIOLOGY 218: 33 (1991).
(
10.1016/0022-2836(91)90871-3
) / JOURNAL OF MOLECULAR BIOLOGY (1991) - VINGRON, M, WEIGHTING IN SEQUENCE SPACE - A COMPARISON OF METHODS IN TERMS OF GENERALIZED SEQUENCES, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 90: 8777 (1993). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1993)
-
WATERMAN, M.S., PATTERN-RECOGNITION IN SEVERAL SEQUENCES - CONSENSUS AND ALIGNMENT, BULLETIN OF MATHEMATICAL BIOLOGY 46: 515 (1984).
(
10.1016/S0092-8240(84)80056-7
) / BULLETIN OF MATHEMATICAL BIOLOGY (1984) -
WATERMAN, M.S., LINE GEOMETRIES FOR SEQUENCE COMPARISONS, BULLETIN OF MATHEMATICAL BIOLOGY 46: 567 (1984).
(
10.1007/BF02459504
) / BULLETIN OF MATHEMATICAL BIOLOGY (1984) -
WOOTTON, J.C., STATISTICS OF LOCAL COMPLEXITY IN AMINO-ACID-SEQUENCES AND SEQUENCE DATABASES, COMPUTERS & CHEMISTRY 17: 149 (1993).
(
10.1016/0097-8485(93)85006-X
) / COMPUTERS & CHEMISTRY (1993) - YUAN, H.S., THE MOLECULAR-STRUCTURE OF WILD-TYPE AND A MUTANT FIS PROTEIN - RELATIONSHIP BETWEEN MUTATIONAL CHANGES AND RECOMBINATIONAL ENHANCER FUNCTION OR DNA-BINDING, PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA 88: 9558 (1991). / PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA (1991)
-
YUDKIN, M.D., MUTATIONS THAT ALTER THE HELIX-TURN-HELIX REGION OF THE SPOLLAC PROTEIN - A BACILLUS-SUBTILIS SPORULATION-SPECIFIC SIGMA FACTOR, MOLECULAR MICROBIOLOGY 3: 257 (1989).
(
10.1111/j.1365-2958.1989.tb01815.x
) / MOLECULAR MICROBIOLOGY (1989) -
YUDKIN, M.D., THE PREDICTION OF HELIX-TURN-HELIX DNA-BINDING REGIONS IN PROTEINS, PROTEIN ENGINEERING 1: 371 (1987).
(
10.1093/protein/1.5.371
) / PROTEIN ENGINEERING (1987)
Dates
Type | When |
---|---|
Created | 18 years, 10 months ago (Oct. 5, 2006, 7:05 p.m.) |
Deposited | 7 months, 3 weeks ago (Jan. 11, 2025, 3:32 a.m.) |
Indexed | 3 weeks, 5 days ago (Aug. 5, 2025, 8:55 a.m.) |
Issued | 31 years, 10 months ago (Oct. 8, 1993) |
Published | 31 years, 10 months ago (Oct. 8, 1993) |
Published Print | 31 years, 10 months ago (Oct. 8, 1993) |
@article{Lawrence_1993, title={Detecting Subtle Sequence Signals: a Gibbs Sampling Strategy for Multiple Alignment}, volume={262}, ISSN={1095-9203}, url={http://dx.doi.org/10.1126/science.8211139}, DOI={10.1126/science.8211139}, number={5131}, journal={Science}, publisher={American Association for the Advancement of Science (AAAS)}, author={Lawrence, Charles E. and Altschul, Stephen F. and Boguski, Mark S. and Liu, Jun S. and Neuwald, Andrew F. and Wootton, John C.}, year={1993}, month=oct, pages={208–214} }