Abstract
The rapid increase in genomic information requires new techniques to infer protein function and predict protein–protein interactions. Bioinformatics identifies modular signaling domains within protein sequences with a high degree of accuracy. In contrast, little success has been achieved in predicting short linear sequence motifs within proteins targeted by these domains to form complex signaling networks. Here we describe a peptide library-based searching algorithm, accessible over the World Wide Web, that identifies sequence motifs likely to bind to specific protein domains such as 14-3-3, SH2, and SH3 domains, or likely to be phosphorylated by specific protein kinases such as Src and AKT. Predictions from database searches for proteins containing motifs matching two different domains in a common signaling pathway provides a much higher success rate. This technology facilitates prediction of cell signaling networks within proteomes, and could aid in the identification of drug targets for the treatment of human diseases.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Rent or buy this article
Prices vary by article type
from$1.95
to$39.95
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Pawson, T. Protein modules and signalling networks. Nature 373, 573–580 (1995).
Luthy, R., Xenarios. I. & Bucher, P. Improving the sensitivity of the sequence profile method. Protein Sci. 3, 139–146 (1994).
Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997).
Ponting, C.P., Schultz, J., Milpetz, F. & Bork, P. SMART: identification and annotation of domains from signalling and extracellular protein sequences. Nucleic Acids Res. 27, 229–232 (1999).
Bairoch, A. & Apweiler, R. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28, 45–48 (2000).
Songyang Z. et al. SH2 domains recognize specific phosphopeptide sequences. Cell 72, 767–778 (1993).
Songyang Z. et al. Recognition of unique carboxyl-terminal motifs by distinct PDZ domains. Science 275, 73–77 (1997).
Yaffe, M.B. et al. The structural basis for 14-3-3:phosphopeptide binding specificity. Cell 91, 961–971 (1997).
Bateman, A. et al. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res. 27, 260–262 (1999).
Emini, E.A., Hughes, J.V., Perlow, D.S. & Boger, J. Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. J. Virol. 55, 836–839 (1985).
Janin, J., Wodak, S., Levitt, M. & Maigret, M. Conformation of amino acid side chains in proteins. J. Mol. Biol. 125, 357–386 (1978).
Downward, J., Waterfield, M.D. & Parker, P.J. Autophosphorylation and protein kinase C phosphorylation of the epidermal growth factor receptor. Effect on tyrosine kinase activity and ligand binding affinity. J. Biol. Chem. 260, 14538–14546 (1985).
Hunter, T., Ling, N. & Cooper, J.A. Protein kinase C phosphorylation of the EGF receptor at a threonine residue close to the cytoplasmic face of the plasma membrane. Nature 311, 480–483 (1984).
Morrison, P., Takishima, K. & Rosner, M.R. Role of threonine residues in regulation of the epidermal growth factor receptor by protein kinase C and mitogen-activated protein kinase. J. Biol. Chem. 268, 15536–15543 (1993).
Batzer, A.G., Rotin, D., Urena, J.M., Skolnik, E.Y. & Schlessinger, J. Hierarchy of binding sites for Grb2 and Shc on the epidermal growth factor receptor. Mol. Cell Biol. 14, 5192–5201 (1994).
Batzer, A.G., Blaikie, P., Nelson, K., Schlessinger, J. & Margolis, B. The phosphotyrosine interaction domain of Shc binds an LXNPXY motif on the epidermal growth factor receptor. Mol. Cell Biol. 15, 4403–4409 (1995).
Rotin, D. et al. SH2 domains prevent tyrosine dephosphorylation of the EGF receptor: identification of Tyr992 as the high-affinity binding site for SH2 domains of phospholipase C gamma. EMBO J. 11, 559–567 (1992).
Rebhan, M., Chalifa-Caspi, V. & Prilusky, J. & Lancet, D. GeneCards: encyclopedia for genes, proteins and diseases. (Weizmann Institute of Science, Bioinformatics Unit and Genome Center, Rehovot, Israel; 1997).
Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Obata, T. et al. Use of peptide and protein library screening to define optimal substrate motifs for AKT/PKB. J. Biol. Chem. 275, 36108–36115 (2000).
Bairoch, A. PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res. 20 Suppl., 2013–2018 (1992).
Brunet, A. et al. Akt promotes cell survival by phosphorylating and inhibiting a Forkhead transcription factor. Cell 96, 857–868 (1999).
Datta, S.R. et al. Akt phosphorylation of BAD couples survival signals to the cell-intrinsic death machinery. Cell 91, 231–241 (1997).
Xu, W., Harrison, S.C. & Eck, M.J. Three-dimensional structure of the tyrosine kinase c-Src. Nature 385, 595–602 (1997).
Kreegipuu, A., Blom, N. & Brunak, S. PhosphoBase, a database of phosphorylation sites: release 2.0. Nucleic Acids Res. 27, 237–239 (1999).
Overbeek, R., Fonstein, M., D'Souza, M., Pusch, G.D. & Maltsev, N. The use of gene clusters to infer functional coupling. Proc. Natl. Acad. Sci. USA 96, 2896–2901 (1999).
Eisen, M.B., Spellman, P.T., Brown, P.O. & Botstein, D. Cluster analysis and display of genome-wide expression patterns. Proc. Natl. Acad. Sci. USA 95, 14863–14868 (1998).
Enright, A.J., Iliopoulos, I., Kyrpides, N.C. & Ouzounis, C.A. Protein interaction maps for complete genomes based on gene fusion events. Nature 402, 86–90 (1999).
Marcotte, E.M., Pellegrini, M., Thompson, M.J., Yeates, T.O. & Eisenberg, D. A combined algorithm for genome-wide prediction of protein function. Nature 402, 83–86 (1999).
Songyang, Z. & Cantley, L.C. The use of peptide library for the determination of kinase peptide substrates. Methods Mol. Biol. 87, 87–98 (1998).
Yaffe, M.B. & Cantley, L.C. Mapping specificity determinants for protein–protein association using protein fusions and random peptide libraries. Methods Enzymol. 328, 157–170 (2000).
Acknowledgements
We thank members of the Division of Signal Transduction for beta-testing the software. This work was supported by National Institutes of Health grants HL03601 (M.B.Y.), GM59281 (M.B.Y.), GM56203 (L.C.C.), a Burroughs-Wellcome Career Development Award (M.B.Y.), a Merck Genome Research Institute Award (L.C.C./M.B.Y.), a Human Frontier Long Term Fellowship (S.V.), and Telethon 60% and 40% grants (S.V.).
Author information
Authors and Affiliations
Corresponding author
Supplementary information
Rights and permissions
About this article
Cite this article
Yaffe, M., Leparc, G., Lai, J. et al. A motif-based profile scanning approach for genome-wide prediction of signaling pathways. Nat Biotechnol 19, 348–353 (2001). https://doi.org/10.1038/86737
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1038/86737
This article is cited by
-
TTC3-Mediated Protein Quality Control, A Potential Mechanism for Cognitive Impairment
Cellular and Molecular Neurobiology (2022)
-
Non-cooperative 4E-BP2 folding with exchange between eIF4E-binding and binding-incompatible states tunes cap-dependent translation inhibition
Nature Communications (2020)
-
Kinase inhibition profiles as a tool to identify kinases for specific phosphorylation sites
Nature Communications (2020)
-
Prevalence and significance of the commonest phosphorylated motifs in the human proteome: a global analysis
Cellular and Molecular Life Sciences (2020)
-
The tyrosine 73 and serine 83 dephosphorylation of H1N1 swine influenza virus NS1 protein attenuates virus replication and induces high levels of beta interferon
Virology Journal (2019)