Abstract
Informative proteins are the proteins that play critical functional roles inside cells. They are the fundamental knowledge of translating bioinformatics into clinical practices. Many methods of identifying informative biomarkers have been developed which are heuristic and arbitrary, without considering the dynamics characteristics of biological processes. In this paper, we present a generative model of identifying the informative proteins by systematically analyzing the topological variety of dynamic protein-protein interaction networks (PPINs). In this model, the common representation of multiple PPINs is learned using a deep feature generation model, based on which the original PPINs are rebuilt and the reconstruction errors are analyzed to locate the informative proteins. Experiments were implemented on data of yeast cell cycles and different prostate cancer stages. We analyze the effectiveness of reconstruction by comparing different methods, and the ranking results of informative proteins were also compared with the results from the baseline methods. Our method is able to reveal the critical members in the dynamic progresses which can be further studied to testify the possibilities for biomarker research.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Overby CL, Tarczy-Hornoch P. Personalized medicine: challenges and opportunities for translational bioinformatics. Pers Med, 2013, 10: 453–462
Olson S, Beachy SH, Giammaria CF, Berger AC. Integrating Large-Scale Genomic Information Into Clinical Practice: Workshop Summary. Washington, DC: National Academies Press, 2012
Hood L, Flores M. A personal view on systems medicine and the emergence of proactive P4 medicine: predictive, preventive, personalized and participatory. New Biotechnol, 2012, 29: 613–624
Chang X, Xu T, Li Y, Wang K. Dynamic modular architecture of protein-protein interaction networks beyond the dichotomy of ‘date’ and ‘party’ hubs. Sci Rep-Uk, 2013, 3: 1691
Komurov K, White M. Revealing static and dynamic modular architecture of the eukaryotic protein interaction network. Mol Syst Biol, 2007, 3: 110
de Lichtenberg U, Jensen LJ, Brunak S, Bork P. Dynamic complex formation during the yeast cell cycle. Science, 2005, 307: 724–727
Han JDJ, Bertin N, Hao T, Goldberg DS, Berriz GF, Zhang LV, Dupuy D, Walhout AJM, Cusick ME, Roth FP, Vidal M. Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature, 2004, 430: 88–93
Taylor IW, Linding R, Warde-Farley D, Liu Y, Pesquita C, Faria D, Bull S, Pawson T, Morris Q, Wrana JL. Dynamic modularity in protein interaction networks predicts breast cancer outcome. Nat Biotech, 2009, 27: 199–204
Wang H, Li M, Wang J, Pan Y. A New Method for Identifying Essential Proteins Based on Edge Clustering Coefficient. In: Chen J, Wang J, Zelikovsky A, eds. Bioinformatics Research and Applications. Berlin Heidelberg: Springer, 2011. 87–98
Wang J, Peng W, Wu FX. Computational approaches to predicting essential proteins: a survey. Proteomics Clin Appl, 2013, 7: 181–192
He X, Zhang J. Why do hubs tend to be essential in protein networks? Plos Genet, 2006, 2: e88
Barabasi A-L, Oltvai ZN. Network biology: understanding the cell’s functional organization. Nat Rev Genet, 2004, 5: 101–113
Wang Z, Lucas FA, Qiu P, Liu Y. Improving the sensitivity of sample clustering by leveraging gene co-expression networks in variable selection. BMC Bioinformatics, 2014, 15: 153
Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn, 2002, 46: 389–422
Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response (vol 98, pg 5116, 2001). Proc Natl Acad Sci USA, 2001, 98: 10515–10534
Pan W. A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments. Bioinformatics, 2002, 18: 546–554
Handcock MS, Robins G, Snijders TA, Moody J, Besag J. Assessing degeneracy in statistical models of social networks. CSSS working paper 2003: 39
Ge L, Gao J, Yu X, Fan W, Zhang A. Estimating local information trustworthiness via multi-source joint matrix factorization. In: 12th IEEE International Conference on Data Mining, 2012. 876–881
The cyclebase database. http://www.cyclebase.org/
Tomlins SA, Mehra R, Rhodes DR, Cao X, Wang L, Dhanasekaran SM, Kalyana-Sundaram S, Wei JT, Rubin MA, Pienta KJ, Shah RB, Chinnaiyan AM. Integrative molecular concept modeling of prostate cancer progression. Nat Genet, 2007, 39: 41–51
Arnau V, Mars S, Marin I. Iterative cluster analysis of protein interaction data. Bioinformatics, 2005, 21: 364–378
Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science, 2006, 313: 504–507
Rumelhart DE, McClelland JL. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations. Cambridge, MA, USA, 1986
Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Comput, 2006, 18: 1527–1554
Yuille A. The convergence of contrastive divergences. Advances in Neural Information Processing Systems 17. Cambridge, MA, USA, 2005. 1593–1600
Bengio Y, Delalleau O. Justifying and generalizing contrastive divergence. Neural Comput, 2009, 21: 1601–1621
Sutskever I, Tieleman T. On the convergence properties of contrastive divergence. In: International Conference on Artificial Intelligence and Statistics, 2010. 789–795
Bengio Y. Learning deep architectures for AI. Found Trends Mach Learn, 2009, 2: 1–127
Zhang Y, Du N, Li K, Jia K, Zhang A. Co-regulated protein functional modules with varying activities in dynamic PPI networks. Tsinghua Sci Tech, 2013, 18: 530–540
Bhardwaj N, Lu H. Correlation between gene expression profiles and protein-protein interactions within and across genomes. Bioinformatics, 2005, 21: 2730–2738
Tu BP, Kudlicki A, Rowicka M, McKnight SL. Logic of the yeast metabolic cycle: temporal compartmentalization of cellular processes. Science, 2005, 310: 1152–1158
Pu S, Wong J, Turner B, Cho E, Wodak SJ. Up-to-date catalogues of yeast protein complexes. Nucleic Acids Res, 2009, 37: 825–831
Author information
Authors and Affiliations
Corresponding author
Additional information
This article is published with open access at springerlink.fh-diploma.de
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zhang, Y., Cheng, Y., Jia, K. et al. A generative model of identifying informative proteins from dynamic PPI networks. Sci. China Life Sci. 57, 1080–1089 (2014). https://doi.org/10.1007/s11427-014-4744-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11427-014-4744-9