A Novel Statistical Method for Thermostable Protein Discrimination

Download Full Text
Elham Nikookar, Kambiz Badie, Mehdi Sadeghi
Published Date:
July 05, 2012
Volume 2, Issue 4
1 - 5

protein mesophile thermophile, discrimination, thermostability, amino acid frequency, feature extraction
Elham Nikookar, Kambiz Badie, Mehdi Sadeghi, "A Novel Statistical Method for Thermostable Protein Discrimination". International Journal of Research in Computer Science, 2 (4): pp. 1-5, July 2012. doi:10.7815/ijorcs.24.2012.032 Other Formats


In this study, we used features that can be extracted from protein sequences to discriminate mesophilic, thermophilic and hyper-thermophilic proteins. Amino acid frequency, dipeptide amino acid frequency and physical-chemical features are used in this study. The effect of mentioned features on proposed discrimination algorithm was evaluated both separately and in combination. Statistical methods are used in the proposed algorithm. The results of implementing the algorithm on a dataset containing 239 mesophilic proteins, 69 thermophilic proteins and 59 hyper-thermophilic proteins show the effect of each bunch of features on the evaluation measures.

  1. C-I Branden, J.Tooze, “Introduction to protein structure”, 2nd edition, New York: Garland Pub., 1999.
  2. Todd T, Iosif V, “Discrimination of thermophilic and mesophilic protein”, Taylor and Vaisman BMC Structural Biology, vol. 10, 2010. doi: 10.1186/1472-6807-10-S1-S5
  3. Xingyu, W., Shouliang, C., Mingde, G., “General Biology”, Version 2, Higher Education Press, Beijing, 2005.
  4. Brock T, Freeze H, “Thermus aquaticus gen. n.and sp. n.., a Nonsporulating Extreme Thermophile”, J Bactriol, vol. 98, pp. 289-297, 1969.
  5. Jingru Xu, Yuehui Chen, “Discrimination of Protein Thermostability Based on a New Integrated Neural Network”, vol. 1, pp. 107-112, Springer, 2011.
  6. Kumar S, Nussinov R, “How do thermophilic proteins deal with heat?”, Cell Mol Life Sci, vol. 58, pp. 1216-1233, 2001. doi: 10.1007/PL00000935
  7. Thompson MJ, Eisenberg D, “Transprotemic evidence of a loop-deletion mechanism for enhancing protein thermostability”, J Mol Biol, vol. 290, pp. 595-604, 1999. doi: 10.1006/jmbi.1999.2889
  8. Haney PJ, Jonathan HB, Berald LB, “Thermal adaption analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species”, PNAS, vol. 96, pp. 3578-3583, 1999. doi: 10.1073/pnas.96.7.3578
  9. Vielle C, Zeikus GJ, “Hyperthermophilic enzymes: sources, uses, and molecular mechanisms for thermostability”, Microbiol Mol Biol Rev, vol. 65, pp. 1-43, 2001.
  10. Ding YR, Cai YJ, Zhang GX, “The influence of dipeptide composition on protein thermostability”, FEBS Lett, vol. 569, pp. 284-288, 2004. doi: 10.1016/j.febslet.2004.06.009
  11. Kumarevel TS, Gromiha MM, Ponnuswamy MN, “Structural class predication: an application of residue distribution along the sequence”, Biophys Chemist, vol. 88, pp. 81-101, 2000. doi: 10.1016/S0301-4622(00)00201-5
  12. Gromiha MM, Shandar A, Makiko S, “Application of residue distribution along the sequence for discriminating outer membrane proteins”, Comput Biol Chem, vol. 29, pp. 135-142, 2005. doi: 10.1016/j.compbiolchem.2005.02.006
  13. Seung P, Young J, “Protein Thermostability: Structure-Based Difference of Amino acid between Thermophilic and Mesophilic Proteins”, Elsevier Journal of Biotech, vol. 111, pp. 269-277, 2004.
  14. Michael G, Xavier S, “Discrimination and Classification of Mesophilic and Thermophilic Protein using Machine Learning Algorithms”, Willy InterSience, pp. 1274-1279, 2007.
  15. http://www.pdb.org. Available at June 24, 2012.
  16. Zahng G, Fang B, “Study on the Discrimination of Thermophilic and Mesophilic Proteins Based on Dipeptide Composition”, Chinese journal of biotechnology, vol. 22, pp. 293-298, 2006.
  17. Eshkin A, Ghafuri H, “Predication of relative solvent accessibility by support vector regression and best-first method”, Excli Journal, vol. 9 , pp. 29-38, 2010.
  18. Platzer A, Percpo P, “Characterization of protein-interaction networks in tumors”, BMC Bioinformatics, vol. 8, 2007. doi:10.1186/1471-2105-8-224
  19. Szilagyi A, Zavodszky P, “Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein subunits: results of comprehensive survey”, Elsevier, vol. 8, pp. 493-504, 2000.
  20. Chen Yu, Han, Kyungsook, “BSFINDER: Finding Binding Sites of HCV Proteins Using a Support Vector Machine”, Protein and Peptide Letters, vol. 16, pp. 373-382(10), 2009. doi:10.2174/092986609787848153

  • Nath, Abhigyan, and Karthikeyan Subbiah. "Inferring biological basis about psychrophilicity by interpreting the rules generated from the correctly classified input instances by a classifier." Computational biology and chemistry 53 (2014): 198-203.
  • Tek, Chand Bhalla. "Computational Analysis of Amino Acid Sequences in Relation to Thermostability of Interspecific Nitrile Degrading Enzyme (Amidase) from Various Thermophiles/Hyperthermophiles." (2012).