Identifying Protein Structural Classes Using MVP Algorithm

Full Text (PDF, 143KB), PP.8-12

Views: 0 Downloads: 0

Author(s)

Tong Wang 1,* Xiaoming Hu 1 Xiaoxia Cao 1

1. Institute of Computer and Information, Shanghai Second Polytechnic University, Shanghai, 201209, China

* Corresponding author.

DOI: https://doi.org/10.5815/ijwmt.2012.04.02

Received: 12 Apr. 2012 / Revised: 17 May 2012 / Accepted: 3 Jul. 2012 / Published: 15 Aug. 2012

Index Terms

Protein structural classes, MVP, sequence encoding scheme

Abstract

A new method for the prediction of protein structural classes is constructed based on MVP (Maximum variance projection) algorithm, which is a manifold learning-based data mining method. DC (Dipeptide Composition) and PseAA (Pseudo Amino Acid) are used as conditional attributes for the construction of decision system. A DR (Dimensionality Reduction) algorithm, the so-called MVP is introduced to reduce the decision system, which can be used to classify new objects. Experimental results thus obtained are quite encouraging, which indicate that the above method is used effectively to deal with this complicated problem of protein structural classes.

Cite This Paper

Tong Wang,Xiaoming Hu,Xiaoxia Cao,"Identifying Protein Structural Classes Using MVP Algorithm", IJWMT, vol.2, no.4, pp.8-12, 2012. DOI: 10.5815/ijwmt.2012. 04.02 

Reference

[1]Y.S. Ding, T.L. Zhang, Q. Gu, P.Y Zha, K.C. Chou. Using maximum entropy model to predict protein secondary structure with single sequence. Protein Pept Lett. vol. 16, pp.552-60, 2009.

[2]K.C. Chou. Review: progress in protein structural class prediction and its impact to bioinformatics and proteomics. Curr. Protein Pept. vol. 6, pp.423–436, 2005.

[3]T.H. Zhang, J. Yang, H. H. Wang, C.H. Du. Maximum variance projections for face recognition. Optical Engineering, vol. 46, pp.2007067206-1-067206-8, 2007.

[4]K.C. Chou. Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins, vol.43, pp.246-255, 2001.

[5]M.Reczko and H.Bohr. The DEF data base of sequence based protein fold class predictions. Nucleic Acids Res, vol. 22, pp.3616-3619,1994.

[6]Chou, K.C. A key driving force in determination of protein structural classes. Biochem. Biophys. Res. Commun. vol.264, pp.216–224, 1999.