Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou's PseKNC.

Journal: Journal of theoretical biology
Published Date:

Abstract

This study examines accurate and efficient computational method for identification of 5-methylcytosine sites in RNA modification. The occurrence of 5-methylcytosine (mC) plays a vital role in a number of biological processes. For better comprehension of the biological functions and mechanism it is necessary to recognize mC sites in RNA precisely. The laboratory techniques and procedures are available to identify mC sites in RNA, but these procedures require a lot of time and resources. This study develops a new computational method for extracting the features of RNA sequence. In this method, first the RNA sequence is encoded via composite feature vector, then, for the selection of discriminate features, the minimum-redundancy-maximum-relevance algorithm was used. Secondly, the classification method used has been based on a support vector machine by using jackknife cross validation test. The suggested method efficiently identifies mC sites from non- mC sites and the outcome of the suggested algorithm is 93.33% with sensitivity of 90.0 and specificity of 96.66 on bench mark datasets. The result exhibits that proposed algorithm shown significant identification performance compared to the existing computational techniques. This study extends the knowledge about the occurrence sites of RNA modification which paves the way for better comprehension of the biological uses and mechanism.

Authors

  • M Fazli Sabooh
    Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.
  • Nadeem Iqbal
    Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.
  • Mukhtaj Khan
    Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.
  • Muslim Khan
    Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.
  • H F Maqbool
    University of Engineering & Technology Lahore, Pakistan.