Identifying RNA 5-methylcytosine sites via pseudo nucleotide compositions.

Journal: Molecular bioSystems
Published Date:

Abstract

RNA 5-methylcytosine (mC) plays an important role in numerous biological processes. Accurate identification of the mC site is helpful for a better understanding of its biological functions. However, the drawbacks of the experimental methods available preclude progress towards the identification of the mC site. As an excellent complement to experimental techniques, computational methods will facilitate the identification of mC. In the present study, a support vector machine based-method is proposed to identify mC sites in Homo sapiens. In this method, RNA sequences are encoded using the pseudo dinucleotide composition in which three RNA physiochemical properties are incorporated. It was observed by the jackknife cross-validation that the overall success rate achieved by the proposed model is 90.42%. This result indicates that the proposed model holds the potential to become a useful tool for the identification of mC sites.

Authors

  • Pengmian Feng
    School of Public Health, North China University of Science and Technology, Tangshan, 063000, China.
  • Hui Ding
    Medical School, Huanghe Science & Technology University, Zhengzhou 450063, PR China.
  • Wei Chen
    Department of Urology, Zigong Fourth People's Hospital, Sichuan, China.
  • Hao Lin
    Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, China.