DeepOCR: A multi-species deep-learning framework for accurate identification of open chromatin regions in livestock.

Journal: Computational biology and chemistry
PMID:

Abstract

A wealth of experimental evidence has suggested that open chromatin regions (OCRs) are involved in many critical biological activities, such as DNA replication, enhancer activity, and gene transcription. Accurately identifying OCRs in livestock species can provide critical insights into the distribution and characteristics of OCRs for disease treatment in livestock, thereby improving animal welfare. However, most current machine-learning methods for OCR prediction were originally designed for a limited number of model organisms, such as humans and some model organisms, and thus their performance on non-model organisms, specifically livestock, is often unsatisfactory. To bridge this gap, we propose DeepOCR, a lightweight depth-separable residual network model for predicting OCRs in livestock, including chicken, cattle, and sheep. DeepOCR integrates a single convolution layer and two improved residue structure blocks to extract and learn important features from the input DNA sequences. A fully connected layer was also employed to further process the extracted features and improve the robustness of the entire network. Our benchmarking experiments demonstrated superior prediction performance of DeepOCR compared to state-of-the-art approaches on testing datasets of the three species. The source code of DeepOCR is freely available for academic purposes at https://github.com/jasonzhao371/DeepOCR/. We anticipate DeepOCR servers as a practical and reliable computational tool for OCR-related studies in livestock species.

Authors

  • Liangwei Zhao
    College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Ran Hao
    Department of Sciences and Education, Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China.
  • Ziyi Chai
    College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Weiwei Fu
    University of Science and Technology of China, Hefei, 230000.
  • Wei Yang
    Key Laboratory of Structure-Based Drug Design and Discovery (Shenyang Pharmaceutical University), Ministry of Education, School of Traditional Chinese Materia Medica, Shenyang Pharmaceutical University, Wenhua Road 103, Shenyang 110016, PR China. Electronic address: 421063202@qq.com.
  • Chen Li
    School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, China.
  • Quanzhong Liu
    College of Information Engineering, Northwest A&F University, Yangling 712100, China.
  • Yu Jiang
    School of Pharmacy and Food Engineering, Wuyi University, Jiangmen, China.