A successful hybrid deep learning model aiming at promoter identification.

Journal: BMC bioinformatics
PMID:

Abstract

BACKGROUND: The zone adjacent to a transcription start site (TSS), namely, the promoter, is primarily involved in the process of DNA transcription initiation and regulation. As a result, proper promoter identification is critical for further understanding the mechanism of the networks controlling genomic regulation. A number of methodologies for the identification of promoters have been proposed. Nonetheless, due to the great heterogeneity existing in promoters, the results of these procedures are still unsatisfactory. In order to establish additional discriminative characteristics and properly recognize promoters, we developed the hybrid model for promoter identification (HMPI), a hybrid deep learning model that can characterize both the native sequences of promoters and the morphological outline of promoters at the same time. We developed the HMPI to combine a method called the PSFN (promoter sequence features network), which characterizes native promoter sequences and deduces sequence features, with a technique referred to as the DSPN (deep structural profiles network), which is specially structured to model the promoters in terms of their structural profile and to deduce their structural attributes.

Authors

  • Ying Wang
    Key Laboratory of Macromolecular Science of Shaanxi Province, School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xi'an, Shaanxi 710062, China.
  • Qinke Peng
    Systems Engineering Institute, Xi'an Jiaotong University, 28 Xianning West Road, Xi'an, Shaanxi 710049, China. Electronic address: qkpeng@xjtu.edu.cn.
  • Xu Mou
    Systems Engineering Institute, Xi'an Jiaotong University, Xi'an, China.
  • Xinyuan Wang
    Proteomics and Metabolomics Core Facilities, West China Hospital, Sichuan University, Chengdu 610041, China.
  • Haozhou Li
    Systems Engineering Institute, Xi'an Jiaotong University, Xi'an, China.
  • Tian Han
    Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90095, U.S.A. hantian@ucla.edu.
  • Zhao Sun
    Systems Engineering Institute, Xi'an Jiaotong University, Xi'an, China.
  • Xiao Wang
    Research Centre of Basic Integrative Medicine, School of Basic Medical Sciences, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China.