A semi-supervised deep learning approach for predicting the functional effects of genomic non-coding variations.

Journal: BMC bioinformatics
PMID:

Abstract

BACKGROUND: Understanding the functional effects of non-coding variants is important as they are often associated with gene-expression alteration and disease development. Over the past few years, many computational tools have been developed to predict their functional impact. However, the intrinsic difficulty in dealing with the scarcity of data leads to the necessity to further improve the algorithms. In this work, we propose a novel method, employing a semi-supervised deep-learning model with pseudo labels, which takes advantage of learning from both experimentally annotated and unannotated data.

Authors

  • Hao Jia
    School of Environmental Science and Engineering, Shaanxi University of Science and Technology, Xi'an 170021, China.
  • Sung-Joon Park
    Department of Computer Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
  • Kenta Nakai
    Department of Computational Biology and Medical Sciences, Graduate school of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba-ken, 277-8562, Japan. knakai@ims.u-tokyo.ac.jp.