Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure.

Journal: Scientific reports
Published Date:

Abstract

Machine learning (ML) has been extensively involved in assistant disease diagnosis and prediction systems to emancipate the serious dependence on medical resources and improve healthcare quality. Moreover, with the booming of pre-training language models (PLMs), the application prospect and promotion potential of machine learning methods in the relevant field have been further inspired. PLMs have recently achieved tremendous success in diverse text processing tasks, whereas limited by the significant semantic gap between the pre-training corpus and the structured electronic health records (EHRs), PLMs cannot converge to anticipated disease diagnosis and prediction results. Unfortunately, establishing connections between PLMs and EHRs typically requires the extraction of curated predictor variables from structured EHR resources, which is tedious and labor-intensive, and even discards vast implicit information.In this work, we propose an Input Prompting and Discriminative language model with the Mixture-of-experts framework (IPDM) by promoting the model's capabilities to learn knowledge from heterogeneous information and facilitating the feature-aware ability of the model. Furthermore, leveraging the prompt-tuning mechanism, IPDM can inherit the impacts of the pre-training in downstream tasks exclusively through minor modifications. IPDM remarkably outperforms existing models, proved by experiments on one disease diagnosis task and two disease prediction tasks. Finally, experiments with few-feature and few-sample demonstrate that IPDM achieves significant stability and impressive performance in predicting chronic diseases with unclear early-onset characteristics or sudden diseases with insufficient data, which verifies the superiority of IPDM over existing mainstream methods, and reveals the IPDM can powerfully address the aforementioned challenges via establishing a stable and low-resource medical diagnostic system for various clinical scenarios.

Authors

  • Bowen Dong
    Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, Johor, Malaysia.
  • Zhuo Wang
    Sichuan Center for Disease Control and Prevention, Chengdu 610500, China.
  • Zhenyu Li
    Department of Urology, Peking University First Hospital, Institute of Urology, Peking University, National Urological Cancer Center, Beijing, China.
  • Zhichao Duan
    Department of Computer Science and Technology, Tsinghua University, Beijing, China.
  • Jiacheng Xu
    Department of Computer Science and Technology, Tsinghua University, Beijing, China.
  • Tengyu Pan
    Department of Computer Science and Technology, Tsinghua University, Beijing, China.
  • Rui Zhang
    Department of Cardiology, Zhongda Hospital, Medical School of Southeast University, Nanjing, China.
  • Ning Liu
    School of Public Health, Hangzhou Normal University, Hangzhou, China.
  • Xiuxing Li
    Key Laboratory of Intelligent Information Processing Institute of Computing Technology, Chinese Academy of Sciences (ICT/CAS), Beijing, China.
  • Jie Wang
  • Caiyan Liu
    Department of Neurology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Beijing, China.
  • Liling Dong
    Department of Neurology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Beijing, China.
  • Chenhui Mao
    Department of Neurology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Beijing, China.
  • Jing Gao
    Department of Gastroenterology 3, Hubei University of Medicine, Renmin Hospital, Shiyan, Hubei, China.
  • Jianyong Wang
    1 Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu, Sichuan, P. R. China.