Cross domains adversarial learning for Chinese named entity recognition for online medical consultation.

Journal: Journal of biomedical informatics
Published Date:

Abstract

Deep learning methods have been applied to Chinese named entity recognition for the online medical consultation. They require a large number of marked samples. However, no such database is available at present. This paper begins with constructing a larger labelled Chinese texts database for the online medical consultation. Second, a basic framework unit is proposed, which is pre-trained by the transfer learning from both Bidirectional language model and Mask language model trained on the larger unlabelled data. Finally, cross domains adversarial learning (CDAL) for Chinese named entity recognition is proposed to further improve the performance, which not only uses the pre-trained basic framework unit, but also uses the adversarial multi-task learning on both electronic medical record texts and online medical consultation texts. Experimental results validate the effectiveness of CDAL.

Authors

  • Guihua Wen
    School of Computer Science and Engineering, South China University of Technology, Guangzhou 510000, China. Electronic address: crghwen@scut.edu.cn.
  • Hehong Chen
    School of Computer Science & Engineering, South China University of Technology, Guangzhou, China.
  • Huihui Li
    School of Computer Science and Engineering, South China University of Technology, Guangzhou 510000, China. Electronic address: 29777562@qq.com.
  • Yang Hu
    Kweichow Moutai Co., Ltd, Renhuai, Guizhou 564501, China.
  • Yanghui Li
    School of Computer Science & Engineering, South China University of Technology, Guangzhou, China.
  • Changjun Wang
    Guangdong General Hospital, Guangzhou 510000, China. Electronic address: gzwchj@126.com.