Korean clinical entity recognition from diagnosis text using BERT.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: While clinical entity recognition mostly aims at electronic health records (EHRs), there are also the demands of dealing with the other type of text data. Automatic medical diagnosis is an example of new applications using a different data source. In this work, we are interested in extracting Korean clinical entities from a new medical dataset, which is completely different from EHRs. The dataset is collected from an online QA site for medical diagnosis. Bidirectional Encoder Representations from Transformers (BERT), which is one of the best language representation models, is used to extract the entities.

Authors

  • Young-Min Kim
    College of Pharmacy, Chonnam National University, Gwangju 61186, Republic of Korea. Electronic address: u9897854@jnu.ac.kr.
  • Tae-Hoon Lee
    Division of Interdisciplinary Industrial Studies, Hanyang University, 222 Wangsimni-ro, Seongdong-gu, Seoul, South Korea.