De-identification of clinical notes with pseudo-labeling using regular expression rules and pre-trained BERT.
Journal: BMC Medical Informatics and Decision Making
PMID: 39962485
Abstract
BACKGROUND: De-identification of clinical notes is essential for using the rich information in unstructured text data in medical research. However, little work has been done on removing personal information from clinical notes in Korea.