Using Natural Language Processing to Predict Risk in Electronic Health Records.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Clinical narratives recording behaviours and emotions of patients are available from EHRs in a forensic psychiatric centre located in Tasmania. This rich data has not been used in risk prediction. Prior work demonstrates natural language processing can be used to identify patient symptoms in these free-text records and can then be used to predict risk. Four dictionaries containing descriptive words of harm were created using the Diagnostic and Statistical Manual of Mental Disorders, the Unified Medical Language System repository, English negative and positive sentiment words, and high-frequency words from the Corpus of Contemporary American English. However, a model based only on these keywords is limited in predictive power. In this study, we introduce an improved NLP approach with a social interaction component to extract additional information about the behavioural and emotional state of patients. These social interactions are subsequently used in a machine-learning model to enhance risk prediction performance.

Authors

  • Duy Van Le
    School of ICT, University of Tasmania, Australia.
  • James Montgomery
    School of Technology, Environments and Design, College of Sciences and Engineering, University of Tasmania, Private Bag 87, Hobart 7001, TAS, Australia.
  • Kenneth Kirkby
    School of Medicine, University of Tasmania, Australia.
  • Joel Scanlan
    School of Technology, Environments and Design, College of Sciences and Engineering, University of Tasmania, Private Bag 87, Hobart 7001, TAS, Australia.