Developing Robust Clinical Text Deep Learning Models - A "Painless" Approach.

Journal: Studies in health technology and informatics
PMID:

Abstract

The success of deep learning in natural language processing relies on ample labelled training data. However, models in the health domain often face data inadequacy due to the high cost and difficulty of acquiring training data. Developing such models thus requires robustness and performance on new data. A generalised incremental multiphase framework is proposed for developing robust and performant clinical text deep learning classifiers. It incorporates incremental multiphases for training data size assessments, cross-validation setup to avoid test data bias, and robustness testing through inter/intra-model significance analysis. The framework's effectiveness and generalisation were confirmed by the task of identifying patients presenting in 'pain' to the emergency department.

Authors

  • Yutong Wu
    The Australian e-Health Research Centre, CSIRO, Brisbane, Australia.
  • James A Hughes
    School of Nursing, Centre for Healthcare Transformation, Queensland University of Technology, Brisbane, Australia.
  • Anna-Lisa Lyrstedt
    Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia.
  • Sarah Hazelwood
    Emergency Department, The Prince Charles Hospital, Brisbane, Australia.
  • Nathan J Brown
    Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia.
  • Lee Jones
    School of Public Health and Social Work, Queensland University of Technology, Brisbane, Australia.
  • Clint Douglas
    School of Nursing, Centre for Healthcare Transformation, Queensland University of Technology, Brisbane, Australia.
  • Rajeev Jarugula
    Emergency Department, The Prince Charles Hospital, Brisbane, Australia.
  • Kevin Chu
    Royal Brisbane andWomens Hospital, Brisbane, QLD, Australia.
  • Anthony Nguyen
    Australian e-Health Research Centre, CSIRO, Brisbane, QLD, Australia.