Injury narrative text classification using factorization model.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

Narrative text is a useful source for identifying injury circumstances in routine emergency department data collections. Automatically classifying narratives with machine learning techniques is a promising approach that can reduce the tedious manual classification process. Existing work focuses on Naive Bayes, which does not always offer the best performance. This paper proposes Matrix Factorization approaches, along with a learning enhancement process, for this task. The results are compared with the performance of various other classification approaches. The impact of parameter settings on the classification results for a medical text dataset is discussed. With the right choice of dimension k, the Non-negative Matrix Factorization model achieves a 10-fold cross-validation accuracy of 0.93.
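
To make the role of the dimension k concrete, the following is a minimal sketch of non-negative matrix factorization using the standard multiplicative update rules. It is not the authors' implementation: the toy document-term matrix, the function names (`nmf`, `matmul`, `frobenius_error`) and the iteration count are all illustrative assumptions, and the paper's learning enhancement process and classifier are not shown. The sketch only illustrates how a documents-by-terms count matrix V can be approximated by W (documents by k) times H (k by terms), so that each row of W could serve as a k-dimensional feature vector for a downstream classifier.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(col) for col in zip(*m)]

def frobenius_error(v, w, h):
    """Squared Frobenius norm of V - WH."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))

def nmf(v, k, iterations=200, seed=0, eps=1e-9):
    """Factor a non-negative matrix V (n x m) into W (n x k) and H (k x m)
    with multiplicative updates. k is the latent dimension."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + eps for _ in range(m)] for _ in range(k)]
    for _ in range(iterations):
        # H <- H * (W^T V) / (W^T W H), element-wise
        wt = transpose(w)
        num = matmul(wt, v)
        den = matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(m)]
             for a in range(k)]
        # W <- W * (V H^T) / (W H H^T), element-wise
        ht = transpose(h)
        num = matmul(v, ht)
        den = matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(n)]
    return w, h

# Hypothetical toy data: 4 narratives x 5 terms (term counts), not from the paper.
V = [
    [2, 1, 0, 0, 0],
    [1, 2, 0, 0, 1],
    [0, 0, 2, 1, 0],
    [0, 0, 1, 2, 1],
]

W, H = nmf(V, k=2)
print("reconstruction error:", frobenius_error(V, W, H))
```

In practice, k would be chosen by comparing cross-validated classification accuracy across several candidate values, which is the kind of parameter study the abstract refers to.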

Authors

  • Lin Chen
    College of Sports, Nanjing Tech University, Nanjing, China.
  • Kirsten Vallmuur
  • Richi Nayak