An efficient prototype method to identify and correct misspellings in clinical text.

Journal: BMC research notes

Published Date: Jan 18, 2019

Abstract

OBJECTIVE: Misspellings in clinical free text present challenges to natural language processing. With an objective to identify misspellings and their corrections, we developed a prototype spelling analysis method that implements Word2Vec, Levenshtein edit distance constraints, a lexical resource, and corpus term frequencies. We used the prototype method to process two different corpora, surgical pathology reports, and emergency department progress and visit notes, extracted from Veterans Health Administration resources. We evaluated performance by measuring positive predictive value and performing an error analysis of false positive output, using four classifications. We also performed an analysis of spelling errors in each corpus, using common error classifications.

Authors

T Elizabeth Workman

VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
Yijun Shao

Veterans Affairs Medical Center, Washington, DC; George Washington University, Washington, DC.
Guy Divita

VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
Qing Zeng-Treitler

Veterans Affairs Medical Center, Washington, DC; George Washington University, Washington, DC.

Keywords

Algorithms Dictionaries as Topic Humans Language Medical Informatics Medical Records Systems, Computerized Natural Language Processing Pathology, Surgical Reproducibility of Results Research Report Unified Medical Language System Vocabulary, Controlled

External Resources

View on PubMed Access via DOI PubMed (30658682)

An efficient prototype method to identify and correct misspellings in clinical text.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals