Recommended practices and ethical considerations for natural language processing-assisted observational research: A scoping review.

Journal: Clinical and translational science
PMID:

Abstract

An increasing number of studies have reported using natural language processing (NLP) to assist observational research by extracting clinical information from electronic health records (EHRs). Currently, no standardized reporting guidelines for NLP-assisted observational studies exist. The absence of detailed reporting guidelines may create ambiguity in the use of NLP-derived content, knowledge gaps in the current research reporting practices, and reproducibility challenges. To address these issues, we conducted a scoping review of NLP-assisted observational clinical studies and examined their reporting practices, focusing on NLP methodology and evaluation. Through our investigation, we discovered a high variation regarding the reporting practices, such as inconsistent use of references for measurement studies, variation in the reporting location (reference, appendix, and manuscript), and different granularity of NLP methodology and evaluation details. To promote the wide adoption and utilization of NLP solutions in clinical research, we outline several perspectives that align with the six principles released by the World Health Organization (WHO) that guide the ethical use of artificial intelligence for health.

Authors

  • Sunyang Fu
    Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, USA.
  • Liwei Wang
    Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
  • Sungrim Moon
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.
  • Nansu Zong
    Health System Department of Biomedical Informatics, University of California, San Diego, La Jolla, California, USA.
  • Huan He
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.
  • Vikas Pejaver
    Department of Biomedical Informatics and Medical Education and the eScience Institute, University of Washington, Seattle, Washington 98109, USA; email: vpejaver@uw.edu.
  • Rose Relevo
    The National Center for Data to Health, Bethesda, Maryland, USA.
  • Anita Walden
    University of Arkansas for Medical Sciences, Little Rock, Arkansas.
  • Melissa Haendel
    Department of Medical Informatics & Clinical Epidemiology, Oregon Health & Science University, Portland, OR 97239, USA.
  • Christopher G Chute
  • Hongfang Liu
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.