Standardization and accuracy of race and ethnicity data: Equity implications for medical AI.

Journal: PLOS digital health

Published Date: May 29, 2025

Abstract

The rapid integration of artificial intelligence (AI) into healthcare has raised many concerns about race bias in AI models. Yet, overlooked in this dialogue is the lack of quality control for the accuracy of patient race and ethnicity (r/e) data in electronic health records (EHR). This article critically examines the factors driving inaccurate and unrepresentative r/e datasets. These include conceptual uncertainties about how to categorize race and ethnicity, shortcomings in data collection practices and EHR standards, and the misclassification of patients' race or ethnicity. To address these challenges, we propose a two-pronged action plan. First, we present a set of best practices for healthcare systems and medical AI researchers to improve r/e data accuracy. Second, we call for developers of medical AI models to transparently warrant the quality of their r/e data. Given the ethical and scientific imperatives of ensuring high-quality r/e data in AI-driven healthcare, we argue that these steps should be taken immediately.

Authors

Alexandra Tsalidis

Future of Life Institute, Brussels, Belgium.

Lakshmi Bharadwaj

Laboratory of Biochemical Pharmacology, Emory University School of Medicine, Atlanta, Georgia, United States of America.

Francis X Shen

University of Minnesota Law School, Minneapolis, Minnesota, United States of America.

External Resources

PubMed ID: 40440407
