Protocol for evaluating the fitness for purpose of an artificial intelligence product for radiology reporting in the BreastScreen New South Wales breast cancer screening programme.
Journal:
BMJ open
PMID:
38806433
Abstract
INTRODUCTION: Radiologist shortages threaten the sustainability of breast cancer screening programmes. Artificial intelligence (AI) products that can interpret mammograms could mitigate this risk. While previous studies have suggested this technology has accuracy comparable to radiologists most have been limited by using 'enriched' datasets and/or not considering the interaction between the algorithm and human readers. This study will address these limitations by comparing the accuracy of a workflow using AI alongside radiologists on a large consecutive cohort of examinations from a breast cancer screening programme. The study will combine the strengths of a large retrospective design with the benefit of prospective data collection. It will test this technology without risk to screening programme participants nor the need to wait for follow-up data. With a sample of 2 years of consecutive screening examinations, it is likely the largest test of this technology to date. The study will help determine whether this technology can safely be introduced into the BreastScreen New South Wales (NSW) population-based screening programme to address radiology workforce risks without compromising cancer detection rates or increasing false-positive recalls.