Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness.

Journal: BMJ (Clinical research ed.)
Published Date:

Abstract

Machine learning, artificial intelligence, and other modern statistical methods are providing new opportunities to operationalise previously untapped and rapidly growing sources of data for patient benefit. Despite much promising research currently being undertaken, particularly in imaging, the literature as a whole lacks transparency, clear reporting to facilitate replicability, exploration for potential ethical concerns, and clear demonstrations of effectiveness. Among the many reasons why these problems exist, one of the most important (for which we provide a preliminary solution here) is the current lack of best practice guidance specific to machine learning and artificial intelligence. However, we believe that interdisciplinary groups pursuing research and impact projects involving machine learning and artificial intelligence for health would benefit from explicitly addressing a series of questions concerning transparency, reproducibility, ethics, and effectiveness (TREE). The 20 critical questions proposed here provide a framework for research groups to inform the design, conduct, and reporting; for editors and peer reviewers to evaluate contributions to the literature; and for patients, clinicians and policy makers to critically appraise where new findings may deliver patient benefit.

Authors

  • Sebastian Vollmer
    Alan Turing Institute, Kings Cross, London, UK.
  • Bilal A Mateen
    Alan Turing Institute, Kings Cross, London, UK.
  • Gergo Bohner
    Alan Turing Institute, Kings Cross, London, UK.
  • Franz J Király
    Alan Turing Institute, Kings Cross, London, UK.
  • Rayid Ghani
    Ian Pan is with the Department of Biostatistics, School of Public Health, Brown University, Providence, RI. Laura B. Nolan is with the Population Research Center, School of Social Work, Columbia University, New York, NY. Rashida R. Brown is with the Division of Epidemiology, School of Public Health, University of California, Berkeley. Romana Khan is with the Kellogg School of Management, Northwestern University, Evanston, IL. Paul van der Boor and Rayid Ghani are with the Center for Data Science and Public Policy, University of Chicago, Chicago, IL. Daniel G. Harris is with the Department of Human Services, Illinois State Government, Chicago.
  • Pall Jonsson
    National Institute for Health and Care Excellence, London, UK.
  • Sarah Cumbers
    Health and Social Care Directorate, National Institute for Health and Care Excellence, London, UK.
  • Adrian Jonas
    Data and Analytics Group, National Institute for Health and Care Excellence, London, UK.
  • Katherine S L McAllister
    Data and Analytics Group, National Institute for Health and Care Excellence, London, UK.
  • Puja Myles
    Clinical Practice Research Datalink, Medicines and Healthcare products Regulatory Agency, London, UK.
  • David Granger
    Medicines and Healthcare products Regulatory Agency, London, UK.
  • Mark Birse
    Medicines and Healthcare products Regulatory Agency, London, UK.
  • Richard Branson
    Medicines and Healthcare products Regulatory Agency, London, UK.
  • Karel G M Moons
    Julius Center for Health Sciences and Primary Care, and Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, Netherlands.
  • Gary S Collins
    Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Botnar Research Centre, University of Oxford, Windmill Road, Oxford, OX3 7LD UK; Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
  • John P A Ioannidis
    Stanford Prevention Research Center, Department of Medicine, Stanford University, Stanford, California.
  • Chris Holmes
    Department of Statistics, University of Oxford, Oxford, UK.
  • Harry Hemingway
    Institute of Health Informatics, University College London, London, UK.