Machine learning in medicine: a practical introduction to techniques for data pre-processing, hyperparameter tuning, and model comparison.

Journal: BMC medical research methodology
Published Date:

Abstract

BACKGROUND: There is growing enthusiasm for the application of machine learning (ML) and artificial intelligence (AI) techniques to clinical research and practice. However, instructions on how to develop robust high-quality ML and AI in medicine are scarce. In this paper, we provide a practical example of techniques that facilitate the development of high-quality ML systems including data pre-processing, hyperparameter tuning, and model comparison using open-source software and data.

Authors

  • AndrĂ© Pfob
    University Breast Unit, Department of Obstetrics and Gynecology, Heidelberg University Hospital, Im Neuenheimer Feld 440, 69120, Heidelberg, Germany.
  • Sheng-Chieh Lu
    MD Anderson Center for INSPiRED Cancer Care (Integrated Systems for Patient-Reported Data), The University of Texas MD Anderson Cancer Center, Houston, TX, USA.
  • Chris Sidey-Gibbons
    MD Anderson Center for INSPiRED Cancer Care, University of Texas, Houston, TX, United States.