Self-supervised learning in medicine and healthcare.

Journal: Nature biomedical engineering
Published Date:

Abstract

The development of medical applications of machine learning has required manual annotation of data, often by medical experts. Yet, the availability of large-scale unannotated data provides opportunities for the development of better machine-learning models. In this Review, we highlight self-supervised methods and models for use in medicine and healthcare, and discuss the advantages and limitations of their application to tasks involving electronic health records and datasets of medical images, bioelectrical signals, and sequences and structures of genes and proteins. We also discuss promising applications of self-supervised learning for the development of models leveraging multimodal datasets, and the challenges in collecting unbiased data for their training. Self-supervised learning may accelerate the development of medical artificial intelligence.

Authors

  • Rayan Krishnan
    Department of Computer Science, Stanford University, Stanford, CA, USA.
  • Pranav Rajpurkar
    Harvard Medical School, Department of Biomedical Informatics, Cambridge, MA, 02115, US.
  • Eric J Topol
    Scripps Research Translational Institute, La Jolla, CA 92037, USA; Scripps Clinic Division of Cardiovascular Diseases, La Jolla, CA 92037, USA. Electronic address: etopol@scripps.edu.