Effective and efficient active learning for deep learning-based tissue image analysis.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Deep learning attained excellent results in digital pathology recently. A challenge with its use is that high quality, representative training datasets are required to build robust models. Data annotation in the domain is labor intensive and demands substantial time commitment from expert pathologists. Active learning (AL) is a strategy to minimize annotation. The goal is to select samples from the pool of unlabeled data for annotation that improves model accuracy. However, AL is a very compute demanding approach. The benefits for model learning may vary according to the strategy used, and it may be hard for a domain specialist to fine tune the solution without an integrated interface.

Authors

  • André L S Meirelles
    Department of Computer Science, University of Brasília, Brasília 70910-900, Brazil.
  • Tahsin Kurc
    Department of Biomedical Informatics, Stony Brook Medicine, Stony Brook, NY 11794, USA.
  • Jun Kong
    Stony Brook University, Stony Brook, NY.
  • Renato Ferreira
    Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil.
  • Joel Saltz
    Department of Biomedical Informatics, Stony Brook University, Stony Brook, New York.
  • George Teodoro
    Department of Computer Science, Federal University of Minas Gerais, Belo Horizonte, MG, 31270, USA.