Deep learning is combined with massive-scale citizen science to improve large-scale image classification.

Journal: Nature biotechnology
Published Date:

Abstract

Pattern recognition and classification of images are key challenges throughout the life sciences. We combined two approaches for large-scale classification of fluorescence microscopy images. First, using the publicly available data set from the Cell Atlas of the Human Protein Atlas (HPA), we integrated an image-classification task into a mainstream video game (EVE Online) as a mini-game, named Project Discovery. Participation by 322,006 gamers over 1 year provided nearly 33 million classifications of subcellular localization patterns, including patterns that were not previously annotated by the HPA. Second, we used deep learning to build an automated Localization Cellular Annotation Tool (Loc-CAT). This tool classifies proteins into 29 subcellular localization patterns and can deal efficiently with multi-localization proteins, performing robustly across different cell types. Combining the annotations of gamers and deep learning, we applied transfer learning to create a boosted learner that can characterize subcellular protein distribution with F1 score of 0.72. We found that engaging players of commercial computer games provided data that augmented deep learning and enabled scalable and readily improved image classification.

Authors

  • Devin P Sullivan
    Computational Biology Department, Carnegie Mellon University, Pittsburgh, United States.
  • Casper F Winsnes
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.
  • Lovisa Åkesson
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.
  • Martin Hjelmare
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.
  • Mikaela Wiking
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.
  • Rutger Schutten
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.
  • Linzi Campbell
    CCP hf, Reyjkavik, Iceland.
  • Hjalti Leifsson
    CCP hf, Reyjkavik, Iceland.
  • Scott Rhodes
    CCP hf, Reyjkavik, Iceland.
  • Andie Nordgren
    CCP hf, Reyjkavik, Iceland.
  • Kevin Smith
  • Bernard Revaz
    MMOS Sàrl, Monthey, Switzerland.
  • Bergur Finnbogason
    CCP hf, Reyjkavik, Iceland.
  • Attila Szantner
    MMOS Sàrl, Monthey, Switzerland.
  • Emma Lundberg
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH - Royal Institute of Technology, Stockholm, Sweden.