Classification of crystallization outcomes using deep convolutional neural networks.

Journal: PloS one
Published Date:

Abstract

The Machine Recognition of Crystallization Outcomes (MARCO) initiative has assembled roughly half a million annotated images of macromolecular crystallization experiments from various sources and setups. Here, state-of-the-art machine learning algorithms are trained and tested on different parts of this data set. We find that more than 94% of the test images can be correctly labeled, irrespective of their experimental origin. Because crystal recognition is key to high-density screening and the systematic analysis of crystallization experiments, this approach opens the door to both industrial and fundamental research applications.

Authors

  • Andrew E Bruno
    Center for Computational Research, University at Buffalo, Buffalo, New York, United States of America.
  • Patrick Charbonneau
    Department of Chemistry, Duke University, Durham, North Carolina, United States of America.
  • Janet Newman
    Collaborative Crystallisation Centre, CSIRO, Parkville, Victoria, Australia.
  • Edward H Snell
    Hauptman-Woodward Medical Research Institute, Buffalo, New York, United States of America.
  • David R So
    Google Brain, Google Inc., Mountain View, California, United States of America.
  • Vincent Vanhoucke
    Google Brain, Google Inc., Mountain View, California, United States of America.
  • Christopher J Watkins
    IM&T Scientific Computing, CSIRO, Clayton South, Victoria, Australia.
  • Shawn Williams
    Platform Technology and Sciences, GlaxoSmithKline Inc., Collegeville, Pennsylvania, United States of America.
  • Julie Wilson
    Department of Mathematics, University of York, York, United Kingdom.