DiversityScanner: Robotic handling of small invertebrates with machine learning methods.

Journal: Molecular ecology resources
Published Date:

Abstract

Invertebrate biodiversity remains poorly understood although it comprises much of the terrestrial animal biomass, most species and supplies many ecosystem services. The main obstacle is specimen-rich samples obtained with quantitative sampling techniques (e.g., Malaise trapping). Traditional sorting requires manual handling, while molecular techniques based on metabarcoding lose the association between individual specimens and sequences and thus struggle with obtaining precise abundance information. Here we present a sorting robot that prepares specimens from bulk samples for barcoding. It detects, images and measures individual specimens from a sample and then moves them into the wells of a 96-well microplate. We show that the images can be used to train convolutional neural networks (CNNs) that are capable of assigning the specimens to 14 insect taxa (usually families) that are particularly common in Malaise trap samples. The average assignment precision for all taxa is 91.4% (75%-100%). This ability of the robot to identify common taxa then allows for taxon-specific subsampling, because the robot can be instructed to only pick a prespecified number of specimens for abundant taxa. To obtain biomass information, the images are also used to measure specimen length and estimate body volume. We outline how the DiversityScanner can be a key component for tackling and monitoring invertebrate diversity by combining molecular and morphological tools: the images generated by the robot become training images for machine learning once they are labelled with taxonomic information from DNA barcodes. We suggest that a combination of automation, machine learning and DNA barcoding has the potential to tackle invertebrate diversity at an unprecedented scale.

Authors

  • Lorenz Wührl
    Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany.
  • Christian Pylatiuk
    Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Eggenstein, Germany.
  • Matthias Giersch
    Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany.
  • Florian Lapp
    Institute for Automation and Applied Informatics (IAI), Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany.
  • Thomas von Rintelen
    Museum für Naturkunde, Center for Integrative Biodiversity Discovery, Leibniz-Institut für Evolutions- und Biodiversitätsforschung, Berlin, Germany.
  • Michael Balke
    SNSB - Zoologische Staatssammlung München, Munich, Germany.
  • Stefan Schmidt
    Heidelberg Engineering GmbH, 69115, Heidelberg, Germany.
  • Pierfilippo Cerretti
    Department of Biology and Biotechnology 'Charles Darwin', Sapienza University of Rome, Rome, Italy.
  • Rudolf Meier
    Museum für Naturkunde, Center for Integrative Biodiversity Discovery, Leibniz-Institut für Evolutions- und Biodiversitätsforschung, Berlin, Germany.