On herbarium specimen images and artificial intelligence.

Journal: The New phytologist
Published Date:

Abstract

Digitized herbarium specimens are increasingly used to train artificial intelligence (AI) models in plant identification and other botanical applications. The abundant specimen images available in public repositories are especially amenable to AI. For instance, digitized herbarium sheets are relatively standardized - generally flattened portions of plant specimens mounted on paper with written metadata, imaged at a similar scale with uniform color-corrected illumination. Herbarium specimen identifications rely on standardized taxonomies that have also been reviewed by one or more professionals, providing high label accuracy - a critical advantage for AI model training. In this review, we tackle the basics of AI computer vision as it relates to digitized plant specimens: how AI is applied, what hypotheses can be tested, how datasets should be constructed, and how to produce a general workflow. Lastly, we provide recommendations for best practices along with recommendations for ways that future AI researchers may refine herbarium-focused models. In an era of declining taxonomic and specimen-based botanical expertise, we believe that this form of AI-based plant research presents an opportunity to augment human capacity and provides opportunity for hypothesis-based research that must be capitalized upon.

Authors

  • Michael Tessler
    Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA.
  • Damon P Little
    New York Botanical Garden, Bronx, NY, United States.

Keywords

No keywords available for this article.