Training instance segmentation neural network with synthetic datasets for crop seed phenotyping.

Journal: Communications biology
Published Date:

Abstract

In order to train the neural network for plant phenotyping, a sufficient amount of training data must be prepared, which requires time-consuming manual data annotation process that often becomes the limiting step. Here, we show that an instance segmentation neural network aimed to phenotype the barley seed morphology of various cultivars, can be sufficiently trained purely by a synthetically generated dataset. Our attempt is based on the concept of domain randomization, where a large amount of image is generated by randomly orienting the seed object to a virtual canvas. The trained model showed 96% recall and 95% average Precision against the real-world test dataset. We show that our approach is effective also for various crops including rice, lettuce, oat, and wheat. Constructing and utilizing such synthetic data can be a powerful method to alleviate human labor costs for deploying deep learning-based analysis in the agricultural domain.

Authors

  • Yosuke Toda
    Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama, 332-0012, Japan. tyosuke@aquaseerser.com.
  • Fumio Okura
    Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama, 332-0012, Japan.
  • Jun Ito
    Kihara Institute for Biological Research, Yokohama City University, Maioka 641-12, Totsuka, Yokohama, 244-0813, Japan.
  • Satoshi Okada
    Institute of Plant Science and Resources, Okayama University, Chuo 2-20-1, Kurashiki, Okayama, 710-0046, Japan.
  • Toshinori Kinoshita
    Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University, Chikusa, Nagoya, 464-8602, Japan.
  • Hiroyuki Tsuji
    Kihara Institute for Biological Research, Yokohama City University, Maioka 641-12, Totsuka, Yokohama, 244-0813, Japan.
  • Daisuke Saisho
    Institute of Plant Science and Resources, Okayama University, Chuo 2-20-1, Kurashiki, Okayama, 710-0046, Japan.