TCMP-300: A Comprehensive Traditional Chinese Medicinal Plant Dataset for Plant Recognition.

Journal: Scientific data
Published Date:

Abstract

Traditional Chinese Medicinal Plants (TCMPs) are often used to prevent and treat diseases for the human body. Since various medicinal plants have different therapeutic effects, plant recognition has become an important topic. Traditional identification of medicinal plants mainly relies on human experts, which does not meet the increased requirements in clinical practice. Artificial Intelligence (AI) research for plant recognition faces challenges due to the lack of a comprehensive medicinal plant dataset. Therefore, we present a TCMP dataset that includes 52,089 images in 300 categories. Compared to the existing medicinal plant datasets, our dataset has more categories and fine-grained plant parts to facilitate comprehensive plant recognition. The plant images were collected through the Bing search engine and cleaned by a pretrained vision foundation model with human verification. We conduct technical validation by training several state-of-the-art image classification models with advanced data augmentation on the dataset, and achieve 89.64% accuracy. Our dataset promotes the development and validation of advanced AI models for robust and accurate plant recognition.

Authors

  • Yanling Zhang
    1 School of Chinese Pharmacy, Beijing University of Chinese Medicine, Beijing 100102, P. R. China.
  • Wanhui Sun
    School of pharmacy, Xinyang Agriculture and Forestry University, Xinyang, 464000, China.
  • Chuanguang Yang
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
  • Libo Huang
  • Zhulin An
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.
  • Weilun Feng
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China.
  • Wenjing Tang
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China.
  • Yongjun Xu
    Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.