Intelligent identification analysis and process design for highly similar categories using Platycerium as an example.

Journal: Scientific reports
Published Date:

Abstract

This study tackles the challenge of image recognition for datasets with high inter-class similarity, using 18 native Platycerium species as a case study. Due to their substantial visual similarities, initial training with ResNet50 yielded a baseline accuracy of less than 10%. To address this, we conducted a comprehensive analysis using multidimensional confusion matrices to identify seven primary confusion factors, such as image edges, textures, and shapes, and stratified the dataset into processed and unprocessed images optimized for these factors through adjustments in saturation, brightness, and sharpening. A refinement process leveraging confusion matrices and bootstrapping was proposed to address ambiguous classes, significantly improving recognition of highly similar species. Recognition accuracy increased to approximately 60% after applying confusion factor analysis and image optimization, with further gains to over 80% using EfficientNet-b4 and over 90% using EfficientNet-b7. These findings highlight the importance of feature selection and grouped analysis in recognizing highly similar images, offering a robust framework for optimizing recognition accuracy in challenging datasets and providing valuable insights for advancing image recognition technologies.

Authors

  • Li-Wei Chen
    Department of Biomedical Engineering, College of Medicine and College of Engineering, National Taiwan University, Taipei, Taiwan.
  • Wei-Lun Lin
    Communications Engineering, Feng Chia University, Taichung, Taiwan. weilunlin@mail.fcu.edu.tw.

Keywords

No keywords available for this article.