On construction of data preprocessing for real-life SoyLeaf dataset & disease identification using Deep Learning Models.
Journal:
Computational biology and chemistry
PMID:
40086344
Abstract
The vast volumes of data are needed to train Deep Learning Models from scratch to identify illnesses in soybean leaves. However, there is still a lack of sufficient high-quality samples. To overcome this problem, we have developed the real-life SoyLeaf dataset and used the pre-trained Deep Learning Models to identify leaf diseases. In this paper, we have initially developed the real-life SoyLeaf dataset collected from the ICAR-Indian Institute of Soybean Research (IISR) Center, Indore field. This SoyLeaf dataset contains 9786 high-quality soybean leaf images, including healthy and diseased leaves. Following this, we have adapted data preprocessing techniques to enhance the quality of images. In addition, we have utilized several Deep Learning Models, i.e., fourteen Keras Transfer Learning Models, to determine which model best fits the dataset on SoyLeaf diseases. The accuracies of the proposed fine-tuned models using the Adam optimizer are as follows: ResNet50V2 achieves 99.79%, ResNet101V2 achieves 99.89%, ResNet152V2 achieves 99.59%, InceptionV3 achieves 99.83%, InceptionResNetV2 achieves 99.79%, MobileNet achieves 99.82%, MobileNetV2 achieves 99.89%, DenseNet121 achieves 99.87%, and DenseNet169 achieves 99.87%. Similarly, the accuracies of the proposed fine-tuned models using the RMSprop optimizer are as follows: ResNet50V2 achieves 99.49%, ResNet101V2 achieves 99.45%, ResNet152V2 achieves 99.45%, InceptionV3 achieves 99.58%, InceptionResNetV2 achieves 99.88%, MobileNet achieves 99.73%, MobileNetV2 achieves 99.83%, DenseNet121 achieves 99.89%, and DenseNet169 achieves 99.77%. The experimental results of the proposed fine-tuned models show that only ResNet50V2, ResNet101V2, InceptionV3, InceptionResNetV2, MobileNet, MobileNetV2, DenseNet121, and DenseNet169 have performed better in terms of training, validation, and testing accuracies than other state-of-the-art models.