Multi-class rice seed recognition based on deep space and channel residual network combined with double attention mechanism.
Journal:
PloS one
PMID:
40378167
Abstract
Accurately recognizing rice seed varieties is challenging because the seeds have diverse morphological characteristics and the classification requirements are complex. Traditional image recognition methods often struggle with both accuracy and efficiency in this context. To address these limitations, this study proposes the Deep Space and Channel Residual Network with Double Attention Mechanism (RSCD-Net) to improve recognition accuracy across 36 rice seed varieties. The core innovation of RSCD-Net is the Space and Channel Feature Extraction Residual Block (SCR-Block), which improves inter-class differentiation while minimizing redundant features, thereby improving computational efficiency. The RSCD-Net architecture consists of 16 SCR-Blocks arranged in four convolutional stages of 3, 4, 6, and 3 units, respectively. Additionally, a Double Attention Mechanism (A2Net) is incorporated to enlarge the network's global receptive field, improving its capacity to distinguish subtle variations among seed types. Experimental results on a self-collected dataset show that RSCD-Net achieves an average accuracy of 81.94%, surpassing the baseline model by 4.16%. Compared with state-of-the-art models such as InceptionResNetV2, ConvNeXt, MobileNetV3, and Swin Transformer, RSCD-Net improves accuracy by 1.17%, 3%, 24.72%, and 13.22%, respectively. These findings indicate that RSCD-Net provides an effective and efficient solution for rice seed classification and offers a promising reference for similar fine-grained recognition challenges in agricultural applications.
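The stage layout and the double attention step described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation: the abstract does not specify where the attention block sits or how its inputs are projected, so the function below follows only the general gather-then-distribute idea of double attention and takes its three inputs directly. The names `SCR_BLOCKS_PER_STAGE`, `softmax`, and `double_attention` are hypothetical and chosen for illustration.

```python
import math

# Stage layout stated in the abstract: four stages of SCR-Blocks.
SCR_BLOCKS_PER_STAGE = (3, 4, 6, 3)


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def double_attention(features, gather_logits, distribute_logits):
    """Core gather/distribute step of a double attention block (sketch).

    features:          m x N list (m feature channels, N spatial positions)
    gather_logits:     n x N list, normalised over positions in step 1
    distribute_logits: n x N list, normalised over the n descriptors in step 2
    Returns an m x N list. In a trained network the three inputs would
    normally be learned projections of one feature map; passing them in
    directly is an assumption made to keep the sketch self-contained.
    """
    m, n = len(features), len(gather_logits)
    num_pos = len(features[0])

    # Step 1 (gather): each of the n attention maps pools every feature
    # channel over all positions into one global descriptor.
    gather = [softmax(row) for row in gather_logits]
    descriptors = [
        [sum(features[i][p] * gather[k][p] for p in range(num_pos))
         for k in range(n)]
        for i in range(m)
    ]

    # Step 2 (distribute): each position receives a softmax-weighted mix
    # of the n global descriptors, so it can see the whole image.
    weights = [
        softmax([distribute_logits[k][p] for k in range(n)])
        for p in range(num_pos)
    ]
    return [
        [sum(descriptors[i][k] * weights[p][k] for k in range(n))
         for p in range(num_pos)]
        for i in range(m)
    ]
```

Calling `double_attention` with an m x N feature list and two n x N logit lists returns an m x N list in which every position is built from descriptors pooled over all positions, which is what gives the block its global receptive field. `sum(SCR_BLOCKS_PER_STAGE)` reproduces the 16 SCR-Blocks stated in the abstract.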