Multimodal fish maw type recognition based on Wasserstein generative adversarial network combined with gradient penalty and spectral fusion.

Journal: Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy
PMID:

Abstract

There are many types of fish maw with significantly varying prices. The specific type directly affects its market value and medicinal efficacy. This paper proposes a fish maw type recognition method based on Wasserstein generative adversarial network combined with gradient penalty (WGAN-GP) and spectral fusion. By collecting Raman and near-infrared (NIR) spectral data of four types of fish maw (Beihai Male Fish Maw, Beihai Female Fish Maw, Yellow Croaker Fish Maw, and Red Mouth Croaker Fish Maw), we used WGAN-GP for data augmentation. The performance of three spectral fusion strategies (data layer, feature layer, and decision layer) was explored based on two one-dimensional convolutional neural network (1D-CNN) models. The results indicate that, after applying data augmentation and expanding the training set to 3,600 samples, the performances of the 1D-VGG (NIR), 1D-VGG (Raman), 1D-ResNet (NIR), and 1D-ResNet (Raman) models all reach optimal levels. The accuracies on the test set are improved by 15.48%, 13.10%, 1.19%, and 5.95%, respectively. Under different fusion strategies, the 1D-VGG (Raman)-1D-VGG (NIR) model at the feature layer and 1D-ResNet (Raman)(1.0)-1D-ResNet (NIR)(1.0) model at the decision layer achieved the same classification results. They exceeded other models in accuracy (98.21%), precision (98.27%), recall (98.21%), and F1-score (98.21%) on the test set. In summary, this study demonstrated the great potential of data enhancement and multimodal spectral data fusion in fish maw type identification, providing analytical tools for the development of fish maw detection equipment based on multimodal techniques.

Authors

  • Hai Yin
    Department of Optoelectronic Engineering, Jinan University, Guangzhou, Guangdong 510632, China.
  • Qihang Yang
    Department of Optoelectronic Engineering, Jinan University, Guangzhou, Guangdong 510632, China.
  • Fangyuan Huang
    Guangdong Experimental High School, Guangzhou, Guangdong 510000, China.
  • Hongjie Li
    School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, Guangdong 518172, China.
  • Hui Wang
    Department of Vascular Surgery, Xuanwu Hospital, Capital Medical University, Beijing, China.
  • Huadan Zheng
    Department of Optoelectronic Engineering, Jinan University, Guangzhou, Guangdong 510632, China. Electronic address: zhenghuadan@jnu.edu.cn.
  • Furong Huang
    Guangdong Provincial Key Laboratory of Optical Fiber Sensing and Communications, Jinan University, Guangzhou 510632, China; Research Institute of Jinan University in Dongguan, Dongguan 523000, China. Electronic address: furong_huang@jnu.edu.cn.