STCNet: Spatio-Temporal Cross Network with subject-aware contrastive learning for hand gesture recognition in surface EMG.

Journal: Computers in biology and medicine
PMID:

Abstract

This paper introduces the Spatio-Temporal Cross Network (STCNet), a novel deep learning architecture tailored for robust hand gesture recognition in surface electromyography (sEMG) across multiple subjects. We address the challenges associated with the inter-subject variability and environmental factors such as electrode shift and muscle fatigue, which traditionally undermine the robustness of gesture recognition systems. STCNet integrates a convolutional-recurrent architecture with a spatio-temporal block that extracts features over segmented time intervals, enhancing both spatial and temporal analysis. Additionally, a rolling convolution technique designed to reflect the circular band structure of the sEMG measurement device is incorporated, thus capturing the inherent spatial relationships more effectively. We further propose a subject-aware contrastive learning framework that utilizes both subject and gesture label information to align the representation of vector space. Our comprehensive experimental evaluations demonstrate the superiority of STCNet under aggregated conditions, achieving state-of-the-art performance on benchmark datasets and effectively managing the variability among different subjects. The implemented code can be found at https://github.com/KNU-BrainAI/STCNet.

Authors

  • Jaemo Yang
    School of Electronics Engineering, Kyungpook National University, Daegu, South Korea.
  • Doheun Cha
    School of Electronic and Electrical Engineering, Kyungpook National University, Daegu, South Korea.
  • Dong-Gyu Lee
    Department of Artificial Intelligence, Kyungpook National University, Daehak-ro, Buk-gu, Daegu 41566, Republic of Korea.
  • Sangtae Ahn
    GE Research, 1 Research Circle KWC-1310C, Niskayuna, NY 12309, USA.