Applying MLP-Mixer and gMLP to Human Activity Recognition.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

The development of deep learning has led to the proposal of various models for human activity recognition (HAR). Convolutional neural networks (CNNs), initially proposed for computer vision tasks, are examples of models applied to sensor data. Recently, high-performing models based on Transformers and multi-layer perceptrons (MLPs) have also been proposed. When applying these methods to sensor data, we often initialize hyperparameters with values optimized for image processing tasks as a starting point. We suggest that comparable accuracy could be achieved with fewer parameters for sensor data, which typically have lower dimensionality than image data. Reducing the number of parameters would decrease memory requirements and computational complexity by reducing the model size. We evaluated the performance of two MLP-based models, MLP-Mixer and gMLP, by reducing the values of hyperparameters in their MLP layers from those proposed in the respective original papers. The results of this study suggest that the performance of MLP-based models is positively correlated with the number of parameters. Furthermore, these MLP-based models demonstrate improved computational efficiency for specific HAR tasks compared to representative CNNs.

Authors

  • Takeru Miyoshi
    Graduate School of National Science and Technology, Kanazawa University, Kanazawa 920-1192, Japan.
  • Makoto Koshino
    National Institute of Technology, Ishikawa College, Tsubata 929-0392, Japan.
  • Hidetaka Nambo
    School of Electrical Information Communication Engineering, College of Science and Engineering, Kanazawa University Kanazawa Japan.