Compressing 3DCNNs based on tensor train decomposition.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Three-dimensional convolutional neural networks (3DCNNs) have been applied in many tasks, e.g., video and 3D point cloud recognition. However, due to the higher dimension of convolutional kernels, the space complexity of 3DCNNs is generally larger than that of traditional two-dimensional convolutional neural networks (2DCNNs). To miniaturize 3DCNNs for the deployment in confining environments such as embedded devices, neural network compression is a promising approach. In this work, we adopt the tensor train (TT) decomposition, a straightforward and simple in situ training compression method, to shrink the 3DCNN models. Through proposing tensorizing 3D convolutional kernels in TT format, we investigate how to select appropriate TT ranks for achieving higher compression ratio. We have also discussed the redundancy of 3D convolutional kernels for compression, core significance and future directions of this work, as well as the theoretical computation complexity versus practical executing time of convolution in TT. In the light of multiple contrast experiments based on VIVA challenge, UCF11, UCF101, and ModelNet40 datasets, we conclude that TT decomposition can compress 3DCNNs by around one hundred times without significant accuracy loss, which will enable its applications in extensive real world scenarios.

Authors

  • Dingheng Wang
    School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: wangdai11@stu.xjtu.edu.cn.
  • Guangshe Zhao
    School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China. Electronic address: zhaogs@xjtu.edu.cn.
  • Guoqi Li
    University of Chinese Academy of Sciences, Beijing 100049, China.
  • Lei Deng
    1] Center for Brain Inspired Computing Research (CBICR), Department of Precision Instrument, Tsinghua University, Beijing 100084, China [2] Optical Memory National Engineering Research Center, Department of Precision Instrument, Tsinghua University, Beijing 100084, China.
  • Yang Wu