MSDCNN: A multiscale dilated convolution neural network for fine-grained 3D shape classification.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Multi-view deep neural networks have shown excellent performance on 3D shape classification tasks. However, global features aggregated from multiple views data often lack content information and spatial relationship, which leads to difficult identification the small variance among subcategories in the same category. To solve this problem, in this paper, a novel multiscale dilated convolution neural network termed as MSDCNN is proposed for multi-view fine-grained 3D shape classification. Firstly, a sequence of views are rendered from 12-viewpoints around the input 3D shape by the sequential view capturing module. Then, the first 22 convolution layers of ResNeXt50 is employed to extract the semantic features of each view, and a global mixed feature map is obtained through the element-wise maximum operation of the 12 output feature maps. Furthermore, attention dilated module (ADM), which combines four concatenated attention dilated block (ADB), is designed to extract larger receptive field features from global mixed feature map to enhance context information among the views. Specifically, each ADB is consisted by an attention mechanism module and a dilated convolution with different dilation rates. In addition, prediction module with label smoothing is proposed to classify features, which contains 3 × 3 convolution and adaptive average pooling. The performance of our method is validated experimentally on the ModelNet10, ModelNet40 and FG3D datasets. Experimental results demonstrate the effectiveness and superiority of the proposed MSDCNN framework for 3D shape fine-grained classification.

Authors

  • Wei Zhou
    Department of Eye Function Laboratory, Eye Hospital, China Academy of Chinese Medical Sciences, Beijing, China.
  • Fujian Zheng
    CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China.
  • Yiheng Zhao
    College of Intelligent Technology and Engineering, Chongqing University of Science and Technology, Chongqing 401331, PR China. Electronic address: z_yiheng@live.concordia.ca.
  • Yiran Pang
    Department of Computer & Electrical Engineering and Computer Science, Florida Atlantic University, FL 33431, United States of America. Electronic address: ypang2022@fau.edu.
  • Jun Yi
    Department of Cardiothoracic Surgery, Jinling Hospital, Medical School of Nanjing University, Nanjing University, Nanjing, Jiangsu, China.