Multiple-Attention Mechanism Network for Semantic Segmentation.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

Contextual information and the dependencies between dimensions is vital in image semantic segmentation. In this paper, we propose a multiple-attention mechanism network (MANet) for semantic segmentation in a very effective and efficient way. Concretely, the contributions are as follows: (1) a novel dual-attention mechanism for capturing feature dependencies in spatial and channel dimensions, where the adjacent position attention captures the dependencies between pixels well; (2) a new cross-dimensional interactive attention feature fusion module, which strengthens the fusion of fine location structure information in low-level features and category semantic information in high-level features. We conduct extensive experiments on semantic segmentation benchmarks including PASCAL VOC 2012 and Cityscapes datasets. Our MANet achieves the mIoU scores of 75.5% and 72.8% on PASCAL VOC 2012 and Cityscapes datasets, respectively. The effectiveness of the network is higher than the previous popular semantic segmentation networks under the same conditions.

Authors

  • Dongli Wang
    School of Automation and Electronic Information, Xiangtan University, Xiangtan 411105, China.
  • Shengliang Xiang
    School of Automation and Electronic Information, Xiangtan University, Xiangtan 411105, China.
  • Yan Zhou
    Department of Computer Science, University of Texas at Dallas, Richardson, Texas 75080, United States.
  • Jinzhen Mu
    Shanghai Aerospace Control Technology Institute, Shanghai 201109, China.
  • Haibin Zhou
    School of Mathematics and Computational Science, Xiangtan University, Xiangtan 411105, China.
  • Richard Irampaye
    School of Mathematics and Computational Science, Xiangtan University, Xiangtan 411105, China.