Improved U-Net based on ResNet and SE-Net with dual attention mechanism for glottis semantic segmentation.

Journal: Medical engineering & physics

PMID: 39979012

Abstract

In previous tasks of glottis image segmentation, the position attention mechanism was rarely incorporated, neglecting the detailed information in glottis position detection. Aiming to improve the U-Net architecture, this study introduces the dual attention mechanism based on the squeeze and excitation (SE)-Net model. This mechanism can integrate traditional channel attention with position attention mechanisms to effectively adjust the weights of crucial features and significance of positions. Replacing the weight adjustment mechanism in SE-Net with the dual attention mechanism creates a broader perspective, enhancing the sensitivity to important features in the model. Furthermore, based on the characteristics of SE-Net, the skip-connection feature of U-Net can still be retained. The architecture proposed in this paper further replaces the convolutional layers in the U-Net encoder with the bottleneck to preserve the information on the features without significantly increasing the amount of computation. In addition, the decoder is replaced with residual blocks to reduce overfitting. The results of the experiment showed that models with retained features demonstrate better accuracy while reducing overfitting. The proposed model achieved positive results in predicting the scores on the public benchmark for automatic glottis segmentation (BAGLS) dataset.

Authors

Jui-Chung Ni

Institute of Computer and Communication Engineering, National Cheng Kung University, Tainan, 701, Taiwan.
Shih-Hsiung Lee

Department of Intelligent Commerce, National Kaohsiung University of Science and Technology, Kaohsiung, 82444, Taiwan. Electronic address: shlee@nkust.edu.tw.
Yen-Cheng Shen

Department of Electrical Engineering, National Cheng Kung University, Tainan, 701, Taiwan.
Chu-Sing Yang

Department of Electrical Engineering, National Cheng Kung University, Tainan, 701, Taiwan.

Keywords

Glottis Humans Image Processing, Computer-Assisted Neural Networks, Computer Semantics

External Resources

View on PubMed Access via DOI PubMed (39979012)

Improved U-Net based on ResNet and SE-Net with dual attention mechanism for glottis semantic segmentation.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals