MVD-Net: Semantic Segmentation of Cataract Surgery Using Multi-View Learning.

Journal: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Published Date:

Abstract

Semantic segmentation of surgery scenarios is a fundamental task for computer-aided surgery systems. Precise segmentation of surgical instruments and anatomies contributes to capturing accurate spatial information for tracking. However, uneven reflection and class imbalance lead the segmentation in cataract surgery to a challenging task. To desirably conduct segmentation, a network with multi-view decoders (MVD-Net) is proposed to present a generalizable segmentation for cataract surgery. Two discrepant decoders are implemented to achieve multi-view learning with the backbone of U-Net. The experiment is carried out on the Cataract Dataset for Image Segmentation (CaDIS). The ablation study verifies the effectiveness of the proposed modules in MVD-Net, and superior performance is provided by MVD-Net in the comparison with the state-of-the-art methods. The source code will be publicly released.

Authors

  • Mingyang Ou
  • Heng Li
    Department of Anesthesiology, Affiliated Nanhua Hospital, University of South China, Hengyang 421002, Hunan Province, China.
  • Haofeng Liu
  • Xiaoxuan Wang
    Economics and Management Department, North China Electric Power University, Baoding, 071000, Hebei, China. Wxx12345@ncepu.edu.cn.
  • Chenlang Yi
  • Luoying Hao
  • Yan Hu
    Department of Thoracic Surgery, The Second Xiangya Hospital of Central South University, Changsha, Hunan, China.
  • Jiang Liu
    Department of Pharmacy, The Fourth Hospital of Hebei Medical University Shijiazhuang 050000, Hebei, China.