GO-MAE: Self-supervised pre-training via masked autoencoder for OCT image classification of gynecology.

Journal: Neural networks : the official journal of the International Neural Network Society
PMID:

Abstract

Genitourinary syndrome of menopause (GSM) is a physiological disorder caused by reduced levels of oestrogen in menopausal women. Gradually, its symptoms worsen with age and prolonged menopausal status, which gravely impacts the quality of life as well as the physical and mental health of the patients. In this regard, optical coherence tomography (OCT) system effectively reduces the patient's burden in clinical diagnosis with its noncontact, noninvasive tomographic imaging process. Consequently, supervised computer vision models applied on OCT images have yielded excellent results for disease diagnosis. However, manual labeling on an extensive number of medical images is expensive and time-consuming. To this end, this paper proposes GO-MAE, a pretraining framework for self-supervised learning of GSM OCT images based on Masked Autoencoder (MAE). To the best of our knowledge, this is the first study that applies self-supervised learning methods on the field of GSM disease screening. Focusing on the semantic complexity and feature sparsity of GSM OCT images, the objective of this study is two-pronged: first, a dynamic masking strategy is introduced for OCT characteristics in downstream tasks. This method can reduce the interference of invalid features on the model and shorten the training time. In the encoder design of MAE, we propose a convolutional neural network and transformer parallel network architecture (C&T), which aims to fuse the local and global representations of the relevant lesions in an interactive manner such that the model can still learn the richer differences between the feature information without labels. Thereafter, a series of experimental results on the acquired GSM-OCT dataset revealed that GO-MAE yields significant improvements over existing state-of-the-art techniques. Furthermore, the superiority of the model in terms of robustness and interpretability was verified through a series of comparative experiments and visualization operations, which consequently demonstrated its great potential for screening GSM symptoms.

Authors

  • Haoran Wang
    Department of Urology, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510700, China.
  • Xinyu Guo
    Keya Medical Co., Ltd, Longong, Shenzhen, China. Electronic address: xinyug@keyamedical.com.
  • Kaiwen Song
  • Mingyang Sun
    Key Laboratory of Geophysical Exploration Equipment, Ministry of Education, College of Instrumentation and Electrical Engineering, Jilin University, Changchun 130012, China. Electronic address: smy19@mails.jlu.edu.cn.
  • Yanbin Shao
    Key Laboratory of Geophysical Exploration Equipment, Ministry of Education, College of Instrumentation and Electrical Engineering, Jilin University, Changchun 130012, China. Electronic address: shaoyb21@emails.jlu.edu.cn.
  • Songfeng Xue
    Key Laboratory of Geophysical Exploration Equipment, Ministry of Education, College of Instrumentation and Electrical Engineering, Jilin University, Changchun 130012, China. Electronic address: xuesf21@mails.jlu.edu.cn.
  • Hongwei Zhang
    Jiangsu Provincial Key Laboratory for TCM Evaluation and Translational Development, China Pharmaceutical University, Nanjing, Jiangsu 211198, China.
  • Tianyu Zhang
    State Key Laboratory of Respiratory Disease, Joint School of Life Sciences, Guangzhou Chest Hospital, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou Medical University, Guangzhou, China.