L-MAE: Longitudinal masked auto-encoder with time and severity-aware encoding for diabetic retinopathy progression prediction.

Journal: Computers in biology and medicine
Published Date:

Abstract

Pre-training strategies based on self-supervised learning (SSL) have demonstrated success as pretext tasks for downstream tasks in computer vision. However, while SSL methods are often domain-agnostic, their direct application to medical imaging is challenging due to the distinct nature of medical images, including specific anatomical and temporal patterns relevant to disease progression. Additionally, traditional SSL pretext tasks often lack the contextual knowledge that is essential for clinical decision support. In this paper, we developed a longitudinal masked auto-encoder (MAE) that builds on the Transformer-based MAE architecture, specifically introducing a time-aware position embedding and a disease progression-aware masking strategy. Unlike traditional sequential approaches, our method incorporates the actual time intervals between examinations, allowing for better capture of temporal trends. Furthermore, the masking strategy evolves in alignment with disease progression during follow-up exams to capture pathological changes, improving disease progression assessments. Using the OPHDIAT dataset, a large-scale longitudinal screening dataset for diabetic retinopathy (DR), we evaluated our pre-trained model by predicting the severity level at the next visit within three years, based on past examination series. Our findings demonstrate that both the time-aware position embedding and the disease progression-informed masking significantly enhance predictive accuracy. Compared to conventional baseline models and standard longitudinal Transformers, these simple yet effective adaptations substantially improve the predictive power of deep classification models in this domain.

Authors

  • Rachid Zeghlache
    LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France.
  • Pierre-Henri Conze
    Inserm, UMR 1101, Brest F-29200, France; Institut Mines-Télécom Atlantique, Brest F-29200, France.
  • Mostafa El Habib Daho
    LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France. Electronic address: mostafa.elhabibdaho@univ-brest.fr.
  • Yihao Li
    LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France.
  • Alireza Rezaei
    LaTIM UMR 1101, Inserm, Brest, France; University of Western Brittany, Brest, France.
  • Hugo Le Boité
    Sorbonne University, Paris, France; Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France.
  • Ramin Tadayoni
    Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France; Paris Cité University, Paris, France.
  • Pascal Massin
    Ophthalmology Department, Lariboisière Hospital, AP-HP, Paris, France; Paris Cité University, Paris, France.
  • Béatrice Cochener
    Université de Bretagne Occidentale, 3 rue des Archives, Brest F-29200, France; Inserm, UMR 1101, 22 avenue Camille-Desmoulins, Brest F-29200, France; Service d'Ophtalmologie, CHRU Brest, 2 avenue Foch, Brest F-29200, France.
  • Ikram Brahim
    Inserm, UMR 1101, Brest, France.
  • Gwenolé Quellec
    Inserm, UMR 1101, 22 avenue Camille-Desmoulins, Brest F-29200, France. Electronic address: gwenole.quellec@inserm.fr.
  • Mathieu Lamard
    Université de Bretagne Occidentale, 3 rue des Archives, Brest F-29200, France; Inserm, UMR 1101, 22 avenue Camille-Desmoulins, Brest F-29200, France.