Robust Polyp Detection and Diagnosis through Compositional Prompt-Guided Diffusion Models.

Journal: IEEE transactions on medical imaging
Published Date:

Abstract

Colorectal cancer (CRC) is a significant global health concern, and early detection through screening plays a critical role in reducing mortality. While deep learning models have shown promise in improving polyp detection, classification, and segmentation, their generalization across diverse clinical environments, particularly with out-of-distribution (OOD) data, remains a challenge. Multi-center datasets like PolypGen have been developed to address these issues, but their collection is costly and time-consuming. Traditional data augmentation techniques provide limited variability, failing to capture the complexity of medical images. Diffusion models have emerged as a promising solution for generating synthetic polyp images, but the image generation process in current models mainly relies on segmentation masks as the condition, limiting their ability to capture the full clinical context. To overcome these limitations, we propose a Progressive Spectrum Diffusion Model (PSDM) that integrates diverse clinical annotations-such as segmentation masks, bounding boxes, and colonoscopy reports-by transforming them into compositional prompts. These prompts are organized into coarse and fine components, allowing the model to capture both broad spatial structures and fine details, generating clinically accurate synthetic images. By augmenting training data with PSDM-generated samples, our model significantly improves polyp detection, classification, and segmentation. For instance, on the PolypGen dataset, PSDM increases the F1 score by 2.12% and the mean average precision by 3.09%, demonstrating superior performance in OOD scenarios and enhanced generalization.

Authors

  • Jia Yu
    Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu 610212, Sichuan, China; School of Clinical Medicine, Soochow University, Suzhou 215123, Jiangsu, China.
  • Yan Zhu
    Department of Chemistry, Xixi Campus, Zhejiang University, Hangzhou, 310028, China. Electronic address: zhuyan@zju.edu.cn.
  • Peiyao Fu
  • Tianyi Chen
    Department of Computer Science, City University of Hong Kong, Kowloon 999077, Hong Kong.
  • Junbo Huang
  • Quanlin Li
    State Key Laboratory of Digital Manufacturing Equipment and Technology, School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, 430074, China.
  • Pinghong Zhou
    Endoscopy Center and Endoscopy Research Institute, Zhongshan Hospital, Fudan University, Shanghai, China.
  • Zhihua Wang
  • Fei Wu
    Zhejiang University, 38 Zheda Road, Hangzhou 310058, Zhejiang, China.
  • Shuo Wang
    College of Tea & Food Science, Anhui Agricultural University, Hefei, China.
  • Xian Yang

Keywords

No keywords available for this article.