Improving AI models for rare thyroid cancer subtype by text guided diffusion models.

Journal: Nature communications
PMID:

Abstract

Artificial intelligence applications in oncology imaging often struggle with diagnosing rare tumors. We identify significant gaps in detecting uncommon thyroid cancer types with ultrasound, where scarce data leads to frequent misdiagnosis. Traditional augmentation strategies do not capture the unique disease variations, hindering model training and performance. To overcome this, we propose a text-driven generative method that fuses clinical insights with image generation, producing synthetic samples that realistically reflect rare subtypes. In rigorous evaluations, our approach achieves substantial gains in diagnostic metrics, surpasses existing methods in authenticity and diversity measures, and generalizes effectively to other private and public datasets with various rare cancers. In this work, we demonstrate that text-guided image augmentation substantially enhances model accuracy and robustness for rare tumor detection, offering a promising avenue for more reliable and widespread clinical adoption.

Authors

  • Fang Dai
    College of Science, Xi'an University of Technology, Xi'an, China.
  • Siqiong Yao
    State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.
  • Min Wang
    National and Local Joint Engineering Research Center of Ecological Treatment Technology for Urban Water Pollution, Wenzhou University, Wenzhou 325035, China.
  • Yicheng Zhu
    Research School of Biology, The Australian National University, Canberra, Australian Capital Territory 2601, Australia yicheng.zhu@anu.edu.au gavin.huttley@anu.edu.au.
  • Xiangjun Qiu
    Department of Automation, Tsinghua University, Beijing, PR China.
  • Peng Sun
    Department of Microelectronics, Nankai University, Tianjin, 300350, PR China.
  • Cheng Qiu
    Department of Psychology, University of Pennsylvania, Philadelphia, PA, USA.
  • Jisheng Yin
    Shcool of Artificial Intelligence, University of Chinese Academy of sciences, Beijing, PR China.
  • Guangtai Shen
    Xin'an League People's Hospital, Xing'an League, Inner Mongolia, PR China.
  • Jingjing Sun
    School of Public Administration, Guangzhou University, Guangzhou, China.
  • Maofeng Wang
    Department of Biomedical Sciences Laboratory, Affiliated Dongyang Hospital of Wenzhou Medical University, Dongyang, Zhejiang, PR China.
  • Yun Wang
    Department of Anesthesiology, Beijing Friendship Hospital, Capital Medical University, Beijing, 100050, People's Republic of China.
  • Zheyu Yang
    Center for Brain-Inspired Computing Research (CBICR), Beijing Advanced Innovation Center for Integrated Circuits, Optical Memory National Engineering Research Center, & Department of Precision Instrument, Tsinghua University, 100084, Beijing, China.
  • Jianfeng Sang
    Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, PR China.
  • Xiaolei Wang
    Ministry of Education Key Laboratory of Pollution Processes and Environmental Criteria, Tianjin Key Laboratory of Environmental Remediation and Pollution Control, College of Environmental Science and Engineering of Nankai University, Tianjin 300350, China.
  • Fenyong Sun
    Department of Clinical Laboratory Medicine, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, PR China. sunfenyong@263.net.
  • Wei Cai
    Department of Gastrointestinal Surgery, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Xingcai Zhang
    John A. Paulson School of Engineering and Applied Science, Harvard University, Cambridge, MA 02138, USA.
  • Hui Lu
    Key Laboratory of the plateau of environmental damage control, Lanzhou General Hospital of Lanzhou Military Command, Lanzhou, China.