Transfer learning prediction of type 2 diabetes with unpaired clinical and genetic data.

Journal: Scientific Reports

Published Date: Jul 29, 2025

Abstract

The prevalence of type 2 diabetes mellitus (T2DM) in Korea has risen in recent years, yet many cases remain undiagnosed. Advanced artificial intelligence models using multi-modal data have shown promise in disease prediction, but two major challenges persist: the scarcity of samples containing all desired data modalities and class imbalance in T2DM datasets. We propose a novel transfer learning framework to predict T2DM onset within five years, using two Korean cohorts (KoGES and SNUH). To utilize unpaired multi-modal data, our approach transfers knowledge between clinical and genetic domains, leveraging unpaired clinical data alongside paired data. We also address class imbalance by applying a positively weighted binary cross-entropy (BCE) loss and a weighted random sampler (WRS). The transfer learning framework improved T2DM prediction performance. Using WRS and weighted BCE loss increased the model's balanced accuracy and AUC (achieving test AUC 0.8441). Furthermore, combining transfer learning with intermediate data fusion yielded even higher performance (test AUC 0.8715). These enhancements were achieved despite limited paired multi-modal samples. Our framework effectively handles scarce paired data and class imbalance, leading to improved T2DM risk prediction. This approach can be adapted to other medical prediction tasks and integrated with additional data modalities, potentially aiding earlier diagnosis and better disease management in clinical settings.
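The two class-imbalance techniques named in the abstract can be illustrated with a minimal, standard-library-only sketch. This is not the authors' implementation: the abstract does not say which library or weight values were used, and the toy labels, the 10/90 class split, and the function names `weighted_bce` and `weighted_sample_indices` are assumptions made for illustration. In PyTorch, the analogous building blocks are `torch.nn.BCEWithLogitsLoss(pos_weight=...)` and `torch.utils.data.WeightedRandomSampler`.

```python
import math
import random
from collections import Counter


def weighted_bce(logits, labels, pos_weight):
    """Mean binary cross-entropy with positive-class terms scaled by pos_weight."""
    total = 0.0
    for z, y in zip(logits, labels):
        # Numerically stable log(sigmoid(z)) and log(1 - sigmoid(z)).
        if z >= 0:
            log_p = -math.log1p(math.exp(-z))
            log_not_p = -z - math.log1p(math.exp(-z))
        else:
            log_p = z - math.log1p(math.exp(z))
            log_not_p = -math.log1p(math.exp(z))
        total += -(pos_weight * y * log_p + (1 - y) * log_not_p)
    return total / len(labels)


def weighted_sample_indices(labels, num_samples, rng):
    """Draw indices with replacement, weighting each sample by 1 / (its class count)."""
    counts = Counter(labels)
    weights = [1.0 / counts[y] for y in labels]
    return rng.choices(range(len(labels)), weights=weights, k=num_samples)


# Hypothetical imbalanced labels: 10 positives (T2DM onset) and 90 negatives.
labels = [1] * 10 + [0] * 90

# A common heuristic (an assumption here): pos_weight = negatives / positives.
pos_weight = labels.count(0) / labels.count(1)

# Loss of an uninformative model (logit 0 for every sample).
loss = weighted_bce([0.0] * len(labels), labels, pos_weight)

# Resample so that each class is drawn with roughly equal probability.
rng = random.Random(0)
batch = weighted_sample_indices(labels, 32, rng)
print(f"pos_weight={pos_weight:.1f}, loss={loss:.4f}")
print("positives in batch:", sum(labels[i] for i in batch))
```

The positive weight makes each missed positive cost more in the loss, while the sampler changes how often each class appears in a mini-batch. The two act at different points in training, which is presumably why the abstract reports using them together.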

Authors

YounSung Jung

Department of Life Science, Handong Global University, Pohang, Republic of Korea.

SeanKyo Han

Department of Life Science, Handong Global University, Pohang, Republic of Korea.

Eunhee Kang

Soyoung Park

Department of Pediatric Dentistry, School of Dentistry, Pusan National University, 50612 Yangsan, Republic of Korea.

Minhee Kim

Biomedical Research Center, Korea University Ansan Hospital, Ansan-si, Gyeonggi-do, Republic of Korea.

NanHee Kim

Division of Endocrinology and Metabolism, Department of Internal Medicine, Korea University Ansan Hospital, Ansan, Republic of Korea. nhkendo@gmail.com.

TaeJin Ahn

Department of Life Sciences, Handong Global University, Pohang 37554, Korea.

Keywords

Artificial Intelligence; Diabetes Mellitus, Type 2; Female; Humans; Machine Learning; Male; Middle Aged; Republic of Korea

External Resources

PubMed ID (PMID): 40730802
