Integrating machine learning and bioinformatics approaches to identify novel diagnostic gene biomarkers for diabetic mice.
Journal:
Scientific reports
Published Date:
Jul 2, 2025
Abstract
Diabetes is a complex metabolic disorder, and its pathogenesis involves the interplay of genetic, environmental factors, and lifestyle choices. With the rising prevalence and increasing associated chronic complications, identifying and understanding the molecular mechanisms of diabetes has become an important direction in bioinformatics research. The aim of this study is the identification of diagnostic genes associated with streptozotocin (STZ)-induced diabetic mice. GSE179717 and GSE179718 gene expression datasets were downloaded from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) were identified by screening diabetic mice and controls. A total of 45 overlapping genes were recognized as potential DEGs across the two datasets. Next, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed on the overlapping genes. Protein-protein interaction (PPI) networks were constructed to identify hub genes in the DEGs. After the application of Least absolute shrinkage and selection operator (LASSO), Random Forest analysis, three genes (Scd1, Sirt1 and Hmgb1) were identified as diagnostic genes. Real-time quantitative polymerase chain reaction (RT-qPCR) was supplied to detect the expressions of diagnostic genes in aged diabetic mice. In conclusion, the results of this study suggest that focusing on these genes may provide new targets for the diagnosis and treatment of diabetes.