Galactose-Induced Cataracts in Rats: A Machine Learning Analysis.
Journal:
International journal of medical sciences
PMID:
40027191
Abstract
Rat models are widely used to study cataracts due to their cost-effectiveness and prominent physiological and genetic similarities to humans The objective of this study was to identify genes involved in cataractogenesis due to galactose exposure in rats. We analyzed four datasets from the Gene Expression Omnibus, including both and models of cataracts in different rat strains. Feature selection tools were used to identify genes potentially relevant in cataract-related gene expression. A decision tree algorithm was implemented, and its predictions were interpreted using SHAP and LIME. To validate gene expression levels, PCR was conducted on six rat lenses cultured in M199 medium and galactose to induce cataract and six lenses cultured in M199 alone. Using feature selection tools, four key genes-PLAGL2, CMTM7, PCYT1B, and NR1D2-were identified. Only PCYT1B was significantly differentially expressed between the cataract and control groups across analyzed datasets. The model showed strong predictive performance, particularly in datasets. SHAP and LIME analyses revealed that CMTM7 had the largest impact on model predictions. PCR results did not show significant differences in gene expression between the cataract and control groups. The decision tree model trained on an dataset could predict and cataracts despite no significant gene expression differences found between the cataract and control groups. Given a small number of samples, larger studies are needed to validate our findings.