Developing a prognostic model using machine learning for disulfidptosis related lncRNA in lung adenocarcinoma.

Journal: Scientific reports
PMID:

Abstract

Disulfidptosis represents a novel cell death mechanism triggered by disulfide stress, with potential implications for advancements in cancer treatments. Although emerging evidence highlights the critical regulatory roles of long non-coding RNAs (lncRNAs) in the pathobiology of lung adenocarcinoma (LUAD), research into lncRNAs specifically associated with disulfidptosis in LUAD, termed disulfidptosis-related lncRNAs (DRLs), remains insufficiently explored. Using The Cancer Genome Atlas (TCGA)-LUAD dataset, we implemented ten machine learning techniques, resulting in 101 distinct model configurations. To assess the predictive accuracy of our model, we employed both the concordance index (C-index) and receiver operating characteristic (ROC) curve analyses. For a deeper understanding of the underlying biological pathways, we referred to the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) for functional enrichment analysis. Moreover, we explored differences in the tumor microenvironment between high-risk and low-risk patient cohorts. Additionally, we thoroughly assessed the prognostic value of the DRLs signatures in predicting treatment outcomes. The Kaplan-Meier (KM) survival analysis demonstrated a significant difference in overall survival (OS) between the high-risk and low-risk cohorts (p < 0.001). The prognostic model showed robust performance, with an area under the ROC curve exceeding 0.75 at one year and maintaining a value above 0.72 in the two and three-year follow-ups. Further research identified variations in tumor mutational burden (TMB) and differential responses to immunotherapies and chemotherapies. Our validation, using three GEO datasets (GSE31210, GSE30219, and GSE50081), revealed that the C-index exceeded 0.67 for GSE31210 and GSE30219. Significant differences in disease-free survival (DFS) and OS were observed across all validation cohorts among different risk groups. The prognostic model offers potential as a molecular biomarker for LUAD prognosis.

Authors

  • Yang Pan
    Department of Biochemistry and Molecular Medicine, George Washington University, Washington, DC 20037, USA, Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Center for Bioinformatics and Information Technology, National Cancer Institute, 9609 Medical Center Drive, Rockville, MD 20892-9760, USA, NASA Jet Propulsion Laboratory, Pasadena, CA, USA, Division of Cancer Prevention, National Cancer Institute, 9609 Medical Center Drive, Rockville, MD 20892-9760, USA, Wellcome Trust Sanger Institute, Cambridge, UK and McCormick Genomic and Proteomic Center, George Washington University, Washington, DC 20037, USA.
  • Xuanhong Jin
    Department of Medical Oncology, Sir Run Run Shaw Hospital, College of Medicine, Zhejiang University, Hangzhou, China.
  • Haoting Xu
    Department of Pulmonary Surgery, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, China.
  • Jiandong Hong
    Department of Pulmonary Surgery, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, China.
  • Feng Li
    Department of General Surgery, Shanghai Traditional Chinese Medicine (TCM)-INTEGRATED Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai, China.
  • Taobo Luo
    Department of Pulmonary Surgery, Zhejiang Cancer Hospital, Hangzhou Institute of Medicine (HIM), Chinese Academy of Sciences, Hangzhou, China. luotaobo@163.com.
  • Jian Zeng
    Longgang District Central Hospital of Shenzhen Shenzhen China.