Multilevel predictors categorization for post-CABG atrial fibrillation prediction

Journal: medRxiv
Published Date:

Abstract

Postoperative atrial fibrillation (PoAF) is known as common coronary artery bypass grafting (CABG) complication. Despite its association with increased risk of ischemic stroke, bleeding, acute renal failure and mortality there is still no ideal predictive tool with proper clinical interpretability. A retrospective single-center cohort study enrolled 1305 electronic medical records of patients with elective isolated CABG. PoAF was identified in 280 (21.5%) patients. Prognostic models with continuous variables were developed utilizing multivariate logistic regression (MLR), random forest and eXtreme gradient boosting methods. Predictors were dichotomized via grid search for optimal cut-off points, centroid calculation, and Shapley additive explanation (SHAP). For multilevel categorization, we proposed to use threshold values combination identified during dichotomization, as well as ranking cut-off thresholds by MLR weighting coefficients (multimetric categorization method). Based on multistage selection, nine PoAF predictors were identified and validated. After categorization, prognostic models with continuous and multilevel categorical variables were developed. Multilevel categorical models advantage lies in their ability to explain PoAF prediction results and provide clinical interpretation, with comparable quality (AUC: 0.802 and 0.795). Multilevel predictor categorization shows promise for PoAF predictions explanation, with the developed models demonstrating high accuracy and transparency in their conclusions. Validation of new 1st diagnosed atrial fibrillation predictors were performed in patients with coronary heart disease after coronary artery bypass grafting with subsequent development of predictive models utilizing machine learning methods. A new multilevel categorization method was tested, allowing to identify threshold values of predictors with the greatest predictive value, which were classified as risk factors for postoperative atrial fibrillation. The best quality metrics (AUC - 0.802) were demonstrated by a stochastic gradient boosting prognostic model based on predictors identified by the multilevel categorization method.

Authors

  • Karina I. Shakhgeldyan; Vladislav Y. Rublev; Nikita S. Kuksin; Boris I. Geltser; Regina L. Pak