A framework for real-time traffic risk prediction incorporating cost-sensitive learning and dynamic thresholds.
Journal:
Accident; analysis and prevention
Published Date:
May 5, 2025
Abstract
In recent years, researchers have explored an innovative approach that leverages real vehicle trajectory data to simultaneously derive traffic state and risk level for real-time risk prediction, which is crucial for traffic safety. However, existing studies largely overlook the costs associated with incorrect predictions and the varying consequences of different misclassifications, which undermines the reliability of the obtained prediction results. To address these gaps, this study refined traffic risk classification into four levels (i.e., no, low, medium, and high risks) and incorporated misclassification costs into the prediction process through cost-sensitive learning (CSL). Furthermore, considering that multi-class prediction tasks often face performance degradation and increased risk level granularity worsens class imbalance, further amplifying this degradation, this study introduced dynamic thresholds (DTs) to improve model performance. The aforementioned cost coefficients and thresholds were pinpointed using a genetic algorithm (GA). Furthermore, the employed data, comprising variables related to traffic state and associated risk data, were sourced from the HighD dataset. Subsequently, CSL-DTs-based models were built by integrating CSL and DTs with four distinct baseline machine/deep learning models, and the prediction performance (e.g., precision) and computation time of these models were compared. Results show that, compared to the corresponding baseline models, the proposed models perform better for multi-class prediction tasks. Additionally, the computation time of the CSL-DTs-based models is found to be acceptable for real-time prediction purposes. Finally, to ensure the reliability of the results obtained through the GA optimization (e.g., avoiding local optima), convergence curves were plotted, confirming the robustness of the optimization process. A robustness analysis also demonstrates that the models are highly stable under slight perturbations of cost coefficients and thresholds, with minimal impact on performance. Findings of this study are expected to enhance the reliability of real-time traffic risk prediction, holding the promise of significantly promoting proactive traffic safety management.