Enabling CMF estimation in data-constrained scenarios: A semantic-encoding knowledge mining model.

Journal: Accident; analysis and prevention
Published Date:

Abstract

Availability of more accurate Crash Modification Factors (CMFs) is crucial for evaluating the effectiveness of various road safety treatments and prioritizing infrastructure investment accordingly. While customized study for each countermeasure scenario is desired, the conventional CMF estimation approaches rely heavily on the availability of crash data at specific sites. This dependency may hinder the development of CMFs when it is impractical to collect data for recent implementations. Additionally, the transferability of CMF knowledge faces challenges, as the intrinsic similarities between different safety countermeasure scenarios are not fully explored. Aiming to fill these gaps, this study introduces a novel knowledge-mining framework for CMF prediction. This framework delves into the connections of existing countermeasure scenarios and reduces the reliance of CMF estimation on crash data availability and manual data collection. Specifically, it draws inspiration from human comprehension processes and introduces advanced Natural Language Processing (NLP) techniques to extract intricate variations and patterns from existing CMF knowledge. It effectively encodes unstructured countermeasure scenarios into machine-readable representations and models the complex relationships between scenarios and CMF values. This new data-driven framework provides a cost-effective and adaptable solution that complements the case-specific approaches for CMF estimation, which is particularly beneficial when availability of crash data imposes constraints. Experimental validation using real-world CMF Clearinghouse data demonstrates the effectiveness of this new approach, which shows significant accuracy improvements compared to the baseline methods. This approach provides insights into new possibilities of harnessing accumulated transportation knowledge in various applications.

Authors

  • Yanlin Qi
    Institute of Transportation Studies, University of California, Davis, CA 95616, USA.
  • Jia Li
    Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan Tsuihang New District, Guangdong, 528400, PR China; School of Pharmacy, Zunyi Medical University, Zunyi, 563000, PR China; National Center for Drug Screening, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, PR China.
  • Michael Zhang
    Thayer School of Engineering at Dartmouth College Hanover NH USA john.zhang@dartmouth.edu.