Jiming Xie , Yaqin Qin , Yan Zhang , Jianhua Li , Tianshun Chen , Xiaohua Zhao , Yulan Xia
{"title":"Could vehicles analyze driving risks using human fuzzy semantic logic? A data-knowledge-driven new perspective","authors":"Jiming Xie , Yaqin Qin , Yan Zhang , Jianhua Li , Tianshun Chen , Xiaohua Zhao , Yulan Xia","doi":"10.1016/j.aap.2025.108037","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate risk identification is crucial for ensuring the safe operation of Host vehicles (HoVs) in environments shared with Neighboring vehicles (NeVs). Traditional risk identification mechanisms typically rely on large amounts of precise numerical data, making it difficult to comprehensively and accurately identify traffic crash risks under conditions of imperfect data associated with fuzzy information. However, human drivers rely on knowledge-driven, subjective assessments using fuzzy descriptors like distance and speed semantics to evaluate driving risk. These insights provide significant value for addressing the limitations of precise data-driven methods. This study proposes a novel traffic crash risk analysis framework called Token Tree Generation and Parsing (TTGP). It integrates knowledge-driven insights from human drivers with data-driven methods. TTGP includes the Token Tree Generation Module (Module 1) and the Token Tree Parsing Module (Module 2). In Module 1, we apply the token-tree-of-thoughts method to transform natural language traffic regulations and vehicles’ traffic behaviors and attribute parameters into token tree based on semantic rules. This module simulates the generation of human fuzzy semantics in traffic scenarios. In Module 2, we integrate three encoders and decoders to extract traffic crash risk semantic features and identify traffic crash risk level from the digitized token tree. Experiments in the highway and urban expressway interweaving areas demonstrate that TTGP can accurately analyze risk using imprecise data. The TTGP performs better than traditional methods such as Tree, Naïve Bayes, RUSBoost and Efficient Logistic Regression models. This study significantly enhances the flexibility, generalization, and reliability of risk assessment. It bridges the gap in how HoVs handle fuzzy information in risk analysis.</div></div>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"218 ","pages":"Article 108037"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S000145752500123X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate risk identification is crucial for ensuring the safe operation of Host vehicles (HoVs) in environments shared with Neighboring vehicles (NeVs). Traditional risk identification mechanisms typically rely on large amounts of precise numerical data, making it difficult to comprehensively and accurately identify traffic crash risks under conditions of imperfect data associated with fuzzy information. However, human drivers rely on knowledge-driven, subjective assessments using fuzzy descriptors like distance and speed semantics to evaluate driving risk. These insights provide significant value for addressing the limitations of precise data-driven methods. This study proposes a novel traffic crash risk analysis framework called Token Tree Generation and Parsing (TTGP). It integrates knowledge-driven insights from human drivers with data-driven methods. TTGP includes the Token Tree Generation Module (Module 1) and the Token Tree Parsing Module (Module 2). In Module 1, we apply the token-tree-of-thoughts method to transform natural language traffic regulations and vehicles’ traffic behaviors and attribute parameters into token tree based on semantic rules. This module simulates the generation of human fuzzy semantics in traffic scenarios. In Module 2, we integrate three encoders and decoders to extract traffic crash risk semantic features and identify traffic crash risk level from the digitized token tree. Experiments in the highway and urban expressway interweaving areas demonstrate that TTGP can accurately analyze risk using imprecise data. The TTGP performs better than traditional methods such as Tree, Naïve Bayes, RUSBoost and Efficient Logistic Regression models. This study significantly enhances the flexibility, generalization, and reliability of risk assessment. It bridges the gap in how HoVs handle fuzzy information in risk analysis.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.