{"title":"High Fuel Consumption Driving Behavior Causal Analysis Based on LightGBM and SHAP","authors":"Hongru Liu, Shuyan Chen, Yongfeng Ma, Fengxiang Qiao, Qianqian Pang, Ziyu Zhang, Zhuopeng Xie","doi":"10.1007/s40996-024-01541-3","DOIUrl":null,"url":null,"abstract":"<p>Accurate identification of high fuel consumption driving behaviors provides good theoretical support for eco-driving training. To gain a deeper understanding of contributing factors and their impacts on fuel consumption, this study acquired a driving data set based on a driving simulator test and employed the light gradient-boosting machine (LightGBM) algorithm to identify driving behaviors related to high fuel consumption and the SHapley Additive exPlanations (SHAP) algorithm for causal analysis. The LightGBM algorithm learns the intrinsic connection between the input variable X and the output variable Y and examines the learning effect. The SHAP algorithm analyzes how the output variable changes with the input variable from different perspectives. First, the vehicle kinematics and fuel consumption data were collected and preprocessed. Secondly, the LightGBM algorithm was employed to classify fuel consumption levels, including low, medium, and high. Thirdly, several evaluation metrics, precision, recall, and F1-score, were used to evaluate the identification results comprehensively, whereas SVM and XGBoost algorithms were employed for comparison. The results show that the LightGBM algorithm significantly outperforms SVM and XGBoost algorithms in precision, recall, and F1-score, respectively. The results show that the LightGBM algorithm performs well in terms of precision, recall, and F1-score. Finally, the SHAP algorithm was used to interpret the influence of contributing factors on high fuel consumption from three perspectives, global interpretation, interaction interpretation, and individual interpretation. The SHAP algorithm can intuitively display the relationship between high fuel consumption and its contributing factors. Specifically, acceleration, speed, roll speed, pitch speed, and engine speed significantly increased the probability of high fuel consumption. This study proposed an efficient combined method for high fuel consumption identification and interpretation, which can reduce the occurrence of high fuel consumption driving behavior, thus achieving the purpose of eco-driving training.</p>","PeriodicalId":14550,"journal":{"name":"Iranian Journal of Science and Technology, Transactions of Civil Engineering","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Iranian Journal of Science and Technology, Transactions of Civil Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s40996-024-01541-3","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate identification of high fuel consumption driving behaviors provides good theoretical support for eco-driving training. To gain a deeper understanding of contributing factors and their impacts on fuel consumption, this study acquired a driving data set based on a driving simulator test and employed the light gradient-boosting machine (LightGBM) algorithm to identify driving behaviors related to high fuel consumption and the SHapley Additive exPlanations (SHAP) algorithm for causal analysis. The LightGBM algorithm learns the intrinsic connection between the input variable X and the output variable Y and examines the learning effect. The SHAP algorithm analyzes how the output variable changes with the input variable from different perspectives. First, the vehicle kinematics and fuel consumption data were collected and preprocessed. Secondly, the LightGBM algorithm was employed to classify fuel consumption levels, including low, medium, and high. Thirdly, several evaluation metrics, precision, recall, and F1-score, were used to evaluate the identification results comprehensively, whereas SVM and XGBoost algorithms were employed for comparison. The results show that the LightGBM algorithm significantly outperforms SVM and XGBoost algorithms in precision, recall, and F1-score, respectively. The results show that the LightGBM algorithm performs well in terms of precision, recall, and F1-score. Finally, the SHAP algorithm was used to interpret the influence of contributing factors on high fuel consumption from three perspectives, global interpretation, interaction interpretation, and individual interpretation. The SHAP algorithm can intuitively display the relationship between high fuel consumption and its contributing factors. Specifically, acceleration, speed, roll speed, pitch speed, and engine speed significantly increased the probability of high fuel consumption. This study proposed an efficient combined method for high fuel consumption identification and interpretation, which can reduce the occurrence of high fuel consumption driving behavior, thus achieving the purpose of eco-driving training.
期刊介绍:
The aim of the Iranian Journal of Science and Technology is to foster the growth of scientific research among Iranian engineers and scientists and to provide a medium by means of which the fruits of these researches may be brought to the attention of the world’s civil Engineering communities. This transaction focuses on all aspects of Civil Engineering
and will accept the original research contributions (previously unpublished) from all areas of established engineering disciplines. The papers may be theoretical, experimental or both. The journal publishes original papers within the broad field of civil engineering which include, but are not limited to, the following:
-Structural engineering-
Earthquake engineering-
Concrete engineering-
Construction management-
Steel structures-
Engineering mechanics-
Water resources engineering-
Hydraulic engineering-
Hydraulic structures-
Environmental engineering-
Soil mechanics-
Foundation engineering-
Geotechnical engineering-
Transportation engineering-
Surveying and geomatics.