High Fuel Consumption Driving Behavior Causal Analysis Based on LightGBM and SHAP

IF 1.7 4区 工程技术 Q3 ENGINEERING, CIVIL
Hongru Liu, Shuyan Chen, Yongfeng Ma, Fengxiang Qiao, Qianqian Pang, Ziyu Zhang, Zhuopeng Xie
{"title":"High Fuel Consumption Driving Behavior Causal Analysis Based on LightGBM and SHAP","authors":"Hongru Liu, Shuyan Chen, Yongfeng Ma, Fengxiang Qiao, Qianqian Pang, Ziyu Zhang, Zhuopeng Xie","doi":"10.1007/s40996-024-01541-3","DOIUrl":null,"url":null,"abstract":"<p>Accurate identification of high fuel consumption driving behaviors provides good theoretical support for eco-driving training. To gain a deeper understanding of contributing factors and their impacts on fuel consumption, this study acquired a driving data set based on a driving simulator test and employed the light gradient-boosting machine (LightGBM) algorithm to identify driving behaviors related to high fuel consumption and the SHapley Additive exPlanations (SHAP) algorithm for causal analysis. The LightGBM algorithm learns the intrinsic connection between the input variable X and the output variable Y and examines the learning effect. The SHAP algorithm analyzes how the output variable changes with the input variable from different perspectives. First, the vehicle kinematics and fuel consumption data were collected and preprocessed. Secondly, the LightGBM algorithm was employed to classify fuel consumption levels, including low, medium, and high. Thirdly, several evaluation metrics, precision, recall, and F1-score, were used to evaluate the identification results comprehensively, whereas SVM and XGBoost algorithms were employed for comparison. The results show that the LightGBM algorithm significantly outperforms SVM and XGBoost algorithms in precision, recall, and F1-score, respectively. The results show that the LightGBM algorithm performs well in terms of precision, recall, and F1-score. Finally, the SHAP algorithm was used to interpret the influence of contributing factors on high fuel consumption from three perspectives, global interpretation, interaction interpretation, and individual interpretation. The SHAP algorithm can intuitively display the relationship between high fuel consumption and its contributing factors. Specifically, acceleration, speed, roll speed, pitch speed, and engine speed significantly increased the probability of high fuel consumption. This study proposed an efficient combined method for high fuel consumption identification and interpretation, which can reduce the occurrence of high fuel consumption driving behavior, thus achieving the purpose of eco-driving training.</p>","PeriodicalId":14550,"journal":{"name":"Iranian Journal of Science and Technology, Transactions of Civil Engineering","volume":null,"pages":null},"PeriodicalIF":1.7000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Iranian Journal of Science and Technology, Transactions of Civil Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s40996-024-01541-3","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate identification of high fuel consumption driving behaviors provides good theoretical support for eco-driving training. To gain a deeper understanding of contributing factors and their impacts on fuel consumption, this study acquired a driving data set based on a driving simulator test and employed the light gradient-boosting machine (LightGBM) algorithm to identify driving behaviors related to high fuel consumption and the SHapley Additive exPlanations (SHAP) algorithm for causal analysis. The LightGBM algorithm learns the intrinsic connection between the input variable X and the output variable Y and examines the learning effect. The SHAP algorithm analyzes how the output variable changes with the input variable from different perspectives. First, the vehicle kinematics and fuel consumption data were collected and preprocessed. Secondly, the LightGBM algorithm was employed to classify fuel consumption levels, including low, medium, and high. Thirdly, several evaluation metrics, precision, recall, and F1-score, were used to evaluate the identification results comprehensively, whereas SVM and XGBoost algorithms were employed for comparison. The results show that the LightGBM algorithm significantly outperforms SVM and XGBoost algorithms in precision, recall, and F1-score, respectively. The results show that the LightGBM algorithm performs well in terms of precision, recall, and F1-score. Finally, the SHAP algorithm was used to interpret the influence of contributing factors on high fuel consumption from three perspectives, global interpretation, interaction interpretation, and individual interpretation. The SHAP algorithm can intuitively display the relationship between high fuel consumption and its contributing factors. Specifically, acceleration, speed, roll speed, pitch speed, and engine speed significantly increased the probability of high fuel consumption. This study proposed an efficient combined method for high fuel consumption identification and interpretation, which can reduce the occurrence of high fuel consumption driving behavior, thus achieving the purpose of eco-driving training.

Abstract Image

基于 LightGBM 和 SHAP 的高油耗驾驶行为原因分析
准确识别高油耗驾驶行为为生态驾驶培训提供了良好的理论支持。为深入了解高油耗驾驶行为的诱因及其对油耗的影响,本研究基于驾驶模拟器测试获取了驾驶数据集,并采用轻梯度提升机(LightGBM)算法识别与高油耗相关的驾驶行为,同时采用SHAPLE Additive exPlanations(SHAP)算法进行因果分析。LightGBM 算法学习输入变量 X 和输出变量 Y 之间的内在联系,并检查学习效果。SHAP 算法从不同角度分析输出变量如何随输入变量变化。首先,收集并预处理车辆运动学和燃料消耗数据。其次,采用 LightGBM 算法对油耗水平进行分类,包括低、中和高。第三,采用精确度、召回率和 F1 分数等几个评价指标对识别结果进行综合评价,同时采用 SVM 和 XGBoost 算法进行比较。结果表明,LightGBM 算法的精确度、召回率和 F1 分数分别明显优于 SVM 算法和 XGBoost 算法。结果表明,LightGBM 算法在精确度、召回率和 F1 分数方面表现良好。最后,使用 SHAP 算法从全局解释、交互解释和个体解释三个角度解释了高油耗诱因的影响。SHAP 算法可以直观地显示高油耗与其促成因素之间的关系。具体来说,加速度、速度、滚转速度、俯仰速度和发动机转速会明显增加高油耗的概率。本研究提出了一种高效的高油耗识别和解释组合方法,可减少高油耗驾驶行为的发生,从而达到生态驾驶培训的目的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.30
自引率
11.80%
发文量
203
期刊介绍: The aim of the Iranian Journal of Science and Technology is to foster the growth of scientific research among Iranian engineers and scientists and to provide a medium by means of which the fruits of these researches may be brought to the attention of the world’s civil Engineering communities. This transaction focuses on all aspects of Civil Engineering and will accept the original research contributions (previously unpublished) from all areas of established engineering disciplines. The papers may be theoretical, experimental or both. The journal publishes original papers within the broad field of civil engineering which include, but are not limited to, the following: -Structural engineering- Earthquake engineering- Concrete engineering- Construction management- Steel structures- Engineering mechanics- Water resources engineering- Hydraulic engineering- Hydraulic structures- Environmental engineering- Soil mechanics- Foundation engineering- Geotechnical engineering- Transportation engineering- Surveying and geomatics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信