XGBoost-based risk prediction model for massive vehicle recalls using consumer complaints.

IF 3.3 3区 医学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
Risk Analysis Pub Date : 2025-05-29 DOI:10.1111/risa.70052
Yi-Na Li, Ming Jiang, Likun Wang, Jiuchang Wei
{"title":"XGBoost-based risk prediction model for massive vehicle recalls using consumer complaints.","authors":"Yi-Na Li, Ming Jiang, Likun Wang, Jiuchang Wei","doi":"10.1111/risa.70052","DOIUrl":null,"url":null,"abstract":"<p><p>This study employed the XGBoost model to conduct an in-depth analysis of consumer complaints and identified the key risk factors predicting vehicle recalls in the US market, providing valuable proactive risk management support for automakers and regulatory agencies. We leveraged the extensive data resources from National Highway Traffic Safety Administration to construct high-precision recall risk prediction models to predict the risk of recall. The models exhibited exceptional performance across different time windows, particularly maintaining a high level of area under the curve values over a prediction timespan of up to 18 months, demonstrating their predictive accuracy and stability. Our study contributes to risk management theory by addressing the challenges of integrating consumer complaints into predictive models for vehicle recall risk. While prior research has primarily focused on text mining of complaint content, our work systematically incorporates structured complaint data and recall records to enhance predictive accuracy. Also, our research distinguishes the indicators for the initial recall after launch to the market and the indicators for subsequent recalls, and bridges a critical gap in recall risk prediction at different stages of a vehicle's life cycle.</p>","PeriodicalId":21472,"journal":{"name":"Risk Analysis","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Risk Analysis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/risa.70052","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

This study employed the XGBoost model to conduct an in-depth analysis of consumer complaints and identified the key risk factors predicting vehicle recalls in the US market, providing valuable proactive risk management support for automakers and regulatory agencies. We leveraged the extensive data resources from National Highway Traffic Safety Administration to construct high-precision recall risk prediction models to predict the risk of recall. The models exhibited exceptional performance across different time windows, particularly maintaining a high level of area under the curve values over a prediction timespan of up to 18 months, demonstrating their predictive accuracy and stability. Our study contributes to risk management theory by addressing the challenges of integrating consumer complaints into predictive models for vehicle recall risk. While prior research has primarily focused on text mining of complaint content, our work systematically incorporates structured complaint data and recall records to enhance predictive accuracy. Also, our research distinguishes the indicators for the initial recall after launch to the market and the indicators for subsequent recalls, and bridges a critical gap in recall risk prediction at different stages of a vehicle's life cycle.

基于xgboost的大规模汽车召回消费者投诉风险预测模型
本研究采用XGBoost模型对消费者投诉进行了深入分析,并确定了预测美国市场汽车召回的关键风险因素,为汽车制造商和监管机构提供了有价值的前瞻性风险管理支持。我们利用美国国家公路交通安全管理局广泛的数据资源,构建了高精度的召回风险预测模型来预测召回风险。该模型在不同的时间窗口中表现出优异的性能,特别是在长达18个月的预测时间跨度内,曲线值下的面积保持较高水平,证明了其预测的准确性和稳定性。我们的研究通过解决将消费者投诉整合到汽车召回风险预测模型中的挑战,为风险管理理论做出了贡献。虽然之前的研究主要集中在投诉内容的文本挖掘上,但我们的工作系统地结合了结构化的投诉数据和召回记录,以提高预测的准确性。此外,我们的研究区分了上市后首次召回的指标和后续召回的指标,填补了汽车生命周期不同阶段召回风险预测的关键空白。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Risk Analysis
Risk Analysis 数学-数学跨学科应用
CiteScore
7.50
自引率
10.50%
发文量
183
审稿时长
4.2 months
期刊介绍: Published on behalf of the Society for Risk Analysis, Risk Analysis is ranked among the top 10 journals in the ISI Journal Citation Reports under the social sciences, mathematical methods category, and provides a focal point for new developments in the field of risk analysis. This international peer-reviewed journal is committed to publishing critical empirical research and commentaries dealing with risk issues. The topics covered include: • Human health and safety risks • Microbial risks • Engineering • Mathematical modeling • Risk characterization • Risk communication • Risk management and decision-making • Risk perception, acceptability, and ethics • Laws and regulatory policy • Ecological risks.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信