主动悬架系统的机制-数据驱动控制策略:将深度强化学习与微分几何相结合以提高车辆的乘坐舒适性

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Cheng Wang, Guanyu Tao, Xiaoxian Cui, Quan Yao, Xinran Zhou, Konghui Guo
{"title":"主动悬架系统的机制-数据驱动控制策略:将深度强化学习与微分几何相结合以提高车辆的乘坐舒适性","authors":"Cheng Wang,&nbsp;Guanyu Tao,&nbsp;Xiaoxian Cui,&nbsp;Quan Yao,&nbsp;Xinran Zhou,&nbsp;Konghui Guo","doi":"10.1016/j.aei.2025.103326","DOIUrl":null,"url":null,"abstract":"<div><div>Long-standing research on vehicle comfort optimization has centered on active and semi-active suspension control using experience-based or optimization-based algorithms. However, these methods often require substantial engineering resources and pose challenges in acquiring theoretical knowledge. The emergence of advanced Artificial Intelligence (AI), particularly data-driven approaches, has transformed how engineers tackle knowledge-intensive tasks like suspension control. Yet, the interpretability challenges of data-driven methods limit their widespread use in engineering. This study proposes a mechanism-data-driven active suspension control strategy that integrates Differential Geometry (DG) and Deep Reinforcement Learning (DRL) to achieve theoretical fusion of mechanism and data models. A DRL control architecture (DGRL) based on DG theory is introduced, enabling mechanism-level analysis of suspension control and dividing the control strategy into mechanism and data models. For the data model, a DRL optimal control framework is constructed, incorporating the Twin-Delayed Deep Deterministic policy (TD3) with an expert-guided soft-hard module (TD3-SH) and the Deterministic Experience Tracing (DET) mechanism. This effectively explores and utilizes the knowledge in massive data. Simulation results show that the DGRL strategy outperforms baseline algorithms such as Deep Deterministic Policy Gradient (DDPG), TD3, Linear Quadratic Regulator (LQR), Model Predictive Control (MPC), and TD3-SH by 75.8%, 65.5%, 77.5%, 56.3%, and 46.5%, respectively. In complex environments with varying road features and considering the domain randomization of the suspension system, the DGRL strategy can improve ride comfort by up to 85%, demonstrating its robustness and significant potential for widespread application in industrial and real-world scenarios.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103326"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Mechanism-data-driven control strategy for active suspension systems: Integrating deep reinforcement learning with differential geometry to enhance vehicle ride comfort\",\"authors\":\"Cheng Wang,&nbsp;Guanyu Tao,&nbsp;Xiaoxian Cui,&nbsp;Quan Yao,&nbsp;Xinran Zhou,&nbsp;Konghui Guo\",\"doi\":\"10.1016/j.aei.2025.103326\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Long-standing research on vehicle comfort optimization has centered on active and semi-active suspension control using experience-based or optimization-based algorithms. However, these methods often require substantial engineering resources and pose challenges in acquiring theoretical knowledge. The emergence of advanced Artificial Intelligence (AI), particularly data-driven approaches, has transformed how engineers tackle knowledge-intensive tasks like suspension control. Yet, the interpretability challenges of data-driven methods limit their widespread use in engineering. This study proposes a mechanism-data-driven active suspension control strategy that integrates Differential Geometry (DG) and Deep Reinforcement Learning (DRL) to achieve theoretical fusion of mechanism and data models. A DRL control architecture (DGRL) based on DG theory is introduced, enabling mechanism-level analysis of suspension control and dividing the control strategy into mechanism and data models. For the data model, a DRL optimal control framework is constructed, incorporating the Twin-Delayed Deep Deterministic policy (TD3) with an expert-guided soft-hard module (TD3-SH) and the Deterministic Experience Tracing (DET) mechanism. This effectively explores and utilizes the knowledge in massive data. Simulation results show that the DGRL strategy outperforms baseline algorithms such as Deep Deterministic Policy Gradient (DDPG), TD3, Linear Quadratic Regulator (LQR), Model Predictive Control (MPC), and TD3-SH by 75.8%, 65.5%, 77.5%, 56.3%, and 46.5%, respectively. In complex environments with varying road features and considering the domain randomization of the suspension system, the DGRL strategy can improve ride comfort by up to 85%, demonstrating its robustness and significant potential for widespread application in industrial and real-world scenarios.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"65 \",\"pages\":\"Article 103326\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034625002198\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002198","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

长期以来对车辆舒适性优化的研究主要集中在基于经验或基于优化算法的主动和半主动悬架控制上。然而,这些方法往往需要大量的工程资源,并且在获取理论知识方面存在挑战。先进人工智能(AI)的出现,特别是数据驱动的方法,已经改变了工程师处理悬挂控制等知识密集型任务的方式。然而,数据驱动方法的可解释性挑战限制了它们在工程中的广泛应用。本研究提出了一种机制-数据驱动的主动悬架控制策略,该策略将微分几何(DG)和深度强化学习(DRL)相结合,实现了机制模型和数据模型的理论融合。提出了一种基于DG理论的DRL控制体系结构(DGRL),实现了悬架控制的机构级分析,并将控制策略划分为机制模型和数据模型。针对数据模型,构建了包含专家引导软硬模块(TD3- sh)的双延迟深度确定性策略(TD3)和确定性经验跟踪(DET)机制的DRL最优控制框架。这有效地挖掘和利用了海量数据中的知识。仿真结果表明,DGRL策略分别优于深度确定性策略梯度(DDPG)、TD3、线性二次型调节器(LQR)、模型预测控制(MPC)和TD3- sh等基准算法75.8%、65.5%、77.5%、56.3%和46.5%。在具有不同道路特征的复杂环境中,考虑到悬挂系统的域随机化,DGRL策略可以将乘坐舒适性提高高达85%,显示出其稳健性和在工业和现实场景中广泛应用的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Mechanism-data-driven control strategy for active suspension systems: Integrating deep reinforcement learning with differential geometry to enhance vehicle ride comfort
Long-standing research on vehicle comfort optimization has centered on active and semi-active suspension control using experience-based or optimization-based algorithms. However, these methods often require substantial engineering resources and pose challenges in acquiring theoretical knowledge. The emergence of advanced Artificial Intelligence (AI), particularly data-driven approaches, has transformed how engineers tackle knowledge-intensive tasks like suspension control. Yet, the interpretability challenges of data-driven methods limit their widespread use in engineering. This study proposes a mechanism-data-driven active suspension control strategy that integrates Differential Geometry (DG) and Deep Reinforcement Learning (DRL) to achieve theoretical fusion of mechanism and data models. A DRL control architecture (DGRL) based on DG theory is introduced, enabling mechanism-level analysis of suspension control and dividing the control strategy into mechanism and data models. For the data model, a DRL optimal control framework is constructed, incorporating the Twin-Delayed Deep Deterministic policy (TD3) with an expert-guided soft-hard module (TD3-SH) and the Deterministic Experience Tracing (DET) mechanism. This effectively explores and utilizes the knowledge in massive data. Simulation results show that the DGRL strategy outperforms baseline algorithms such as Deep Deterministic Policy Gradient (DDPG), TD3, Linear Quadratic Regulator (LQR), Model Predictive Control (MPC), and TD3-SH by 75.8%, 65.5%, 77.5%, 56.3%, and 46.5%, respectively. In complex environments with varying road features and considering the domain randomization of the suspension system, the DGRL strategy can improve ride comfort by up to 85%, demonstrating its robustness and significant potential for widespread application in industrial and real-world scenarios.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信