Variable admittance control via Reinforcement Learning: Enhancing UAV interactions across diverse platforms

IF 5.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yuting Feng, Tao Yang, Kaidi Wang, Jiali Sun, Yushu Yu
{"title":"Variable admittance control via Reinforcement Learning: Enhancing UAV interactions across diverse platforms","authors":"Yuting Feng,&nbsp;Tao Yang,&nbsp;Kaidi Wang,&nbsp;Jiali Sun,&nbsp;Yushu Yu","doi":"10.1016/j.neucom.2025.130667","DOIUrl":null,"url":null,"abstract":"<div><div>A compliant control model based on Reinforcement Learning (RL) is proposed to allow UAVs (Unmanned Aerial Vehicles) to interact with the environment more effectively and autonomously execute force control tasks. The model learns an optimal admittance adjustment policy for interaction and simultaneously optimizes energy consumption and trajectory tracking of the UAV state. This facilitates stable manipulation of UAVs in unknown environments with interaction forces. Furthermore, the model ensures safe, compliant, and flexible interaction while safeguarding the UAV’s external structures from damage. To assess the model performance, we validated the approach in a simulation environment using a UAV. The model was also tested across different UAV types and various low-level control parameters, demonstrating superior performance in all scenarios. Additionally, we applied this methodology to two distinct UAV types used in real-world applications. Empirical evidence shows that our proposed methods consistently achieve superior results. We also applied similar methodologies to verify 6D interaction in a simulation of a fully actuated platform consisting of three UAVs. Using a high-level training strategy, we evaluated the platform’s ability to slide along a bevel and achieve optimal results in our comparative experiments.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"648 ","pages":"Article 130667"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225013396","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

A compliant control model based on Reinforcement Learning (RL) is proposed to allow UAVs (Unmanned Aerial Vehicles) to interact with the environment more effectively and autonomously execute force control tasks. The model learns an optimal admittance adjustment policy for interaction and simultaneously optimizes energy consumption and trajectory tracking of the UAV state. This facilitates stable manipulation of UAVs in unknown environments with interaction forces. Furthermore, the model ensures safe, compliant, and flexible interaction while safeguarding the UAV’s external structures from damage. To assess the model performance, we validated the approach in a simulation environment using a UAV. The model was also tested across different UAV types and various low-level control parameters, demonstrating superior performance in all scenarios. Additionally, we applied this methodology to two distinct UAV types used in real-world applications. Empirical evidence shows that our proposed methods consistently achieve superior results. We also applied similar methodologies to verify 6D interaction in a simulation of a fully actuated platform consisting of three UAVs. Using a high-level training strategy, we evaluated the platform’s ability to slide along a bevel and achieve optimal results in our comparative experiments.
通过强化学习的可变导纳控制:增强无人机在不同平台上的交互
提出了一种基于强化学习(RL)的柔性控制模型,使无人机能够更有效地与环境交互并自主执行力控制任务。该模型学习最优导纳调整策略进行交互,同时优化无人机状态的能耗和轨迹跟踪。这有利于无人机在未知环境中与相互作用力的稳定操纵。此外,该模型确保了安全、兼容和灵活的交互,同时保护无人机的外部结构免受损坏。为了评估模型的性能,我们在使用无人机的仿真环境中验证了该方法。该模型还在不同的无人机类型和各种低级控制参数上进行了测试,在所有场景中展示了优越的性能。此外,我们将此方法应用于实际应用中使用的两种不同类型的无人机。经验证据表明,我们提出的方法一贯取得优异的结果。我们还应用了类似的方法来验证由三架无人机组成的完全驱动平台的模拟中的6D交互。使用高水平的训练策略,我们评估了平台沿斜面滑动的能力,并在我们的比较实验中获得了最佳结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信