Variable admittance control via Reinforcement Learning: Enhancing UAV interactions across diverse platforms

IF 5.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-06-18 DOI:10.1016/j.neucom.2025.130667

Yuting Feng, Tao Yang, Kaidi Wang, Jiali Sun, Yushu Yu

{"title":"Variable admittance control via Reinforcement Learning: Enhancing UAV interactions across diverse platforms","authors":"Yuting Feng, Tao Yang, Kaidi Wang, Jiali Sun, Yushu Yu","doi":"10.1016/j.neucom.2025.130667","DOIUrl":null,"url":null,"abstract":"<div><div>A compliant control model based on Reinforcement Learning (RL) is proposed to allow UAVs (Unmanned Aerial Vehicles) to interact with the environment more effectively and autonomously execute force control tasks. The model learns an optimal admittance adjustment policy for interaction and simultaneously optimizes energy consumption and trajectory tracking of the UAV state. This facilitates stable manipulation of UAVs in unknown environments with interaction forces. Furthermore, the model ensures safe, compliant, and flexible interaction while safeguarding the UAV’s external structures from damage. To assess the model performance, we validated the approach in a simulation environment using a UAV. The model was also tested across different UAV types and various low-level control parameters, demonstrating superior performance in all scenarios. Additionally, we applied this methodology to two distinct UAV types used in real-world applications. Empirical evidence shows that our proposed methods consistently achieve superior results. We also applied similar methodologies to verify 6D interaction in a simulation of a fully actuated platform consisting of three UAVs. Using a high-level training strategy, we evaluated the platform’s ability to slide along a bevel and achieve optimal results in our comparative experiments.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"648 ","pages":"Article 130667"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225013396","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

A compliant control model based on Reinforcement Learning (RL) is proposed to allow UAVs (Unmanned Aerial Vehicles) to interact with the environment more effectively and autonomously execute force control tasks. The model learns an optimal admittance adjustment policy for interaction and simultaneously optimizes energy consumption and trajectory tracking of the UAV state. This facilitates stable manipulation of UAVs in unknown environments with interaction forces. Furthermore, the model ensures safe, compliant, and flexible interaction while safeguarding the UAV’s external structures from damage. To assess the model performance, we validated the approach in a simulation environment using a UAV. The model was also tested across different UAV types and various low-level control parameters, demonstrating superior performance in all scenarios. Additionally, we applied this methodology to two distinct UAV types used in real-world applications. Empirical evidence shows that our proposed methods consistently achieve superior results. We also applied similar methodologies to verify 6D interaction in a simulation of a fully actuated platform consisting of three UAVs. Using a high-level training strategy, we evaluated the platform’s ability to slide along a bevel and achieve optimal results in our comparative experiments.

查看原文本刊更多论文

通过强化学习的可变导纳控制：增强无人机在不同平台上的交互

提出了一种基于强化学习（RL）的柔性控制模型，使无人机能够更有效地与环境交互并自主执行力控制任务。该模型学习最优导纳调整策略进行交互，同时优化无人机状态的能耗和轨迹跟踪。这有利于无人机在未知环境中与相互作用力的稳定操纵。此外，该模型确保了安全、兼容和灵活的交互，同时保护无人机的外部结构免受损坏。为了评估模型的性能，我们在使用无人机的仿真环境中验证了该方法。该模型还在不同的无人机类型和各种低级控制参数上进行了测试，在所有场景中展示了优越的性能。此外，我们将此方法应用于实际应用中使用的两种不同类型的无人机。经验证据表明，我们提出的方法一贯取得优异的结果。我们还应用了类似的方法来验证由三架无人机组成的完全驱动平台的模拟中的6D交互。使用高水平的训练策略，我们评估了平台沿斜面滑动的能力，并在我们的比较实验中获得了最佳结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.