Design of imitation learning fusion algorithm for mobile robotic arm control

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-07-16 DOI:10.1016/j.asoc.2025.113628

Yang Wang , Xiaoling Yan , Liming Wang , Wei Pan , Fei Song , Xinlei Zhou

{"title":"Design of imitation learning fusion algorithm for mobile robotic arm control","authors":"Yang Wang , Xiaoling Yan , Liming Wang , Wei Pan , Fei Song , Xinlei Zhou","doi":"10.1016/j.asoc.2025.113628","DOIUrl":null,"url":null,"abstract":"<div><div>Despite the considerable potential of artificial intelligence technology in industrial applications, it still faces challenges such as high data requirements, limited generalization capabilities, and concerns with safety and stability. These issues become particularly prominent in the execution of precision operations such as opening button-lock cabinet doors during robotic inspection tasks. The tasks involve complex environmental perception, dynamically changing operational settings, high stability requirements, and the challenge of generalization, all of which necessitate the integration of multiple advanced algorithms into a fusion system design. This research focuses on the system design and algorithm integration challenges for mobile robotic arms in inspection tasks, proposing a Deep Meta-Imitation Learning (DMIL) algorithm that combines deep meta-learning with imitation learning (IL) to enhance the adaptability and efficiency of mobile robotic arms in such tasks. Expert trajectories were generated and analyzed in the CoppeliaSim simulation environment, and actual interactions were evaluated. The study employs a deep learning-based 6D pose estimation method to determine the position and orientation of button locks, with visual recognition of key operational reference points. In the imitation learning phase, the operational strategies of the robotic arm are enhanced by combining Adversarial Inverse Reinforcement Learning (AIRL) and Variable Impedance Control (VIC) technologies, supported by expert-guided trajectories and force feedback data from real-world environments. Additionally, a Latent Embedding Optimization (LEO) module is introduced into the deep meta-learning framework, enabling the model to quickly adapt to new tasks, significantly improving its generalization ability. Experiments were conducted in the CoppeliaSim simulation environment and on a mobile robotic arm platform, focusing on the recognition process, trajectory planning, and compliance control management. To assess the algorithm's effectiveness, three button-lock cabinet door-opening tasks of varying difficulty were executed. The experimental results demonstrate that the mobile robotic arm was able to accurately locate and compliantly open multiple button locks, showcasing the practicality and feasibility of this approach in advancing robotic precision operations in inspection tasks.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"182 ","pages":"Article 113628"},"PeriodicalIF":6.6000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625009391","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Despite the considerable potential of artificial intelligence technology in industrial applications, it still faces challenges such as high data requirements, limited generalization capabilities, and concerns with safety and stability. These issues become particularly prominent in the execution of precision operations such as opening button-lock cabinet doors during robotic inspection tasks. The tasks involve complex environmental perception, dynamically changing operational settings, high stability requirements, and the challenge of generalization, all of which necessitate the integration of multiple advanced algorithms into a fusion system design. This research focuses on the system design and algorithm integration challenges for mobile robotic arms in inspection tasks, proposing a Deep Meta-Imitation Learning (DMIL) algorithm that combines deep meta-learning with imitation learning (IL) to enhance the adaptability and efficiency of mobile robotic arms in such tasks. Expert trajectories were generated and analyzed in the CoppeliaSim simulation environment, and actual interactions were evaluated. The study employs a deep learning-based 6D pose estimation method to determine the position and orientation of button locks, with visual recognition of key operational reference points. In the imitation learning phase, the operational strategies of the robotic arm are enhanced by combining Adversarial Inverse Reinforcement Learning (AIRL) and Variable Impedance Control (VIC) technologies, supported by expert-guided trajectories and force feedback data from real-world environments. Additionally, a Latent Embedding Optimization (LEO) module is introduced into the deep meta-learning framework, enabling the model to quickly adapt to new tasks, significantly improving its generalization ability. Experiments were conducted in the CoppeliaSim simulation environment and on a mobile robotic arm platform, focusing on the recognition process, trajectory planning, and compliance control management. To assess the algorithm's effectiveness, three button-lock cabinet door-opening tasks of varying difficulty were executed. The experimental results demonstrate that the mobile robotic arm was able to accurately locate and compliantly open multiple button locks, showcasing the practicality and feasibility of this approach in advancing robotic precision operations in inspection tasks.

查看原文本刊更多论文

移动机械臂控制的模仿学习融合算法设计

尽管人工智能技术在工业应用中具有相当大的潜力，但它仍然面临着数据要求高、泛化能力有限、安全性和稳定性等挑战。这些问题在执行精确操作时变得尤为突出，例如在机器人检查任务期间打开按钮锁柜门。这些任务涉及复杂的环境感知、动态变化的操作设置、高稳定性要求和泛化挑战，所有这些都需要将多种先进算法集成到融合系统设计中。本研究针对移动机械臂在检测任务中的系统设计和算法集成挑战，提出了一种将深度元学习与模仿学习相结合的深度元模仿学习（DMIL）算法，以提高移动机械臂在检测任务中的适应性和效率。在CoppeliaSim仿真环境中生成专家轨迹并对其进行分析，并对实际交互进行评估。本研究采用基于深度学习的6D姿态估计方法确定按钮锁的位置和方向，并对关键操作参考点进行视觉识别。在模仿学习阶段，通过结合对抗逆强化学习（AIRL）和变阻抗控制（VIC）技术，在专家指导的轨迹和来自现实环境的力反馈数据的支持下，增强机械臂的操作策略。此外，在深度元学习框架中引入了潜在嵌入优化（Latent Embedding Optimization， LEO）模块，使模型能够快速适应新的任务，显著提高了模型的泛化能力。在CoppeliaSim仿真环境和移动机械臂平台上进行了实验，重点研究了识别过程、轨迹规划和合规控制管理。为了评估算法的有效性，执行了三个不同难度的按钮锁柜门打开任务。实验结果表明，该移动机械臂能够准确定位并顺应打开多个按钮锁，证明了该方法在提高机器人检测任务精度方面的实用性和可行性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.