基于深度强化学习的无传感器机械臂控制策略网络

IF 1.5 4区 计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING
Jin Wu, Yaqiao Zhu, Jinfu Li
{"title":"基于深度强化学习的无传感器机械臂控制策略网络","authors":"Jin Wu,&nbsp;Yaqiao Zhu,&nbsp;Jinfu Li","doi":"10.1002/cpe.70250","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper proposes a novel attention-based convolutional neural network (CNN) for sensor-free robotic arm control, aiming to improve six dimensional (6D) pose estimation and end-effector operation in an end-to-end manner. Unlike traditional methods that rely on explicit feature engineering or sensor feedback, our approach leverages a sophisticated attention mechanism within the convolutional backbone to enhance spatial awareness. The proposed localization sub-module scores each prior regime through a weighted average of activation maps, allowing the network to focus on the most informative regions of the input. Additionally, we introduce a two-phase training methodology requiring only image-level annotations. In the first phase, the network learns to extract discriminative features from synthetic images, which are crucial for accurate 6D pose prediction. In the second phase, a reinforcement learning agent, equipped with the trained vision model as its sensory module, is optimized using a sparse reward function to refine action policies. Experimental evaluations in two virtual scenarios demonstrate that our method outperforms popular CNN-based approaches in terms of both accuracy and efficiency. Specifically, our method improves task success rates by 52.9% and reduces position error by 72.3% compared to baseline models, showcasing its effectiveness in sensor-free robotic arm control.</p>\n </div>","PeriodicalId":55214,"journal":{"name":"Concurrency and Computation-Practice & Experience","volume":"37 23-24","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attention-Based Policy Network for Sensor-Free Robotic Arm Control With Deep Reinforcement Learning\",\"authors\":\"Jin Wu,&nbsp;Yaqiao Zhu,&nbsp;Jinfu Li\",\"doi\":\"10.1002/cpe.70250\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>This paper proposes a novel attention-based convolutional neural network (CNN) for sensor-free robotic arm control, aiming to improve six dimensional (6D) pose estimation and end-effector operation in an end-to-end manner. Unlike traditional methods that rely on explicit feature engineering or sensor feedback, our approach leverages a sophisticated attention mechanism within the convolutional backbone to enhance spatial awareness. The proposed localization sub-module scores each prior regime through a weighted average of activation maps, allowing the network to focus on the most informative regions of the input. Additionally, we introduce a two-phase training methodology requiring only image-level annotations. In the first phase, the network learns to extract discriminative features from synthetic images, which are crucial for accurate 6D pose prediction. In the second phase, a reinforcement learning agent, equipped with the trained vision model as its sensory module, is optimized using a sparse reward function to refine action policies. Experimental evaluations in two virtual scenarios demonstrate that our method outperforms popular CNN-based approaches in terms of both accuracy and efficiency. Specifically, our method improves task success rates by 52.9% and reduces position error by 72.3% compared to baseline models, showcasing its effectiveness in sensor-free robotic arm control.</p>\\n </div>\",\"PeriodicalId\":55214,\"journal\":{\"name\":\"Concurrency and Computation-Practice & Experience\",\"volume\":\"37 23-24\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Concurrency and Computation-Practice & Experience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70250\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Concurrency and Computation-Practice & Experience","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cpe.70250","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种新的基于注意力的卷积神经网络(CNN)用于无传感器机械臂控制,旨在以端到端方式改进六维(6D)姿态估计和末端执行器操作。与依赖显式特征工程或传感器反馈的传统方法不同,我们的方法利用卷积主干内复杂的注意机制来增强空间感知。提出的定位子模块通过激活图的加权平均对每个先验区域进行评分,使网络能够关注输入中信息量最大的区域。此外,我们还介绍了一种只需要图像级注释的两阶段训练方法。在第一阶段,网络学习从合成图像中提取判别特征,这对于准确预测6D姿势至关重要。在第二阶段,将训练好的视觉模型作为感知模块,使用稀疏奖励函数优化强化学习智能体,以细化行动策略。在两个虚拟场景下的实验评估表明,我们的方法在准确性和效率方面都优于流行的基于cnn的方法。具体而言,与基线模型相比,该方法的任务成功率提高了52.9%,位置误差降低了72.3%,显示了其在无传感器机械臂控制中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Attention-Based Policy Network for Sensor-Free Robotic Arm Control With Deep Reinforcement Learning

This paper proposes a novel attention-based convolutional neural network (CNN) for sensor-free robotic arm control, aiming to improve six dimensional (6D) pose estimation and end-effector operation in an end-to-end manner. Unlike traditional methods that rely on explicit feature engineering or sensor feedback, our approach leverages a sophisticated attention mechanism within the convolutional backbone to enhance spatial awareness. The proposed localization sub-module scores each prior regime through a weighted average of activation maps, allowing the network to focus on the most informative regions of the input. Additionally, we introduce a two-phase training methodology requiring only image-level annotations. In the first phase, the network learns to extract discriminative features from synthetic images, which are crucial for accurate 6D pose prediction. In the second phase, a reinforcement learning agent, equipped with the trained vision model as its sensory module, is optimized using a sparse reward function to refine action policies. Experimental evaluations in two virtual scenarios demonstrate that our method outperforms popular CNN-based approaches in terms of both accuracy and efficiency. Specifically, our method improves task success rates by 52.9% and reduces position error by 72.3% compared to baseline models, showcasing its effectiveness in sensor-free robotic arm control.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Concurrency and Computation-Practice & Experience
Concurrency and Computation-Practice & Experience 工程技术-计算机:理论方法
CiteScore
5.00
自引率
10.00%
发文量
664
审稿时长
9.6 months
期刊介绍: Concurrency and Computation: Practice and Experience (CCPE) publishes high-quality, original research papers, and authoritative research review papers, in the overlapping fields of: Parallel and distributed computing; High-performance computing; Computational and data science; Artificial intelligence and machine learning; Big data applications, algorithms, and systems; Network science; Ontologies and semantics; Security and privacy; Cloud/edge/fog computing; Green computing; and Quantum computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信