Sorting operation method of manipulator based on deep reinforcement learning

Qing An, Yanhua Chen, Hui Zeng, J. Wang
{"title":"Sorting operation method of manipulator based on deep reinforcement learning","authors":"Qing An, Yanhua Chen, Hui Zeng, J. Wang","doi":"10.1142/s1793962323410076","DOIUrl":null,"url":null,"abstract":"Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.","PeriodicalId":13657,"journal":{"name":"Int. J. Model. Simul. Sci. Comput.","volume":"263 1","pages":"2341007:1-2341007:22"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Model. Simul. Sci. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s1793962323410076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Radioactive waste sorting often faces an unstructured and locally radioactive working environment. At present, remote operation sorting has problems such as low sorting efficiency, greater difficulty in operation, longer training periods for personnel, and poor autonomous control capabilities. Based on the premise of improving the adaptability and autonomous operation ability of robots in an unstructured environment, this paper uses the dual deep Q learning algorithm to optimize the classic deep Q learning algorithm to improve training speed and improve sorting efficiency and stability. Secondly, the sorting algorithm model of deep reinforcement learning is used to determine the optimal behavior in this state. Set up multiple sets of simulations and physical experiments to verify the sorting method. The results show that the robotic arm can autonomously complete sorting tasks under complex conditions and can significantly improve work efficiency when pushing and grasping collaborative operations and will preferentially grasp objects with high radioactivity in the radioactive area. The algorithm has migration ability and good generalization.
基于深度强化学习的机械手分拣操作方法
放射性废物分类往往面临非结构化和局部放射性的工作环境。目前,远程操作分拣存在分拣效率低、操作难度大、人员培训时间长、自主控制能力差等问题。本文以提高机器人在非结构化环境中的适应性和自主操作能力为前提,采用双深度Q学习算法对经典深度Q学习算法进行优化,提高训练速度,提高分拣效率和稳定性。其次,利用深度强化学习的排序算法模型确定该状态下的最优行为;建立多组模拟和物理实验来验证分选方法。结果表明,该机械臂能够在复杂条件下自主完成分拣任务,在推抓协同作业时能显著提高工作效率,并优先抓取放射性区域内的高放射性物体。该算法具有迁移能力和良好的泛化能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信