Reinforcement Learning-Based Simulation of Seal Engraving Robot in the Context of Artificial Intelligence

Journal of Artificial Intelligence and Technology Pub Date : 2024-03-21 DOI:10.37965/jait.2024.0453

Ran Tan, Khayril Anwar, Bin Khairudin

{"title":"Reinforcement Learning-Based Simulation of Seal Engraving Robot in the Context of Artificial Intelligence","authors":"Ran Tan, Khayril Anwar, Bin Khairudin","doi":"10.37965/jait.2024.0453","DOIUrl":null,"url":null,"abstract":"The rapid development of robotics technology has made people's lives and work more convenient and efficient. The research and simulation of robots combined with reinforcement learning intelligent algorithms have become a hotspot in various fields of robot applications. In view of this, this study is based on deep reinforcement learning convolutional neural networks, combined with point cloud models, proximal strategy optimization algorithms, and flexible action evaluation algorithms. A seal cutting robot based on deep reinforcement learning has been proposed. The final results show that the descent speed of the seal cutting robot with the root mean square difference as the performance standard is about 1% faster than the flexible action evaluation algorithm. About 2% faster than the proximal strategy optimization algorithm. It is about 4% faster than the deep deterministic strategy gradient algorithm. This indicates that the research model has certain advantages in terms of actual accuracy after cutting. The fluctuation of this model is about 10% smaller than the evaluation of flexible actions and about 60% smaller than the gradient of deep deterministic strategies. Therefore, the research model has the highest overall stability without falling into local optima. In addition, compared to the near end strategy optimization algorithm, it falls into local optima, resulting in a low coincidence degree of about 17%. The deep deterministic strategy gradient algorithm has a large fluctuation amplitude during the seal cutting process, and the overall curve is relatively slow, with a final overlap of about 70%. The overlap degree of flexible action evaluation is slightly higher by about 83%. The maximum stability of the model's overlap is best around 90%. Through experiments, it can be found that the seal cutting robot proposed in the study based on deep reinforcement learning maintains certain advantages in performance indicators in various types of tests.","PeriodicalId":135863,"journal":{"name":"Journal of Artificial Intelligence and Technology","volume":"10 4p2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Artificial Intelligence and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37965/jait.2024.0453","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The rapid development of robotics technology has made people's lives and work more convenient and efficient. The research and simulation of robots combined with reinforcement learning intelligent algorithms have become a hotspot in various fields of robot applications. In view of this, this study is based on deep reinforcement learning convolutional neural networks, combined with point cloud models, proximal strategy optimization algorithms, and flexible action evaluation algorithms. A seal cutting robot based on deep reinforcement learning has been proposed. The final results show that the descent speed of the seal cutting robot with the root mean square difference as the performance standard is about 1% faster than the flexible action evaluation algorithm. About 2% faster than the proximal strategy optimization algorithm. It is about 4% faster than the deep deterministic strategy gradient algorithm. This indicates that the research model has certain advantages in terms of actual accuracy after cutting. The fluctuation of this model is about 10% smaller than the evaluation of flexible actions and about 60% smaller than the gradient of deep deterministic strategies. Therefore, the research model has the highest overall stability without falling into local optima. In addition, compared to the near end strategy optimization algorithm, it falls into local optima, resulting in a low coincidence degree of about 17%. The deep deterministic strategy gradient algorithm has a large fluctuation amplitude during the seal cutting process, and the overall curve is relatively slow, with a final overlap of about 70%. The overlap degree of flexible action evaluation is slightly higher by about 83%. The maximum stability of the model's overlap is best around 90%. Through experiments, it can be found that the seal cutting robot proposed in the study based on deep reinforcement learning maintains certain advantages in performance indicators in various types of tests.

查看原文本刊更多论文

人工智能背景下基于强化学习的篆刻机器人仿真

机器人技术的飞速发展使人们的生活和工作更加便捷高效。结合强化学习智能算法的机器人研究与仿真已成为机器人各领域应用的热点。有鉴于此，本研究基于深度强化学习卷积神经网络，结合点云模型、近端策略优化算法、柔性动作评估算法等进行研究。提出了一种基于深度强化学习的密封切割机器人。最终结果表明，以均方根差值为性能标准的海豹切割机器人的下降速度比灵活动作评估算法快约 1%。比近似策略优化算法快约 2%。比深度确定性策略梯度算法快约 4%。这表明该研究模型在切割后的实际精度方面具有一定优势。该模型的波动比灵活行动评估小 10%，比深度确定性策略梯度小 60%。因此，该研究模型具有最高的整体稳定性，不会陷入局部最优。此外，与近端策略优化算法相比，它陷入了局部最优，导致了约 17% 的低重合度。深度确定性策略梯度算法在密封切割过程中波动幅度较大，整体曲线相对较慢，最终重合度约为 70%。灵活动作评估的重叠度略高，约为 83%。模型重叠度的最大稳定性最好在 90% 左右。通过实验可以发现，本研究提出的基于深度强化学习的密封切割机器人在各类测试中的性能指标保持了一定的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Artificial Intelligence and Technology

自引率

0.00%

发文量