将物理反应转换为二维图像的目标识别的初步研究

2023 IEEE International Conference on Mechatronics (ICM) Pub Date : 2023-03-15 DOI:10.1109/ICM54990.2023.10101938

Kazuki Yane, T. Nozaki

{"title":"将物理反应转换为二维图像的目标识别的初步研究","authors":"Kazuki Yane, T. Nozaki","doi":"10.1109/ICM54990.2023.10101938","DOIUrl":null,"url":null,"abstract":"The use of robots is desired as a replacement for human labor. However, it is difficult for robots to respond flexibly to changes in objects and environments and perform tasks. Recently, many systems have been proposed that can flexibly respond to changes by generating robot motions using machine learning. Many machine learning methods use a camera to acquire environmental information, and feature extraction is performed using images acquired from the camera using CNN (Convolutional Neural Network), CAE (Convolutional Auto Encoder), or other methods. Many methods estimate the input values in the next step by inputting the image features, position data and reaction force data acquired from the robot together into the RNN (Recurrent Neural Network), etc. However, in most cases, the relationship between the image and robot data is learned without explicitly stating it. Therefore, in this paper, the data acquired from the robot is converted to images and used in combination with images from the camera to make the interaction between the robot and the environment explicit and to improve the estimation accuracy of NNs. In simulations, the proposed method was used to perform the task of discriminating the target of motion, and the high estimation accuracy was confirmed. In the future, we plan to use this method as input data for motion generation to generate motion according to the object.","PeriodicalId":416176,"journal":{"name":"2023 IEEE International Conference on Mechatronics (ICM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Preliminary Study of Object Recognition by Converting Physical Responses to Images in Two Dimensions\",\"authors\":\"Kazuki Yane, T. Nozaki\",\"doi\":\"10.1109/ICM54990.2023.10101938\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of robots is desired as a replacement for human labor. However, it is difficult for robots to respond flexibly to changes in objects and environments and perform tasks. Recently, many systems have been proposed that can flexibly respond to changes by generating robot motions using machine learning. Many machine learning methods use a camera to acquire environmental information, and feature extraction is performed using images acquired from the camera using CNN (Convolutional Neural Network), CAE (Convolutional Auto Encoder), or other methods. Many methods estimate the input values in the next step by inputting the image features, position data and reaction force data acquired from the robot together into the RNN (Recurrent Neural Network), etc. However, in most cases, the relationship between the image and robot data is learned without explicitly stating it. Therefore, in this paper, the data acquired from the robot is converted to images and used in combination with images from the camera to make the interaction between the robot and the environment explicit and to improve the estimation accuracy of NNs. In simulations, the proposed method was used to perform the task of discriminating the target of motion, and the high estimation accuracy was confirmed. In the future, we plan to use this method as input data for motion generation to generate motion according to the object.\",\"PeriodicalId\":416176,\"journal\":{\"name\":\"2023 IEEE International Conference on Mechatronics (ICM)\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Mechatronics (ICM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICM54990.2023.10101938\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mechatronics (ICM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICM54990.2023.10101938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

人们希望使用机器人来代替人类劳动。然而，机器人很难对物体和环境的变化做出灵活的反应并执行任务。最近，人们提出了许多系统，通过使用机器学习产生机器人运动来灵活地响应变化。许多机器学习方法使用相机获取环境信息，并使用CNN(卷积神经网络)，CAE(卷积自动编码器)或其他方法使用从相机获取的图像进行特征提取。许多方法通过将从机器人获取的图像特征、位置数据和反作用力数据一起输入到RNN (Recurrent Neural Network，递归神经网络)中来估计下一步的输入值。然而，在大多数情况下，图像和机器人数据之间的关系是在没有明确说明的情况下学习的。因此，本文将机器人采集到的数据转换成图像，并与摄像头采集到的图像结合使用，使机器人与环境的交互作用更加明确，提高了神经网络的估计精度。仿真结果表明，该方法具有较高的运动目标识别精度。在未来，我们计划使用这种方法作为运动生成的输入数据，根据对象来生成运动。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Preliminary Study of Object Recognition by Converting Physical Responses to Images in Two Dimensions

The use of robots is desired as a replacement for human labor. However, it is difficult for robots to respond flexibly to changes in objects and environments and perform tasks. Recently, many systems have been proposed that can flexibly respond to changes by generating robot motions using machine learning. Many machine learning methods use a camera to acquire environmental information, and feature extraction is performed using images acquired from the camera using CNN (Convolutional Neural Network), CAE (Convolutional Auto Encoder), or other methods. Many methods estimate the input values in the next step by inputting the image features, position data and reaction force data acquired from the robot together into the RNN (Recurrent Neural Network), etc. However, in most cases, the relationship between the image and robot data is learned without explicitly stating it. Therefore, in this paper, the data acquired from the robot is converted to images and used in combination with images from the camera to make the interaction between the robot and the environment explicit and to improve the estimation accuracy of NNs. In simulations, the proposed method was used to perform the task of discriminating the target of motion, and the high estimation accuracy was confirmed. In the future, we plan to use this method as input data for motion generation to generate motion according to the object.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2023 IEEE International Conference on Mechatronics (ICM)

自引率

0.00%

发文量