Preliminary Study of Object Recognition by Converting Physical Responses to Images in Two Dimensions

Kazuki Yane, T. Nozaki
{"title":"Preliminary Study of Object Recognition by Converting Physical Responses to Images in Two Dimensions","authors":"Kazuki Yane, T. Nozaki","doi":"10.1109/ICM54990.2023.10101938","DOIUrl":null,"url":null,"abstract":"The use of robots is desired as a replacement for human labor. However, it is difficult for robots to respond flexibly to changes in objects and environments and perform tasks. Recently, many systems have been proposed that can flexibly respond to changes by generating robot motions using machine learning. Many machine learning methods use a camera to acquire environmental information, and feature extraction is performed using images acquired from the camera using CNN (Convolutional Neural Network), CAE (Convolutional Auto Encoder), or other methods. Many methods estimate the input values in the next step by inputting the image features, position data and reaction force data acquired from the robot together into the RNN (Recurrent Neural Network), etc. However, in most cases, the relationship between the image and robot data is learned without explicitly stating it. Therefore, in this paper, the data acquired from the robot is converted to images and used in combination with images from the camera to make the interaction between the robot and the environment explicit and to improve the estimation accuracy of NNs. In simulations, the proposed method was used to perform the task of discriminating the target of motion, and the high estimation accuracy was confirmed. In the future, we plan to use this method as input data for motion generation to generate motion according to the object.","PeriodicalId":416176,"journal":{"name":"2023 IEEE International Conference on Mechatronics (ICM)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Mechatronics (ICM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICM54990.2023.10101938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The use of robots is desired as a replacement for human labor. However, it is difficult for robots to respond flexibly to changes in objects and environments and perform tasks. Recently, many systems have been proposed that can flexibly respond to changes by generating robot motions using machine learning. Many machine learning methods use a camera to acquire environmental information, and feature extraction is performed using images acquired from the camera using CNN (Convolutional Neural Network), CAE (Convolutional Auto Encoder), or other methods. Many methods estimate the input values in the next step by inputting the image features, position data and reaction force data acquired from the robot together into the RNN (Recurrent Neural Network), etc. However, in most cases, the relationship between the image and robot data is learned without explicitly stating it. Therefore, in this paper, the data acquired from the robot is converted to images and used in combination with images from the camera to make the interaction between the robot and the environment explicit and to improve the estimation accuracy of NNs. In simulations, the proposed method was used to perform the task of discriminating the target of motion, and the high estimation accuracy was confirmed. In the future, we plan to use this method as input data for motion generation to generate motion according to the object.
将物理反应转换为二维图像的目标识别的初步研究
人们希望使用机器人来代替人类劳动。然而,机器人很难对物体和环境的变化做出灵活的反应并执行任务。最近,人们提出了许多系统,通过使用机器学习产生机器人运动来灵活地响应变化。许多机器学习方法使用相机获取环境信息,并使用CNN(卷积神经网络),CAE(卷积自动编码器)或其他方法使用从相机获取的图像进行特征提取。许多方法通过将从机器人获取的图像特征、位置数据和反作用力数据一起输入到RNN (Recurrent Neural Network,递归神经网络)中来估计下一步的输入值。然而,在大多数情况下,图像和机器人数据之间的关系是在没有明确说明的情况下学习的。因此,本文将机器人采集到的数据转换成图像,并与摄像头采集到的图像结合使用,使机器人与环境的交互作用更加明确,提高了神经网络的估计精度。仿真结果表明,该方法具有较高的运动目标识别精度。在未来,我们计划使用这种方法作为运动生成的输入数据,根据对象来生成运动。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信