Hyeon Cho, Hyungho Kim, Dae-Kwan Ko, Soo-Chul Lim, Wonjun Hwang
{"title":"哪种LSTM类型更适合相互作用力估计?","authors":"Hyeon Cho, Hyungho Kim, Dae-Kwan Ko, Soo-Chul Lim, Wonjun Hwang","doi":"10.1109/RITAPP.2019.8932854","DOIUrl":null,"url":null,"abstract":"Tactile, one of the five senses classified into the main senses of human, is the first sensation developed when human beings are formed. The tactile includes various information such as pressure, temperature, and texture of objects, it also helps the person to interact with the surrounding environment. One of the tactile information, the pressure is used in various fields such as medical, beauty, mobile devices and so on. However, humans can perceive the real world with multi-modal senses such as sound, vision. In this paper, we study interaction force estimation using haptic sensor and video. Interact ion force estimation through video analysis is one of a cross-modal approach that is applicable such as a software haptic feedback method that can give haptic feedback to remote control of robot arm by predicting interaction force even in absence of haptic sensor. we compare and analyze three types of a deep neural network to predict the interaction force. In particular, the best model for the stacking structure of CNN and LSTM is selected through a detailed analysis of how the structure change of LSTM affects the video regression problem. The average error of the best suit model is MSE 0.1306, RMSE 0.2740, MAE 0.1878.","PeriodicalId":234023,"journal":{"name":"2019 7th International Conference on Robot Intelligence Technology and Applications (RiTA)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Which LSTM Type is Better for Interaction Force Estimation?\",\"authors\":\"Hyeon Cho, Hyungho Kim, Dae-Kwan Ko, Soo-Chul Lim, Wonjun Hwang\",\"doi\":\"10.1109/RITAPP.2019.8932854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Tactile, one of the five senses classified into the main senses of human, is the first sensation developed when human beings are formed. The tactile includes various information such as pressure, temperature, and texture of objects, it also helps the person to interact with the surrounding environment. One of the tactile information, the pressure is used in various fields such as medical, beauty, mobile devices and so on. However, humans can perceive the real world with multi-modal senses such as sound, vision. In this paper, we study interaction force estimation using haptic sensor and video. Interact ion force estimation through video analysis is one of a cross-modal approach that is applicable such as a software haptic feedback method that can give haptic feedback to remote control of robot arm by predicting interaction force even in absence of haptic sensor. we compare and analyze three types of a deep neural network to predict the interaction force. In particular, the best model for the stacking structure of CNN and LSTM is selected through a detailed analysis of how the structure change of LSTM affects the video regression problem. The average error of the best suit model is MSE 0.1306, RMSE 0.2740, MAE 0.1878.\",\"PeriodicalId\":234023,\"journal\":{\"name\":\"2019 7th International Conference on Robot Intelligence Technology and Applications (RiTA)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 7th International Conference on Robot Intelligence Technology and Applications (RiTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RITAPP.2019.8932854\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 7th International Conference on Robot Intelligence Technology and Applications (RiTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RITAPP.2019.8932854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
触觉是人类五种主要感觉之一,是人类形成时最先发展起来的感觉。触觉包括物体的压力、温度和纹理等各种信息,它还有助于人与周围环境的互动。触觉信息之一,压力被应用于医疗、美容、移动设备等各个领域。然而,人类可以通过声音、视觉等多模态感官来感知现实世界。本文研究了基于触觉传感器和视频的交互力估计方法。通过视频分析进行交互力估计是一种跨模态方法,如软件触觉反馈方法,即使在没有触觉传感器的情况下,也可以通过预测交互力来对机器人手臂的远程控制进行触觉反馈。我们比较和分析了三种类型的深度神经网络来预测相互作用力。特别是,通过详细分析LSTM的结构变化对视频回归问题的影响,选择了CNN和LSTM叠加结构的最佳模型。最佳西服模型的平均误差为MSE 0.1306, RMSE 0.2740, MAE 0.1878。
Which LSTM Type is Better for Interaction Force Estimation?
Tactile, one of the five senses classified into the main senses of human, is the first sensation developed when human beings are formed. The tactile includes various information such as pressure, temperature, and texture of objects, it also helps the person to interact with the surrounding environment. One of the tactile information, the pressure is used in various fields such as medical, beauty, mobile devices and so on. However, humans can perceive the real world with multi-modal senses such as sound, vision. In this paper, we study interaction force estimation using haptic sensor and video. Interact ion force estimation through video analysis is one of a cross-modal approach that is applicable such as a software haptic feedback method that can give haptic feedback to remote control of robot arm by predicting interaction force even in absence of haptic sensor. we compare and analyze three types of a deep neural network to predict the interaction force. In particular, the best model for the stacking structure of CNN and LSTM is selected through a detailed analysis of how the structure change of LSTM affects the video regression problem. The average error of the best suit model is MSE 0.1306, RMSE 0.2740, MAE 0.1878.