对不同深度回归模型进行基准测试，预测图像旋转角度和机器人末端执行器的位置

2019 7th International Conference on Mechatronics Engineering (ICOM) Pub Date : 2019-10-01 DOI:10.1109/ICOM47790.2019.8952047

Nouar Aldahoul, Z. Htike

{"title":"对不同深度回归模型进行基准测试，预测图像旋转角度和机器人末端执行器的位置","authors":"Nouar Aldahoul, Z. Htike","doi":"10.1109/ICOM47790.2019.8952047","DOIUrl":null,"url":null,"abstract":"Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography, and head-pose estimation. Deep learning has recently been cutting-edge research. Most of the research papers have focused on utilizing deep learning in classification tasks. There is still a lack of research that use deep learning for regression. This paper utilizes different deep learning models for two regression tasks. The first one is the prediction of the image rotation angle. The second task is to predict the position of the robot's end-effector in 2D space. Efficient features were learned or extracted in order to perform good regression. The paper demonstrates and compares various models such as a local Receptive Field-Extreme Learning Machine (LRF-ELM), Hierarchical ELM, Supervised Convolutional Neural Network (CNN), and pre-trained CNN such as AlexNet. Each model was trained to learn or extract features and map them to specific continuous output. The results show that all models gave good performance in terms of RMSE and accuracy. H-ELM was found to outperform other models in term of training speed.","PeriodicalId":415914,"journal":{"name":"2019 7th International Conference on Mechatronics Engineering (ICOM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Benchmarking different deep regression models for predicting image rotation angle and robot's end effector's position\",\"authors\":\"Nouar Aldahoul, Z. Htike\",\"doi\":\"10.1109/ICOM47790.2019.8952047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography, and head-pose estimation. Deep learning has recently been cutting-edge research. Most of the research papers have focused on utilizing deep learning in classification tasks. There is still a lack of research that use deep learning for regression. This paper utilizes different deep learning models for two regression tasks. The first one is the prediction of the image rotation angle. The second task is to predict the position of the robot's end-effector in 2D space. Efficient features were learned or extracted in order to perform good regression. The paper demonstrates and compares various models such as a local Receptive Field-Extreme Learning Machine (LRF-ELM), Hierarchical ELM, Supervised Convolutional Neural Network (CNN), and pre-trained CNN such as AlexNet. Each model was trained to learn or extract features and map them to specific continuous output. The results show that all models gave good performance in terms of RMSE and accuracy. H-ELM was found to outperform other models in term of training speed.\",\"PeriodicalId\":415914,\"journal\":{\"name\":\"2019 7th International Conference on Mechatronics Engineering (ICOM)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 7th International Conference on Mechatronics Engineering (ICOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOM47790.2019.8952047\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 7th International Conference on Mechatronics Engineering (ICOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOM47790.2019.8952047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

深度视觉回归模型对于发现学习模型在多大程度上符合视觉数据(图像)与预测连续输出之间的关系具有重要作用。近年来，深度视觉回归已被广泛应用于年龄预测、数字全息、头姿估计等领域。深度学习是最近的前沿研究。大多数研究论文都集中在将深度学习应用于分类任务上。目前仍然缺乏使用深度学习进行回归的研究。本文对两个回归任务使用了不同的深度学习模型。首先是图像旋转角度的预测。第二个任务是预测机器人末端执行器在二维空间中的位置。学习或提取有效的特征以进行良好的回归。本文演示并比较了各种模型，如局部接受场极限学习机(LRF-ELM)、分层ELM、监督卷积神经网络(CNN)和预训练CNN(如AlexNet)。每个模型都经过训练来学习或提取特征，并将它们映射到特定的连续输出。结果表明，所有模型在均方根误差和准确率方面都有较好的表现。H-ELM在训练速度方面优于其他模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Benchmarking different deep regression models for predicting image rotation angle and robot's end effector's position

Deep visual regression models have an important role to find how much the learning model fits the relationship between the visual data (images) and the predicted continuous output. Recently, deep visual regression has been utilized in different applications such as age prediction, digital holography, and head-pose estimation. Deep learning has recently been cutting-edge research. Most of the research papers have focused on utilizing deep learning in classification tasks. There is still a lack of research that use deep learning for regression. This paper utilizes different deep learning models for two regression tasks. The first one is the prediction of the image rotation angle. The second task is to predict the position of the robot's end-effector in 2D space. Efficient features were learned or extracted in order to perform good regression. The paper demonstrates and compares various models such as a local Receptive Field-Extreme Learning Machine (LRF-ELM), Hierarchical ELM, Supervised Convolutional Neural Network (CNN), and pre-trained CNN such as AlexNet. Each model was trained to learn or extract features and map them to specific continuous output. The results show that all models gave good performance in terms of RMSE and accuracy. H-ELM was found to outperform other models in term of training speed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 7th International Conference on Mechatronics Engineering (ICOM)

自引率

0.00%

发文量