基于自编码器和深度嵌入网络的多视图三维目标检索比较研究

Sakifa Aktar, Md. Al Mamun, Md. Ali Hossain
{"title":"基于自编码器和深度嵌入网络的多视图三维目标检索比较研究","authors":"Sakifa Aktar, Md. Al Mamun, Md. Ali Hossain","doi":"10.1109/ICCITECHN.2018.8631920","DOIUrl":null,"url":null,"abstract":"In many computer vision based problems, multiview 3D object retrieval is very useful with many application possibilities. Actually multiview 3D object is represented by a set of different views of 2D images. There are many hand-crafted features extraction techniques. Rather than using them, deep embedding network and autoencoder are used to extract features and calculate Euclidean distance to measure the similarity. This paper emphasizes on the process of retrieving 3D object from multi-view 2D images. Two established deep learning based solutions are used to retrieve images of multi-view 3D object. Finally different evaluation metrics are used to compare image retrieval performance accuracy & compare computation time and space complexity for both autoencoder and deep embedding network techniques. Here, dimension reduction algorithms PCA & t-SNE are also used to interpret the retrieval results. The experimented results show that deep embedding network gained 98% accuracy & autoencoder gained 97% accuracy to retrieve multi-view 2D images using RGB-D dataset.","PeriodicalId":355984,"journal":{"name":"2018 21st International Conference of Computer and Information Technology (ICCIT)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Comparative Study of Multi-View 3D Object Retrieval with Autoencoder & Deep Embedding Network\",\"authors\":\"Sakifa Aktar, Md. Al Mamun, Md. Ali Hossain\",\"doi\":\"10.1109/ICCITECHN.2018.8631920\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In many computer vision based problems, multiview 3D object retrieval is very useful with many application possibilities. Actually multiview 3D object is represented by a set of different views of 2D images. There are many hand-crafted features extraction techniques. Rather than using them, deep embedding network and autoencoder are used to extract features and calculate Euclidean distance to measure the similarity. This paper emphasizes on the process of retrieving 3D object from multi-view 2D images. Two established deep learning based solutions are used to retrieve images of multi-view 3D object. Finally different evaluation metrics are used to compare image retrieval performance accuracy & compare computation time and space complexity for both autoencoder and deep embedding network techniques. Here, dimension reduction algorithms PCA & t-SNE are also used to interpret the retrieval results. The experimented results show that deep embedding network gained 98% accuracy & autoencoder gained 97% accuracy to retrieve multi-view 2D images using RGB-D dataset.\",\"PeriodicalId\":355984,\"journal\":{\"name\":\"2018 21st International Conference of Computer and Information Technology (ICCIT)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 21st International Conference of Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCITECHN.2018.8631920\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 21st International Conference of Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCITECHN.2018.8631920","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在许多基于计算机视觉的问题中,多视图三维目标检索是非常有用的,具有许多应用可能性。实际上,多视图3D对象是由一组不同视图的2D图像表示的。有许多手工特征提取技术。而不是使用它们,而是使用深度嵌入网络和自编码器来提取特征并计算欧几里得距离来测量相似度。本文重点研究了从多视图二维图像中检索三维目标的过程。两种基于深度学习的解决方案用于检索多视图三维物体图像。最后,采用不同的评价指标来比较自编码器和深度嵌入网络技术的图像检索性能、精度以及计算时间和空间复杂度。这里,降维算法PCA和t-SNE也被用来解释检索结果。实验结果表明,使用RGB-D数据集检索多视图2D图像,深度嵌入网络和自编码器的准确率分别达到98%和97%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparative Study of Multi-View 3D Object Retrieval with Autoencoder & Deep Embedding Network
In many computer vision based problems, multiview 3D object retrieval is very useful with many application possibilities. Actually multiview 3D object is represented by a set of different views of 2D images. There are many hand-crafted features extraction techniques. Rather than using them, deep embedding network and autoencoder are used to extract features and calculate Euclidean distance to measure the similarity. This paper emphasizes on the process of retrieving 3D object from multi-view 2D images. Two established deep learning based solutions are used to retrieve images of multi-view 3D object. Finally different evaluation metrics are used to compare image retrieval performance accuracy & compare computation time and space complexity for both autoencoder and deep embedding network techniques. Here, dimension reduction algorithms PCA & t-SNE are also used to interpret the retrieval results. The experimented results show that deep embedding network gained 98% accuracy & autoencoder gained 97% accuracy to retrieve multi-view 2D images using RGB-D dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信