基于区域的三维形状识别联合注意网络

IF 8.4 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yue Zhao, Weizhi Nie, Jie Nie, Yuyi Zhang, Bo Wang
{"title":"基于区域的三维形状识别联合注意网络","authors":"Yue Zhao,&nbsp;Weizhi Nie,&nbsp;Jie Nie,&nbsp;Yuyi Zhang,&nbsp;Bo Wang","doi":"10.1049/cit2.12388","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <p>As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.</p>\n </section>\n </div>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"460-473"},"PeriodicalIF":8.4000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12388","citationCount":"0","resultStr":"{\"title\":\"RJAN: Region-based joint attention network for 3D shape recognition\",\"authors\":\"Yue Zhao,&nbsp;Weizhi Nie,&nbsp;Jie Nie,&nbsp;Yuyi Zhang,&nbsp;Bo Wang\",\"doi\":\"10.1049/cit2.12388\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <p>As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.</p>\\n </section>\\n </div>\",\"PeriodicalId\":46211,\"journal\":{\"name\":\"CAAI Transactions on Intelligence Technology\",\"volume\":\"10 2\",\"pages\":\"460-473\"},\"PeriodicalIF\":8.4000,\"publicationDate\":\"2024-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12388\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CAAI Transactions on Intelligence Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/cit2.12388\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/cit2.12388","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

三维形状识别作为多媒体和计算机视觉的一个重要研究领域,近年来受到了广泛的关注。基于多视图的方法在生成有效的三维形状表示方面具有优势。典型的方法通常是提取多视图全局特征并将其聚合在一起生成三维形状描述符。然而,主流方法存在两个缺点:一是忽视了对各个视角的局部信息的全面挖掘。其次,许多方法通过将多视图特征添加或连接在一起来粗略地聚合多视图特征。一些判别特征的信息丢失限制了表征的有效性。为了解决这些问题,提出了一种基于区域的联合关注网络(RJAN)。具体来说,作者首先设计了一个分层的局部信息挖掘模块,用于视图描述符的提取。可以综合探索和利用不同粒度的区域与区域、通道与通道之间的关系,为视图特征学习提供更多的判别特征。随后,考虑到视图间的关系,设计了一种新的关系感知视图聚合模块,对多视图特征进行聚合,生成形状描述符。在ModelNet40、ModelNet10和ShapeNetCore55三个公共数据库上进行了广泛的实验。RJAN在三维形状分类和三维形状检索任务中达到了最先进的性能,证明了RJAN的有效性。该代码已在https://github.com/slurrpp/RJAN上发布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

RJAN: Region-based joint attention network for 3D shape recognition

RJAN: Region-based joint attention network for 3D shape recognition

As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CAAI Transactions on Intelligence Technology
CAAI Transactions on Intelligence Technology COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-
CiteScore
11.00
自引率
3.90%
发文量
134
审稿时长
35 weeks
期刊介绍: CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信