基于区域的三维形状识别联合注意网络

IF 7.3 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

CAAI Transactions on Intelligence Technology Pub Date : 2024-10-24 DOI:10.1049/cit2.12388

Yue Zhao, Weizhi Nie, Jie Nie, Yuyi Zhang, Bo Wang

{"title":"基于区域的三维形状识别联合注意网络","authors":"Yue Zhao, Weizhi Nie, Jie Nie, Yuyi Zhang, Bo Wang","doi":"10.1049/cit2.12388","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <p>As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.</p>\n </section>\n </div>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 2","pages":"460-473"},"PeriodicalIF":7.3000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12388","citationCount":"0","resultStr":"{\"title\":\"RJAN: Region-based joint attention network for 3D shape recognition\",\"authors\":\"Yue Zhao, Weizhi Nie, Jie Nie, Yuyi Zhang, Bo Wang\",\"doi\":\"10.1049/cit2.12388\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <p>As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.</p>\\n </section>\\n </div>\",\"PeriodicalId\":46211,\"journal\":{\"name\":\"CAAI Transactions on Intelligence Technology\",\"volume\":\"10 2\",\"pages\":\"460-473\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2024-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12388\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CAAI Transactions on Intelligence Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cit2.12388\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cit2.12388","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

三维形状识别作为多媒体和计算机视觉的一个重要研究领域，近年来受到了广泛的关注。基于多视图的方法在生成有效的三维形状表示方面具有优势。典型的方法通常是提取多视图全局特征并将其聚合在一起生成三维形状描述符。然而，主流方法存在两个缺点：一是忽视了对各个视角的局部信息的全面挖掘。其次，许多方法通过将多视图特征添加或连接在一起来粗略地聚合多视图特征。一些判别特征的信息丢失限制了表征的有效性。为了解决这些问题，提出了一种基于区域的联合关注网络（RJAN）。具体来说，作者首先设计了一个分层的局部信息挖掘模块，用于视图描述符的提取。可以综合探索和利用不同粒度的区域与区域、通道与通道之间的关系，为视图特征学习提供更多的判别特征。随后，考虑到视图间的关系，设计了一种新的关系感知视图聚合模块，对多视图特征进行聚合，生成形状描述符。在ModelNet40、ModelNet10和ShapeNetCore55三个公共数据库上进行了广泛的实验。RJAN在三维形状分类和三维形状检索任务中达到了最先进的性能，证明了RJAN的有效性。该代码已在https://github.com/slurrpp/RJAN上发布。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

RJAN: Region-based joint attention network for 3D shape recognition

查看原文本刊更多论文

RJAN: Region-based joint attention network for 3D shape recognition

As an essential field of multimedia and computer vision, 3D shape recognition has attracted much research attention in recent years. Multiview-based approaches have demonstrated their superiority in generating effective 3D shape representations. Typical methods usually extract the multiview global features and aggregate them together to generate 3D shape descriptors. However, there exist two disadvantages: First, the mainstream methods ignore the comprehensive exploration of local information in each view. Second, many approaches roughly aggregate multiview features by adding or concatenating them together. The information loss for some discriminative characteristics limits the representation effectiveness. To address these problems, a novel architecture named region-based joint attention network (RJAN) was proposed. Specifically, the authors first design a hierarchical local information exploration module for view descriptor extraction. The region-to-region and channel-to-channel relationships from different granularities can be comprehensively explored and utilised to provide more discriminative characteristics for view feature learning. Subsequently, a novel relation-aware view aggregation module is designed to aggregate the multiview features for shape descriptor generation, considering the view-to-view relationships. Extensive experiments were conducted on three public databases: ModelNet40, ModelNet10, and ShapeNetCore55. RJAN achieves state-of-the-art performance in the tasks of 3D shape classification and 3D shape retrieval, which demonstrates the effectiveness of RJAN. The code has been released on https://github.com/slurrpp/RJAN.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CAAI Transactions on Intelligence Technology COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

11.00

自引率

3.90%

发文量

134

审稿时长

35 weeks

期刊介绍： CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.