MTSegNet：用于3D形状分割的歧管变压器

IF 1.3 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Computer Aided Geometric Design Pub Date : 2025-04-22 DOI:10.1016/j.cagd.2025.102440

Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu

{"title":"MTSegNet：用于3D形状分割的歧管变压器","authors":"Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu","doi":"10.1016/j.cagd.2025.102440","DOIUrl":null,"url":null,"abstract":"<div><div>The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.</div></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"119 ","pages":"Article 102440"},"PeriodicalIF":1.3000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MTSegNet: Manifold Transformer for 3D shape segmentation\",\"authors\":\"Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu\",\"doi\":\"10.1016/j.cagd.2025.102440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.</div></div>\",\"PeriodicalId\":55226,\"journal\":{\"name\":\"Computer Aided Geometric Design\",\"volume\":\"119 \",\"pages\":\"Article 102440\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Aided Geometric Design\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167839625000299\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839625000299","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

摘要

三维网格的语义分割是三维形状分析的一个重要组成部分，它涉及到为三维网格的每个面分配语义标签。尽管具有重要意义，但目前的方法往往难以捕获3D网格中的流形信息，这是将它们与3D数据的其他表示形式（如3D点云或3D体素）区分开来的基本特征，导致分割结果不理想。在本文中，我们提出了一种新的基于变压器的三维网格语义分割方法，流形变压器（MTSegNet），它可以有效地学习流形信息。通过使用分层变压器，MTSegNet可以同时捕获3D网格的局部和全局特征，同时降低计算复杂度和内存消耗。为了进一步提高我们的方法的性能，我们设计了一个有效的输入生成算法，该算法将输入数据序列化为多个表示3D网格几何和拓扑的令牌序列。该算法保留了三维网格的结构信息和空间关系，同时能够使用标准的Transformer架构。在PSB、COSEG、ShapeNetCore和HumanBody四个基准数据集上对该方法进行了评估，结果表明该方法在所有数据集上都取得了最先进的结果，优于之前的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MTSegNet: Manifold Transformer for 3D shape segmentation

The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Aided Geometric Design 工程技术-计算机：软件工程

CiteScore

3.50

自引率

13.30%

发文量

审稿时长

60 days

期刊介绍： The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following: -Mathematical and Geometric Foundations- Curve, Surface, and Volume generation- CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision- Industrial, medical, and scientific applications. The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.