Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu
{"title":"MTSegNet:用于3D形状分割的歧管变压器","authors":"Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu","doi":"10.1016/j.cagd.2025.102440","DOIUrl":null,"url":null,"abstract":"<div><div>The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.</div></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"119 ","pages":"Article 102440"},"PeriodicalIF":1.3000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MTSegNet: Manifold Transformer for 3D shape segmentation\",\"authors\":\"Zhenyu Shu , Zhichao Zhang , Yiming Zhao , Teng Wu\",\"doi\":\"10.1016/j.cagd.2025.102440\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.</div></div>\",\"PeriodicalId\":55226,\"journal\":{\"name\":\"Computer Aided Geometric Design\",\"volume\":\"119 \",\"pages\":\"Article 102440\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Aided Geometric Design\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167839625000299\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839625000299","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
MTSegNet: Manifold Transformer for 3D shape segmentation
The semantic segmentation of 3D meshes is a critical component of 3D shape analysis, which involves assigning semantic labels to each face of a 3D mesh. Despite its significance, current methods often struggle to capture manifold information in 3D meshes, a fundamental characteristic distinguishing them from other representation forms of 3D data, like 3D point clouds or 3D voxels, resulting in suboptimal segmentation outcomes. In this paper, we propose a novel Transformer-based approach, Manifold Transformer (MTSegNet), for 3D mesh semantic segmentation, which effectively learns manifold information. By using hierarchical Transformers, MTSegNet can capture both local and global features of 3D meshes, while reducing the computational complexity and memory consumption. To further improve the performance of our method, we design an effective input-generating algorithm that serializes input data into multiple sequences of tokens that represent the geometry and topology of 3D meshes. This algorithm preserves the structural information and spatial relations of 3D meshes, while enabling the use of standard Transformer architectures. The proposed method is evaluated on four benchmark datasets: PSB, COSEG, ShapeNetCore, and HumanBody, and it achieves state-of-the-art results on all datasets, outperforming the previous methods.
期刊介绍:
The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following:
-Mathematical and Geometric Foundations-
Curve, Surface, and Volume generation-
CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision-
Industrial, medical, and scientific applications.
The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.