A task-driven network for mesh classification and semantic part segmentation

IF 1.7 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Computer Aided Geometric Design Pub Date : 2024-04-24 DOI:10.1016/j.cagd.2024.102304

Qiujie Dong , Xiaoran Gong , Rui Xu , Zixiong Wang , Junjie Gao , Shuangmin Chen , Shiqing Xin , Changhe Tu , Wenping Wang

{"title":"A task-driven network for mesh classification and semantic part segmentation","authors":"Qiujie Dong , Xiaoran Gong , Rui Xu , Zixiong Wang , Junjie Gao , Shuangmin Chen , Shiqing Xin , Changhe Tu , Wenping Wang","doi":"10.1016/j.cagd.2024.102304","DOIUrl":null,"url":null,"abstract":"<div><p>Given the rapid advancements in geometric deep-learning techniques, there has been a dedicated effort to create mesh-based convolutional operators that act as a link between irregular mesh structures and widely adopted backbone networks. Despite the numerous advantages of Convolutional Neural Networks (CNNs) over Multi-Layer Perceptrons (MLPs), mesh-oriented CNNs often require intricate network architectures to tackle irregularities of a triangular mesh. These architectures not only demand that the mesh be manifold and watertight but also impose constraints on the abundance of training samples. In this paper, we note that for specific tasks such as mesh classification and semantic part segmentation, large-scale shape features play a pivotal role. This is in contrast to the realm of shape correspondence, where a comprehensive understanding of 3D shapes necessitates considering both local and global characteristics. Inspired by this key observation, we introduce a task-driven neural network architecture that seamlessly operates in an end-to-end fashion. Our method takes as input mesh vertices equipped with the heat kernel signature (HKS) and dihedral angles between adjacent faces. Notably, we replace the conventional convolutional module, commonly found in ResNet architectures, with MLPs and incorporate Layer Normalization (LN) to facilitate layer-wise normalization. Our approach, with a seemingly straightforward network architecture, demonstrates an accuracy advantage. It exhibits a marginal 0.1% improvement in the mesh classification task and a substantial 1.8% enhancement in the mesh part segmentation task compared to state-of-the-art methodologies. Moreover, as the number of training samples decreases to 1/50 or even 1/100, the accuracy advantage of our approach becomes more pronounced. In summary, our convolution-free network is tailored for specific tasks relying on large-scale shape features and excels in the situation with a limited number of training samples, setting itself apart from state-of-the-art methodologies.</p></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"111 ","pages":"Article 102304"},"PeriodicalIF":1.7000,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839624000384","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Given the rapid advancements in geometric deep-learning techniques, there has been a dedicated effort to create mesh-based convolutional operators that act as a link between irregular mesh structures and widely adopted backbone networks. Despite the numerous advantages of Convolutional Neural Networks (CNNs) over Multi-Layer Perceptrons (MLPs), mesh-oriented CNNs often require intricate network architectures to tackle irregularities of a triangular mesh. These architectures not only demand that the mesh be manifold and watertight but also impose constraints on the abundance of training samples. In this paper, we note that for specific tasks such as mesh classification and semantic part segmentation, large-scale shape features play a pivotal role. This is in contrast to the realm of shape correspondence, where a comprehensive understanding of 3D shapes necessitates considering both local and global characteristics. Inspired by this key observation, we introduce a task-driven neural network architecture that seamlessly operates in an end-to-end fashion. Our method takes as input mesh vertices equipped with the heat kernel signature (HKS) and dihedral angles between adjacent faces. Notably, we replace the conventional convolutional module, commonly found in ResNet architectures, with MLPs and incorporate Layer Normalization (LN) to facilitate layer-wise normalization. Our approach, with a seemingly straightforward network architecture, demonstrates an accuracy advantage. It exhibits a marginal 0.1% improvement in the mesh classification task and a substantial 1.8% enhancement in the mesh part segmentation task compared to state-of-the-art methodologies. Moreover, as the number of training samples decreases to 1/50 or even 1/100, the accuracy advantage of our approach becomes more pronounced. In summary, our convolution-free network is tailored for specific tasks relying on large-scale shape features and excels in the situation with a limited number of training samples, setting itself apart from state-of-the-art methodologies.

查看原文本刊更多论文

用于网格分类和语义部分分割的任务驱动网络

鉴于几何深度学习技术的快速发展，人们一直致力于创建基于网格的卷积算子，作为不规则网格结构与广泛采用的骨干网络之间的纽带。尽管与多层感知器（MLP）相比，卷积神经网络（CNN）具有诸多优势，但面向网格的 CNN 通常需要复杂的网络架构来处理三角形网格的不规则性。这些架构不仅要求网格具有多面性和无懈可击性，还对训练样本的丰富程度施加了限制。在本文中，我们注意到在网格分类和语义部分分割等特定任务中，大规模形状特征起着举足轻重的作用。这与形状对应领域形成了鲜明对比，在形状对应领域，要全面了解三维形状，必须同时考虑局部和全局特征。受这一重要观点的启发，我们引入了一种任务驱动型神经网络架构，该架构以端到端的方式无缝运行。我们的方法将配备热核特征（HKS）的网格顶点和相邻面之间的二面角作为输入。值得注意的是，我们用 MLP 取代了 ResNet 架构中常见的传统卷积模块，并加入了层归一化（LN）以促进层归一化。我们的方法采用看似简单的网络架构，但在准确性方面却具有优势。与最先进的方法相比，它在网格分类任务中提高了 0.1%，在网格部分分割任务中提高了 1.8%。此外，当训练样本数减少到 1/50 甚至 1/100 时，我们的方法在准确性上的优势会更加明显。总之，我们的无卷积网络专为依赖大规模形状特征的特定任务定制，并在训练样本数量有限的情况下表现出色，从而在最先进的方法中脱颖而出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Aided Geometric Design 工程技术-计算机：软件工程

CiteScore

3.50

自引率

13.30%

发文量

审稿时长

60 days

期刊介绍： The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following: -Mathematical and Geometric Foundations- Curve, Surface, and Volume generation- CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision- Industrial, medical, and scientific applications. The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.