{"title":"基于多模态对比学习的三维形状分析","authors":"Zhenyu Shu , Xufei Sun , Chaoyi Pang","doi":"10.1016/j.cagd.2025.102454","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, 3D shape analysis has emerged as a crucial field with applications in various domains, such as multimedia processing, computer graphics, computer vision, and robotics. The ability to understand and interpret 3D shapes is fundamental for tasks like 3D shape segmentation, points of interest detection, shape retrieval, recognition, and generation. However, the complexity of 3D mesh models is a significant barrier that stops the topic from enhancing. Thus, we propose a novel 3D shape analysis framework in this paper by multi-modal contrastive learning techniques. Our framework makes use of the original mesh data and the projected images from various points of view of the mesh model. Those two modals contribute to providing more precise features with the help of our within-modal and cross-modal losses, which respectively calculate the distances of feature vectors within the mesh model and between feature vectors of mesh and image. Our framework is tested on downstream tasks, including 3D shape segmentation and points of interest detection, and outperforms most state-of-the-art methods on public datasets.</div></div>","PeriodicalId":55226,"journal":{"name":"Computer Aided Geometric Design","volume":"119 ","pages":"Article 102454"},"PeriodicalIF":1.3000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"3D shape analysis via multi-modal contrastive learning\",\"authors\":\"Zhenyu Shu , Xufei Sun , Chaoyi Pang\",\"doi\":\"10.1016/j.cagd.2025.102454\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In recent years, 3D shape analysis has emerged as a crucial field with applications in various domains, such as multimedia processing, computer graphics, computer vision, and robotics. The ability to understand and interpret 3D shapes is fundamental for tasks like 3D shape segmentation, points of interest detection, shape retrieval, recognition, and generation. However, the complexity of 3D mesh models is a significant barrier that stops the topic from enhancing. Thus, we propose a novel 3D shape analysis framework in this paper by multi-modal contrastive learning techniques. Our framework makes use of the original mesh data and the projected images from various points of view of the mesh model. Those two modals contribute to providing more precise features with the help of our within-modal and cross-modal losses, which respectively calculate the distances of feature vectors within the mesh model and between feature vectors of mesh and image. Our framework is tested on downstream tasks, including 3D shape segmentation and points of interest detection, and outperforms most state-of-the-art methods on public datasets.</div></div>\",\"PeriodicalId\":55226,\"journal\":{\"name\":\"Computer Aided Geometric Design\",\"volume\":\"119 \",\"pages\":\"Article 102454\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Aided Geometric Design\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167839625000433\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Aided Geometric Design","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167839625000433","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
3D shape analysis via multi-modal contrastive learning
In recent years, 3D shape analysis has emerged as a crucial field with applications in various domains, such as multimedia processing, computer graphics, computer vision, and robotics. The ability to understand and interpret 3D shapes is fundamental for tasks like 3D shape segmentation, points of interest detection, shape retrieval, recognition, and generation. However, the complexity of 3D mesh models is a significant barrier that stops the topic from enhancing. Thus, we propose a novel 3D shape analysis framework in this paper by multi-modal contrastive learning techniques. Our framework makes use of the original mesh data and the projected images from various points of view of the mesh model. Those two modals contribute to providing more precise features with the help of our within-modal and cross-modal losses, which respectively calculate the distances of feature vectors within the mesh model and between feature vectors of mesh and image. Our framework is tested on downstream tasks, including 3D shape segmentation and points of interest detection, and outperforms most state-of-the-art methods on public datasets.
期刊介绍:
The journal Computer Aided Geometric Design is for researchers, scholars, and software developers dealing with mathematical and computational methods for the description of geometric objects as they arise in areas ranging from CAD/CAM to robotics and scientific visualization. The journal publishes original research papers, survey papers and with quick editorial decisions short communications of at most 3 pages. The primary objects of interest are curves, surfaces, and volumes such as splines (NURBS), meshes, subdivision surfaces as well as algorithms to generate, analyze, and manipulate them. This journal will report on new developments in CAGD and its applications, including but not restricted to the following:
-Mathematical and Geometric Foundations-
Curve, Surface, and Volume generation-
CAGD applications in Numerical Analysis, Computational Geometry, Computer Graphics, or Computer Vision-
Industrial, medical, and scientific applications.
The aim is to collect and disseminate information on computer aided design in one journal. To provide the user community with methods and algorithms for representing curves and surfaces. To illustrate computer aided geometric design by means of interesting applications. To combine curve and surface methods with computer graphics. To explain scientific phenomena by means of computer graphics. To concentrate on the interaction between theory and application. To expose unsolved problems of the practice. To develop new methods in computer aided geometry.