OrthoMatch-Net: Unsupervised registration of orthodontic dental point clouds via hierarchical attention feature modeling and bidirectional matching mechanism
IF 3.4 2区 工程技术Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE
{"title":"OrthoMatch-Net: Unsupervised registration of orthodontic dental point clouds via hierarchical attention feature modeling and bidirectional matching mechanism","authors":"Shanshan Huang , Wenkang Chen , Ni Liao , Xuejun Zhang , Ganxin Ouyang","doi":"10.1016/j.displa.2025.103203","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate 3D dental point cloud registration is a crucial task for monitoring therapeutic progress and evaluating treatment outcomes. However, existing methods struggle to align dental structures owing to their intricate, highly similar shapes, as well as noise and pose variations in clinical environments, and are hindered by inadequate feature extraction and insufficient modeling of feature interactions. To tackle these challenges, we propose OrthoMatch-Net, an innovative unsupervised framework for dental point cloud registration, whose core contributions lie in two novel designs: (1) hierarchical attention feature modeling and (2) bidirectional matching mechanism, aimed at achieving robust alignment of pre- and post-treatment dental point clouds. The proposed hierarchical attention feature modeling employs transformation-invariant guided cross-attention to enhance local feature aggregation. It further captures global structural relationships through the window transformer. Moreover, a feedback interaction mechanism is introduced to enable feature fusion across hierarchical levels, thereby improving discriminative representation robustness for dental registration. Simultaneously, the bidirectional matching mechanism reinforces geometric consistency by learning key point correspondences in both directions (source-to-target and target-to-source). It leverages local structural consistency to weight and filter the matched pairs, effectively enhancing the symmetry and stability of the registration process. Extensive experiments on clinical dental datasets demonstrate that OrthoMatch-Net outperforms state-of-the-art methods with sub-millimeter accuracy across multiple metrics. It also exhibits strong robustness under noise perturbations, offering a practical and reliable solution for improving orthodontic treatment precision and supporting clinical decision-making. To facilitate further study, our source code and the pretrained models will be released at <span><span>https://github.com/shanshanhuang2023/OrthoMatch-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103203"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002409","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate 3D dental point cloud registration is a crucial task for monitoring therapeutic progress and evaluating treatment outcomes. However, existing methods struggle to align dental structures owing to their intricate, highly similar shapes, as well as noise and pose variations in clinical environments, and are hindered by inadequate feature extraction and insufficient modeling of feature interactions. To tackle these challenges, we propose OrthoMatch-Net, an innovative unsupervised framework for dental point cloud registration, whose core contributions lie in two novel designs: (1) hierarchical attention feature modeling and (2) bidirectional matching mechanism, aimed at achieving robust alignment of pre- and post-treatment dental point clouds. The proposed hierarchical attention feature modeling employs transformation-invariant guided cross-attention to enhance local feature aggregation. It further captures global structural relationships through the window transformer. Moreover, a feedback interaction mechanism is introduced to enable feature fusion across hierarchical levels, thereby improving discriminative representation robustness for dental registration. Simultaneously, the bidirectional matching mechanism reinforces geometric consistency by learning key point correspondences in both directions (source-to-target and target-to-source). It leverages local structural consistency to weight and filter the matched pairs, effectively enhancing the symmetry and stability of the registration process. Extensive experiments on clinical dental datasets demonstrate that OrthoMatch-Net outperforms state-of-the-art methods with sub-millimeter accuracy across multiple metrics. It also exhibits strong robustness under noise perturbations, offering a practical and reliable solution for improving orthodontic treatment precision and supporting clinical decision-making. To facilitate further study, our source code and the pretrained models will be released at https://github.com/shanshanhuang2023/OrthoMatch-Net.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.