TCFNet: Bidirectional face-bone transformation via a Transformer-based coarse-to-fine point movement network

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-06-16 DOI:10.1016/j.media.2025.103653

Runshi Zhang , Bimeng Jie , Yang He , Junchen Wang

{"title":"TCFNet: Bidirectional face-bone transformation via a Transformer-based coarse-to-fine point movement network","authors":"Runshi Zhang , Bimeng Jie , Yang He , Junchen Wang","doi":"10.1016/j.media.2025.103653","DOIUrl":null,"url":null,"abstract":"<div><div>Computer-aided surgical simulation is a critical component of orthognathic surgical planning, where accurately simulating face-bone shape transformations is significant. The traditional biomechanical simulation methods are limited by their computational time consumption levels, labor-intensive data processing strategies and low accuracy. Recently, deep learning-based simulation methods have been proposed to view this problem as a point-to-point transformation between skeletal and facial point clouds. However, these approaches cannot process large-scale points, have limited receptive fields that lead to noisy points, and employ complex preprocessing and postprocessing operations based on registration. These shortcomings limit the performance and widespread applicability of such methods. Therefore, we propose a Transformer-based coarse-to-fine point movement network (TCFNet) to learn unique, complicated correspondences at the patch and point levels for dense face-bone point cloud transformations. This end-to-end framework adopts a Transformer-based network and a local information aggregation network (LIA-Net) in the first and second stages, respectively, which reinforce each other to generate precise point movement paths. LIA-Net can effectively compensate for the neighborhood precision loss of the Transformer-based network by modeling local geometric structures (edges, orientations and relative position features). The previous global features are employed to guide the local displacement using a gated recurrent unit. Inspired by deformable medical image registration, we propose an auxiliary loss that can utilize expert knowledge for reconstructing critical organs. Our framework is an unsupervised algorithm, and this loss is optional. Compared with the existing state-of-the-art (SOTA) methods on gathered datasets, TCFNet achieves outstanding evaluation metrics and visualization results. The code is available at <span><span>https://github.com/Runshi-Zhang/TCFNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103653"},"PeriodicalIF":11.8000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002002","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Computer-aided surgical simulation is a critical component of orthognathic surgical planning, where accurately simulating face-bone shape transformations is significant. The traditional biomechanical simulation methods are limited by their computational time consumption levels, labor-intensive data processing strategies and low accuracy. Recently, deep learning-based simulation methods have been proposed to view this problem as a point-to-point transformation between skeletal and facial point clouds. However, these approaches cannot process large-scale points, have limited receptive fields that lead to noisy points, and employ complex preprocessing and postprocessing operations based on registration. These shortcomings limit the performance and widespread applicability of such methods. Therefore, we propose a Transformer-based coarse-to-fine point movement network (TCFNet) to learn unique, complicated correspondences at the patch and point levels for dense face-bone point cloud transformations. This end-to-end framework adopts a Transformer-based network and a local information aggregation network (LIA-Net) in the first and second stages, respectively, which reinforce each other to generate precise point movement paths. LIA-Net can effectively compensate for the neighborhood precision loss of the Transformer-based network by modeling local geometric structures (edges, orientations and relative position features). The previous global features are employed to guide the local displacement using a gated recurrent unit. Inspired by deformable medical image registration, we propose an auxiliary loss that can utilize expert knowledge for reconstructing critical organs. Our framework is an unsupervised algorithm, and this loss is optional. Compared with the existing state-of-the-art (SOTA) methods on gathered datasets, TCFNet achieves outstanding evaluation metrics and visualization results. The code is available at https://github.com/Runshi-Zhang/TCFNet.

查看原文本刊更多论文

TCFNet：通过基于transformer的粗到细点移动网络进行双向面骨转换

计算机辅助手术模拟是正颌手术计划的关键组成部分，其中准确模拟脸骨形状转换是重要的。传统的生物力学仿真方法存在计算时间大、数据处理策略费力、精度低等问题。最近，人们提出了基于深度学习的仿真方法，将该问题视为骨骼和面部点云之间的点对点转换。然而，这些方法不能处理大规模的点，有有限的接受域导致有噪声的点，并且采用复杂的基于配准的预处理和后处理操作。这些缺点限制了这些方法的性能和广泛适用性。因此，我们提出了一种基于transformer的粗到细点移动网络（TCFNet），以学习密集面骨点云变换在斑块和点水平上的独特，复杂的对应关系。该框架在第一阶段和第二阶段分别采用基于transformer的网络和局部信息聚合网络（LIA-Net），两者相互增强以生成精确的点移动路径。LIA-Net通过对局部几何结构（边缘、方向和相对位置特征）的建模，可以有效地弥补基于变压器的网络的邻域精度损失。使用门控循环单元，利用先前的全局特征来引导局部位移。受可变形医学图像配准的启发，我们提出了一种利用专家知识重建关键器官的辅助损失算法。我们的框架是一个无监督算法，这种损失是可选的。与现有的最先进的SOTA方法相比，TCFNet在收集数据集上取得了出色的评价指标和可视化结果。代码可在https://github.com/Runshi-Zhang/TCFNet上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.