{"title":"MODD:多中心一次性医学地标检测与去噪。","authors":"Jialin Shi, Xiangde Li, Ning Zhang, Zongjie Wang","doi":"10.1088/1361-6560/ae0d40","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>One-shot landmark detection on lateral cephalometric X-ray images has advantages for label-efficient training. As clinical datasets are typically small and do not generalize well to variations in equipment or imaging environments. When using datasets from multiple studies (multi-center data) for joint training, the imbalance in the number of images and the differences in image properties can lead to poor generalization performance. In this work, we aim to propose a method that leverages more data from multiple experiments to improve the accuracy of a single experiment.
Approach. To address these challenges, we propose a Multicenter One-shot landmark Detection and Denoising framework (MODD). It incorporates a self-supervised one-shot mapping based on multicenter template transformation and the pseudo-label denoising module. Label denoising is used to reduce the impact of inaccurate pseudo-labels on the algorithm, focusing on label quality rather than signal noise in X-ray images.
For denoising module, we propose the shuffled dynamic sample selection and contrastive correction of multicenter pseudo labels. These two components together enable more accurate one-shot landmarks detection on lateral cephalometric X-ray images in multicenter scenarios.
Main results. Experiments are conducted with the publicly available multicenter cephalometric X-ray datasets. MODD achieves a landmark detection accuracy of 79.27\\% within a 4.0 mm range and the mean radial error of 2.94 mm, demonstrating satisfactory performance compared to state-of-the-art methods. 
Significance. This study expands the application of medical landmark detection to the multi-center one-shot filed and demonstrates the potential of the MODD architecture.</p>","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MODD: Multicenter one-shot medical landmark detection and denoising.\",\"authors\":\"Jialin Shi, Xiangde Li, Ning Zhang, Zongjie Wang\",\"doi\":\"10.1088/1361-6560/ae0d40\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>One-shot landmark detection on lateral cephalometric X-ray images has advantages for label-efficient training. As clinical datasets are typically small and do not generalize well to variations in equipment or imaging environments. When using datasets from multiple studies (multi-center data) for joint training, the imbalance in the number of images and the differences in image properties can lead to poor generalization performance. In this work, we aim to propose a method that leverages more data from multiple experiments to improve the accuracy of a single experiment.
Approach. To address these challenges, we propose a Multicenter One-shot landmark Detection and Denoising framework (MODD). It incorporates a self-supervised one-shot mapping based on multicenter template transformation and the pseudo-label denoising module. Label denoising is used to reduce the impact of inaccurate pseudo-labels on the algorithm, focusing on label quality rather than signal noise in X-ray images.
For denoising module, we propose the shuffled dynamic sample selection and contrastive correction of multicenter pseudo labels. These two components together enable more accurate one-shot landmarks detection on lateral cephalometric X-ray images in multicenter scenarios.
Main results. Experiments are conducted with the publicly available multicenter cephalometric X-ray datasets. MODD achieves a landmark detection accuracy of 79.27\\\\% within a 4.0 mm range and the mean radial error of 2.94 mm, demonstrating satisfactory performance compared to state-of-the-art methods. 
Significance. This study expands the application of medical landmark detection to the multi-center one-shot filed and demonstrates the potential of the MODD architecture.</p>\",\"PeriodicalId\":20185,\"journal\":{\"name\":\"Physics in medicine and biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physics in medicine and biology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6560/ae0d40\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/ae0d40","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
MODD: Multicenter one-shot medical landmark detection and denoising.
Objective: One-shot landmark detection on lateral cephalometric X-ray images has advantages for label-efficient training. As clinical datasets are typically small and do not generalize well to variations in equipment or imaging environments. When using datasets from multiple studies (multi-center data) for joint training, the imbalance in the number of images and the differences in image properties can lead to poor generalization performance. In this work, we aim to propose a method that leverages more data from multiple experiments to improve the accuracy of a single experiment.
Approach. To address these challenges, we propose a Multicenter One-shot landmark Detection and Denoising framework (MODD). It incorporates a self-supervised one-shot mapping based on multicenter template transformation and the pseudo-label denoising module. Label denoising is used to reduce the impact of inaccurate pseudo-labels on the algorithm, focusing on label quality rather than signal noise in X-ray images.
For denoising module, we propose the shuffled dynamic sample selection and contrastive correction of multicenter pseudo labels. These two components together enable more accurate one-shot landmarks detection on lateral cephalometric X-ray images in multicenter scenarios.
Main results. Experiments are conducted with the publicly available multicenter cephalometric X-ray datasets. MODD achieves a landmark detection accuracy of 79.27\% within a 4.0 mm range and the mean radial error of 2.94 mm, demonstrating satisfactory performance compared to state-of-the-art methods.
Significance. This study expands the application of medical landmark detection to the multi-center one-shot filed and demonstrates the potential of the MODD architecture.
期刊介绍:
The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry