{"title":"Accuracy evaluation of an automatic landmarking method based on 3D segmented CT scans of full lower limbs with knee osteoarthritis","authors":"Clément Horteur , Nolwenn Fougeron , Benoît Gaulin , Ysé Roch , Marek Bucki , Estelle Palluel , Yohan Payan , Antoine Perrier","doi":"10.1016/j.ejrad.2025.112292","DOIUrl":null,"url":null,"abstract":"<div><div>In preoperative planning, a number of techniques for registration of anatomical landmarks on bones, or “bones landmarking”, have been developed for automatic measurement of lower limbs. However, the evaluation of these techniques is often neglected, reducing confidence in results and limiting clinical applications. We propose a protocol to evaluate landmarking techniques applied to thirty 3D images of osteoarthritic lower limbs. The protocol includes the evaluation of the ground truth based on manual landmarking performed by two experienced clinicians. Intra- and inter-operator variability were quantified. Automatic landmarking was evaluated using the same procedure and compared to manual landmarking by computing distances with bounding boxes including manual landmarks. The method was applied to 87 landmarks on the femur, tibia, fibula, and patella. For intra-operator variability, the mean median error was 2.0 mm, with a maximum of 42.1 mm. For inter-operator variability, errors ranged from 0.0 to 43.5 mm, with a mean median error of 2.3 mm across all landmarks. The mean median error between manual and automatic landmarking methods was 2.4 mm, with a range from 0.0 mm to 22.8 mm. Confidence intervals of automatic landmarking errors were consistent with intra- and inter-operator variability confidence intervals. Approximately 42 % of the automatic landmarks were inside their bounding boxes, with a mean distance of 1.4 mm from the box for landmarks on the outside. Results showed different errors depending on landmark location and coordinates. Intra- and inter-operator variability shed light on the difficulty of defining ground truth based on manual segmentation. This finding emphasizes the need for a rigorous evaluation protocol for landmarking validation, including the verification of statistical test hypotheses and population size influence.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"191 ","pages":"Article 112292"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X2500378X","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
In preoperative planning, a number of techniques for registration of anatomical landmarks on bones, or “bones landmarking”, have been developed for automatic measurement of lower limbs. However, the evaluation of these techniques is often neglected, reducing confidence in results and limiting clinical applications. We propose a protocol to evaluate landmarking techniques applied to thirty 3D images of osteoarthritic lower limbs. The protocol includes the evaluation of the ground truth based on manual landmarking performed by two experienced clinicians. Intra- and inter-operator variability were quantified. Automatic landmarking was evaluated using the same procedure and compared to manual landmarking by computing distances with bounding boxes including manual landmarks. The method was applied to 87 landmarks on the femur, tibia, fibula, and patella. For intra-operator variability, the mean median error was 2.0 mm, with a maximum of 42.1 mm. For inter-operator variability, errors ranged from 0.0 to 43.5 mm, with a mean median error of 2.3 mm across all landmarks. The mean median error between manual and automatic landmarking methods was 2.4 mm, with a range from 0.0 mm to 22.8 mm. Confidence intervals of automatic landmarking errors were consistent with intra- and inter-operator variability confidence intervals. Approximately 42 % of the automatic landmarks were inside their bounding boxes, with a mean distance of 1.4 mm from the box for landmarks on the outside. Results showed different errors depending on landmark location and coordinates. Intra- and inter-operator variability shed light on the difficulty of defining ground truth based on manual segmentation. This finding emphasizes the need for a rigorous evaluation protocol for landmarking validation, including the verification of statistical test hypotheses and population size influence.
期刊介绍:
European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field.
Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.