Nancy Park B.S. , Johannes Sieberer M.Sc. , Armita Manafzadeh Ph.D. , Rieke-Marie Hackbarth , Shelby Desroches B.S. , Rithvik Ghankot B.S. , John Lynch Ph.D. , Neil A. Segal M.D. , Joshua Stefanik Ph.D. , David Felson M.D. , John P. Fulkerson M.D.
{"title":"Semiautomated Three-Dimensional Landmark Placement on Knee Models Is a Reliable Method to Describe Bone Shape and Alignment","authors":"Nancy Park B.S. , Johannes Sieberer M.Sc. , Armita Manafzadeh Ph.D. , Rieke-Marie Hackbarth , Shelby Desroches B.S. , Rithvik Ghankot B.S. , John Lynch Ph.D. , Neil A. Segal M.D. , Joshua Stefanik Ph.D. , David Felson M.D. , John P. Fulkerson M.D.","doi":"10.1016/j.asmr.2024.101036","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>To assess the inter- and intrarater reliability of 21 anatomical landmarks initially placed with an artificial intelligence algorithm and then manually verified with human input.</div></div><div><h3>Methods</h3><div>Thirty computed tomography scans of the knees of participants from the Multicenter Osteoarthritis Study (MOST) ages 45 to 55 years were included. Approximately one-half experienced progression of patellofemoral osteoarthritis, defined as an increased cartilage score in the patellofemoral compartment on magnetic resonance imaging over 2 years. The algorithm automatically placed 19 anatomic landmarks on the femur, tibia, and patella. An additional 2 landmarks were added manually. Two landmark reviewers separately reviewed all 30 scans and verified all landmarks. After 2 weeks, one reviewer repeated the process for the same dataset. The mean Euclidean distance of manual landmark displacement, mean absolute disagreement between and within raters, and intraclass correlation coefficients for inter- and intrarater reliability were calculated.</div></div><div><h3>Results</h3><div>All landmarks had excellent inter-rater reliability. The tibial and femoral shaft centers had intraclass correlation coefficients (ICCs) of 1, indicating their positions did not differ. Seventeen landmarks had ICCs between 0.90 and 0.99 and the tibial tuberosity had an ICC of 0.87. Intrarater reliability scores were 1 for 16 landmarks and between 0.90 and 0.99 for the remaining 5.</div></div><div><h3>Conclusions</h3><div>There was excellent agreement on the locations of all 21 landmarks evaluated in this study.</div></div><div><h3>Clinical Relevance</h3><div>The potential role of artificial intelligence in medical imaging and orthopaedic research is a growing area of interest. The excellent reliability demonstrated across multiple landmarks in our study reveals the potential for semiautomated 3-dimensional methods to enhance precision of anatomical measurements of the knee over 2-dimensional methods.</div></div>","PeriodicalId":34631,"journal":{"name":"Arthroscopy Sports Medicine and Rehabilitation","volume":"7 2","pages":"Article 101036"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arthroscopy Sports Medicine and Rehabilitation","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666061X24001792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
To assess the inter- and intrarater reliability of 21 anatomical landmarks initially placed with an artificial intelligence algorithm and then manually verified with human input.
Methods
Thirty computed tomography scans of the knees of participants from the Multicenter Osteoarthritis Study (MOST) ages 45 to 55 years were included. Approximately one-half experienced progression of patellofemoral osteoarthritis, defined as an increased cartilage score in the patellofemoral compartment on magnetic resonance imaging over 2 years. The algorithm automatically placed 19 anatomic landmarks on the femur, tibia, and patella. An additional 2 landmarks were added manually. Two landmark reviewers separately reviewed all 30 scans and verified all landmarks. After 2 weeks, one reviewer repeated the process for the same dataset. The mean Euclidean distance of manual landmark displacement, mean absolute disagreement between and within raters, and intraclass correlation coefficients for inter- and intrarater reliability were calculated.
Results
All landmarks had excellent inter-rater reliability. The tibial and femoral shaft centers had intraclass correlation coefficients (ICCs) of 1, indicating their positions did not differ. Seventeen landmarks had ICCs between 0.90 and 0.99 and the tibial tuberosity had an ICC of 0.87. Intrarater reliability scores were 1 for 16 landmarks and between 0.90 and 0.99 for the remaining 5.
Conclusions
There was excellent agreement on the locations of all 21 landmarks evaluated in this study.
Clinical Relevance
The potential role of artificial intelligence in medical imaging and orthopaedic research is a growing area of interest. The excellent reliability demonstrated across multiple landmarks in our study reveals the potential for semiautomated 3-dimensional methods to enhance precision of anatomical measurements of the knee over 2-dimensional methods.