HIPPO artificial intelligence: Correlating automated radiographic femoroacetabular measurements with patient-reported outcomes in developmental hip dysplasia.
Ahmed Alshaikhsalama, Holden Archer, Yin Xi, Richard Ljuhar, Joel E Wells, Avneesh Chhabra
{"title":"HIPPO artificial intelligence: Correlating automated radiographic femoroacetabular measurements with patient-reported outcomes in developmental hip dysplasia.","authors":"Ahmed Alshaikhsalama, Holden Archer, Yin Xi, Richard Ljuhar, Joel E Wells, Avneesh Chhabra","doi":"10.5493/wjem.v14.i4.99359","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Hip dysplasia (HD) is characterized by insufficient acetabular coverage of the femoral head, leading to a predisposition for osteoarthritis. While radiographic measurements such as the lateral center edge angle (LCEA) and Tönnis angle are essential in evaluating HD severity, patient-reported outcome measures (PROMs) offer insights into the subjective health impact on patients.</p><p><strong>Aim: </strong>To investigate the correlations between machine-learning automated and manual radiographic measurements of HD and PROMs with the hypothesis that artificial intelligence (AI)-generated HD measurements indicating less severe dysplasia correlate with better PROMs.</p><p><strong>Methods: </strong>Retrospective study evaluating 256 hips from 130 HD patients from a hip preservation clinic database. Manual and AI-derived radiographic measurements were collected and PROMs such as the Harris hip score (HHS), international hip outcome tool (iHOT-12), short form (SF) 12 (SF-12), and Visual Analogue Scale of the European Quality of Life Group survey were correlated using Spearman's rank-order correlation.</p><p><strong>Results: </strong>The median patient age was 28.6 years (range 15.7-62.3 years) with 82.3% of patients being women and 17.7% being men. The median interpretation time for manual readers and AI ranged between 4-12 minutes per patient and 31 seconds, respectively. Manual measurements exhibited weak correlations with HHS, including LCEA (<i>r</i> = 0.18) and Tönnis angle (<i>r</i> = -0.24). AI-derived metrics showed similar weak correlations, with the most significant being Caput-Collum-Diaphyseal (CCD) with iHOT-12 at <i>r</i> = -0.25 (<i>P</i> = 0.042) and CCD with SF-12 at <i>r</i> = 0.25 (<i>P</i> = 0.048). Other measured correlations were not significant (<i>P</i> > 0.05).</p><p><strong>Conclusion: </strong>This study suggests AI can aid in HD assessment, but weak PROM correlations highlight their continued importance in predicting subjective health and outcomes, complementing AI-derived measurements in HD management.</p>","PeriodicalId":75340,"journal":{"name":"World journal of experimental medicine","volume":"14 4","pages":"99359"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11551701/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World journal of experimental medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5493/wjem.v14.i4.99359","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Hip dysplasia (HD) is characterized by insufficient acetabular coverage of the femoral head, leading to a predisposition for osteoarthritis. While radiographic measurements such as the lateral center edge angle (LCEA) and Tönnis angle are essential in evaluating HD severity, patient-reported outcome measures (PROMs) offer insights into the subjective health impact on patients.
Aim: To investigate the correlations between machine-learning automated and manual radiographic measurements of HD and PROMs with the hypothesis that artificial intelligence (AI)-generated HD measurements indicating less severe dysplasia correlate with better PROMs.
Methods: Retrospective study evaluating 256 hips from 130 HD patients from a hip preservation clinic database. Manual and AI-derived radiographic measurements were collected and PROMs such as the Harris hip score (HHS), international hip outcome tool (iHOT-12), short form (SF) 12 (SF-12), and Visual Analogue Scale of the European Quality of Life Group survey were correlated using Spearman's rank-order correlation.
Results: The median patient age was 28.6 years (range 15.7-62.3 years) with 82.3% of patients being women and 17.7% being men. The median interpretation time for manual readers and AI ranged between 4-12 minutes per patient and 31 seconds, respectively. Manual measurements exhibited weak correlations with HHS, including LCEA (r = 0.18) and Tönnis angle (r = -0.24). AI-derived metrics showed similar weak correlations, with the most significant being Caput-Collum-Diaphyseal (CCD) with iHOT-12 at r = -0.25 (P = 0.042) and CCD with SF-12 at r = 0.25 (P = 0.048). Other measured correlations were not significant (P > 0.05).
Conclusion: This study suggests AI can aid in HD assessment, but weak PROM correlations highlight their continued importance in predicting subjective health and outcomes, complementing AI-derived measurements in HD management.