Konrad Kwolek, Artur Gądek, Kamil Kwolek, Agnieszka Lechowska-Liszka, Michał Malczak, Henryk Liszka
{"title":"基于人工智能的足部前后位片诊断拇外翻。","authors":"Konrad Kwolek, Artur Gądek, Kamil Kwolek, Agnieszka Lechowska-Liszka, Michał Malczak, Henryk Liszka","doi":"10.5312/wjo.v16.i6.103832","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>A recently developed method enables automated measurement of the hallux valgus angle (HVA) and the first intermetatarsal angle (IMA) from weight-bearing foot radiographs. This approach employs bone segmentation to identify anatomical landmarks and provides standardized angle measurements based on established guidelines. While effective for HVA and IMA, preoperative radiograph analysis remains complex and requires additional measurements, such as the hallux interphalangeal angle (IPA), which has received limited research attention.</p><p><strong>Aim: </strong>To expand the previous method, which measured HVA and IMA, by incorporating the automatic measurement of IPA, evaluating its accuracy and clinical relevance.</p><p><strong>Methods: </strong>A preexisting database of manually labeled foot radiographs was used to train a U-Net neural network for segmenting bones and identifying landmarks necessary for IPA measurement. Of the 265 radiographs in the dataset, 161 were selected for training and 20 for validation. The U-Net neural network achieves a high mean Sørensen-Dice index (> 0.97). The remaining 84 radiographs were used to assess the reliability of automated IPA measurements against those taken manually by two orthopedic surgeons (O<sub>A</sub> and O<sub>B</sub>) using computer-based tools. Each measurement was repeated to assess intraobserver (O<sub>A1</sub> and O<sub>A2</sub>) and interobserver (O<sub>A2</sub> and O<sub>B</sub>) reliability. Agreement between automated and manual methods was evaluated using the Intraclass Correlation Coefficient (ICC), and Bland-Altman analysis identified systematic differences. Standard error of measurement (SEM) and Pearson correlation coefficients quantified precision and linearity, and measurement times were recorded to evaluate efficiency.</p><p><strong>Results: </strong>The artificial intelligence (AI)-based system demonstrated excellent reliability, with ICC3.1 values of 0.92 (AI <i>vs</i> O<sub>A2</sub>) and 0.88 (AI <i>vs</i> O<sub>B</sub>), both statistically significant (<i>P</i> < 0.001). For manual measurements, ICC values were 0.95 (O<sub>A2</sub> <i>vs</i> O<sub>A1</sub>) and 0.95 (O<sub>A2</sub> <i>vs</i> O<sub>B</sub>), supporting both intraobserver and interobserver reliability. Bland-Altman analysis revealed minimal biases of: (1) 1.61° (AI <i>vs</i> O<sub>A2</sub>); and (2) 2.54° (AI <i>vs</i> O<sub>B</sub>), with clinically acceptable limits of agreement. The AI system also showed high precision, as evidenced by low SEM values: (1) 1.22° (O<sub>A2</sub> <i>vs</i> O<sub>B</sub>); (2) 1.77° (AI <i>vs</i> O<sub>A2</sub>); and (3) 2.09° (AI <i>vs</i> O<sub>B</sub>). Furthermore, Pearson correlation coefficients confirmed strong linear relationships between automated and manual measurements, with <i>r</i> = 0.85 (AI <i>vs</i> O<sub>A2</sub>) and <i>r</i> = 0.90 (AI <i>vs</i> O<sub>B</sub>). The AI method significantly improved efficiency, completing all 84 measurements 8 times faster than manual methods, reducing the time required from an average 36 minutes to just 4.5 minutes.</p><p><strong>Conclusion: </strong>The proposed AI-assisted IPA measurement method shows strong clinical potential, effectively corresponding with manual measurements. Integrating IPA with HVA and IMA assessments provides a comprehensive tool for automated forefoot deformity analysis, supporting hallux valgus severity classification and preoperative planning, while offering substantial time savings in high-volume clinical settings.</p>","PeriodicalId":47843,"journal":{"name":"World Journal of Orthopedics","volume":"16 6","pages":"103832"},"PeriodicalIF":2.3000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12179893/pdf/","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence-based diagnosis of hallux valgus interphalangeus using anteroposterior foot radiographs.\",\"authors\":\"Konrad Kwolek, Artur Gądek, Kamil Kwolek, Agnieszka Lechowska-Liszka, Michał Malczak, Henryk Liszka\",\"doi\":\"10.5312/wjo.v16.i6.103832\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>A recently developed method enables automated measurement of the hallux valgus angle (HVA) and the first intermetatarsal angle (IMA) from weight-bearing foot radiographs. This approach employs bone segmentation to identify anatomical landmarks and provides standardized angle measurements based on established guidelines. While effective for HVA and IMA, preoperative radiograph analysis remains complex and requires additional measurements, such as the hallux interphalangeal angle (IPA), which has received limited research attention.</p><p><strong>Aim: </strong>To expand the previous method, which measured HVA and IMA, by incorporating the automatic measurement of IPA, evaluating its accuracy and clinical relevance.</p><p><strong>Methods: </strong>A preexisting database of manually labeled foot radiographs was used to train a U-Net neural network for segmenting bones and identifying landmarks necessary for IPA measurement. Of the 265 radiographs in the dataset, 161 were selected for training and 20 for validation. The U-Net neural network achieves a high mean Sørensen-Dice index (> 0.97). The remaining 84 radiographs were used to assess the reliability of automated IPA measurements against those taken manually by two orthopedic surgeons (O<sub>A</sub> and O<sub>B</sub>) using computer-based tools. Each measurement was repeated to assess intraobserver (O<sub>A1</sub> and O<sub>A2</sub>) and interobserver (O<sub>A2</sub> and O<sub>B</sub>) reliability. Agreement between automated and manual methods was evaluated using the Intraclass Correlation Coefficient (ICC), and Bland-Altman analysis identified systematic differences. Standard error of measurement (SEM) and Pearson correlation coefficients quantified precision and linearity, and measurement times were recorded to evaluate efficiency.</p><p><strong>Results: </strong>The artificial intelligence (AI)-based system demonstrated excellent reliability, with ICC3.1 values of 0.92 (AI <i>vs</i> O<sub>A2</sub>) and 0.88 (AI <i>vs</i> O<sub>B</sub>), both statistically significant (<i>P</i> < 0.001). For manual measurements, ICC values were 0.95 (O<sub>A2</sub> <i>vs</i> O<sub>A1</sub>) and 0.95 (O<sub>A2</sub> <i>vs</i> O<sub>B</sub>), supporting both intraobserver and interobserver reliability. Bland-Altman analysis revealed minimal biases of: (1) 1.61° (AI <i>vs</i> O<sub>A2</sub>); and (2) 2.54° (AI <i>vs</i> O<sub>B</sub>), with clinically acceptable limits of agreement. The AI system also showed high precision, as evidenced by low SEM values: (1) 1.22° (O<sub>A2</sub> <i>vs</i> O<sub>B</sub>); (2) 1.77° (AI <i>vs</i> O<sub>A2</sub>); and (3) 2.09° (AI <i>vs</i> O<sub>B</sub>). Furthermore, Pearson correlation coefficients confirmed strong linear relationships between automated and manual measurements, with <i>r</i> = 0.85 (AI <i>vs</i> O<sub>A2</sub>) and <i>r</i> = 0.90 (AI <i>vs</i> O<sub>B</sub>). The AI method significantly improved efficiency, completing all 84 measurements 8 times faster than manual methods, reducing the time required from an average 36 minutes to just 4.5 minutes.</p><p><strong>Conclusion: </strong>The proposed AI-assisted IPA measurement method shows strong clinical potential, effectively corresponding with manual measurements. Integrating IPA with HVA and IMA assessments provides a comprehensive tool for automated forefoot deformity analysis, supporting hallux valgus severity classification and preoperative planning, while offering substantial time savings in high-volume clinical settings.</p>\",\"PeriodicalId\":47843,\"journal\":{\"name\":\"World Journal of Orthopedics\",\"volume\":\"16 6\",\"pages\":\"103832\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12179893/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World Journal of Orthopedics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5312/wjo.v16.i6.103832\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Orthopedics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5312/wjo.v16.i6.103832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
摘要
背景:最近开发的一种方法可以自动测量负重足x线片上的拇外翻角(HVA)和第一跖间角(IMA)。该方法采用骨分割来识别解剖标志,并根据已建立的指南提供标准化的角度测量。虽然对HVA和IMA有效,但术前x线片分析仍然很复杂,需要额外的测量,如拇指间角(IPA),这方面的研究关注有限。目的:将IPA自动测量纳入HVA和IMA的测量方法,对其准确性和临床意义进行评价。方法:使用预先存在的手动标记足部x线照片数据库来训练U-Net神经网络,用于分割骨骼和识别IPA测量所需的地标。在数据集中的265张x光片中,选择161张用于训练,20张用于验证。U-Net神经网络实现了较高的平均Sørensen-Dice指数(> 0.97)。其余84张x线片用于评估自动IPA测量与两位骨科医生(OA和OB)使用基于计算机的工具手动测量的可靠性。重复每项测量以评估观察者内部(OA1和OA2)和观察者之间(OA2和OB)的可靠性。使用类内相关系数(ICC)评估了自动化和手动方法之间的一致性,Bland-Altman分析确定了系统差异。测量标准误差(SEM)和Pearson相关系数量化精密度和线性度,并记录测量次数以评价效率。结果:基于人工智能(AI)的系统具有良好的可靠性,ICC3.1值分别为0.92 (AI vs OA2)和0.88 (AI vs OB),均具有统计学意义(P < 0.001)。对于人工测量,ICC值为0.95 (OA2 vs OA1)和0.95 (OA2 vs OB),支持观察者内部和观察者之间的可靠性。Bland-Altman分析显示最小偏差:(1)1.61°(AI vs OA2);(2) 2.54°(AI vs OB),具有临床可接受的一致限度。人工智能系统也显示出很高的精度,从低SEM值可以看出:(1)1.22°(OA2 vs OB);(2) 1.77°(AI vs OA2);(3) 2.09°(AI vs OB)。此外,Pearson相关系数证实了自动测量和手动测量之间的强线性关系,r = 0.85 (AI vs OA2)和r = 0.90 (AI vs OB)。人工智能方法显著提高了效率,完成所有84项测量的速度比人工方法快8倍,将所需时间从平均36分钟减少到4.5分钟。结论:人工智能辅助IPA测量方法具有较强的临床应用潜力,可与人工测量方法有效对应。将IPA与HVA和IMA评估相结合,可为自动前足畸形分析提供全面的工具,支持拇外翻严重程度分类和术前规划,同时在大量临床设置中节省大量时间。
Artificial intelligence-based diagnosis of hallux valgus interphalangeus using anteroposterior foot radiographs.
Background: A recently developed method enables automated measurement of the hallux valgus angle (HVA) and the first intermetatarsal angle (IMA) from weight-bearing foot radiographs. This approach employs bone segmentation to identify anatomical landmarks and provides standardized angle measurements based on established guidelines. While effective for HVA and IMA, preoperative radiograph analysis remains complex and requires additional measurements, such as the hallux interphalangeal angle (IPA), which has received limited research attention.
Aim: To expand the previous method, which measured HVA and IMA, by incorporating the automatic measurement of IPA, evaluating its accuracy and clinical relevance.
Methods: A preexisting database of manually labeled foot radiographs was used to train a U-Net neural network for segmenting bones and identifying landmarks necessary for IPA measurement. Of the 265 radiographs in the dataset, 161 were selected for training and 20 for validation. The U-Net neural network achieves a high mean Sørensen-Dice index (> 0.97). The remaining 84 radiographs were used to assess the reliability of automated IPA measurements against those taken manually by two orthopedic surgeons (OA and OB) using computer-based tools. Each measurement was repeated to assess intraobserver (OA1 and OA2) and interobserver (OA2 and OB) reliability. Agreement between automated and manual methods was evaluated using the Intraclass Correlation Coefficient (ICC), and Bland-Altman analysis identified systematic differences. Standard error of measurement (SEM) and Pearson correlation coefficients quantified precision and linearity, and measurement times were recorded to evaluate efficiency.
Results: The artificial intelligence (AI)-based system demonstrated excellent reliability, with ICC3.1 values of 0.92 (AI vs OA2) and 0.88 (AI vs OB), both statistically significant (P < 0.001). For manual measurements, ICC values were 0.95 (OA2vs OA1) and 0.95 (OA2vs OB), supporting both intraobserver and interobserver reliability. Bland-Altman analysis revealed minimal biases of: (1) 1.61° (AI vs OA2); and (2) 2.54° (AI vs OB), with clinically acceptable limits of agreement. The AI system also showed high precision, as evidenced by low SEM values: (1) 1.22° (OA2vs OB); (2) 1.77° (AI vs OA2); and (3) 2.09° (AI vs OB). Furthermore, Pearson correlation coefficients confirmed strong linear relationships between automated and manual measurements, with r = 0.85 (AI vs OA2) and r = 0.90 (AI vs OB). The AI method significantly improved efficiency, completing all 84 measurements 8 times faster than manual methods, reducing the time required from an average 36 minutes to just 4.5 minutes.
Conclusion: The proposed AI-assisted IPA measurement method shows strong clinical potential, effectively corresponding with manual measurements. Integrating IPA with HVA and IMA assessments provides a comprehensive tool for automated forefoot deformity analysis, supporting hallux valgus severity classification and preoperative planning, while offering substantial time savings in high-volume clinical settings.