Emily Feng, Nishantha M Jayasuriya, Karim Rizwan Nathani, Konstantinos Katsos, Laura A Machlab, Graham W Johnson, Brett A Freedman, Mohamad Bydon
{"title":"Hounsfield单元在术前胸腰椎CT扫描中的人工智能图像分析:脊柱手术患者骨质疏松症的自动筛查","authors":"Emily Feng, Nishantha M Jayasuriya, Karim Rizwan Nathani, Konstantinos Katsos, Laura A Machlab, Graham W Johnson, Brett A Freedman, Mohamad Bydon","doi":"10.3171/2025.1.SPINE24900","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop an artificial intelligence (AI) model for automatically detecting Hounsfield unit (HU) values at the L1 vertebra in preoperative thoracolumbar CT scans. This model serves as a screening tool for osteoporosis in patients undergoing spine surgery, offering an alternative to traditional bone mineral density measurement methods like dual-energy x-ray absorptiometry.</p><p><strong>Methods: </strong>The authors utilized two CT scan datasets, comprising 501 images, which were split into training, validation, and test subsets. The nnU-Net framework was used for segmentation, followed by an algorithm to calculate HU values from the L1 vertebra. The model's performance was validated against manual HU calculations by expert raters on 56 CT scans. Statistical measures included the Dice coefficient, Pearson correlation coefficient, intraclass correlation coefficient (ICC), and Bland-Altman plots to assess the agreement between AI and human-derived HU measurements.</p><p><strong>Results: </strong>The AI model achieved a high Dice coefficient of 0.91 for vertebral segmentation. The Pearson correlation coefficient between AI-derived HU and human-derived HU values was 0.96, indicating strong agreement. ICC values for interrater reliability were 0.95 and 0.94 for raters 1 and 2, respectively. The mean difference between AI and human HU values was 7.0 HU, with limits of agreement ranging from -21.1 to 35.2 HU. A paired t-test showed no significant difference between AI and human measurements (p = 0.21).</p><p><strong>Conclusions: </strong>The AI model demonstrated strong agreement with human experts in measuring HU values, validating its potential as a reliable tool for automated osteoporosis screening in spine surgery patients. This approach can enhance preoperative risk assessment and perioperative bone health optimization. Future research should focus on external validation and inclusion of diverse patient demographics to ensure broader applicability.</p>","PeriodicalId":16562,"journal":{"name":"Journal of neurosurgery. Spine","volume":" ","pages":"1-8"},"PeriodicalIF":3.1000,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence image analysis for Hounsfield units in preoperative thoracolumbar CT scans: an automated screening for osteoporosis in patients undergoing spine surgery.\",\"authors\":\"Emily Feng, Nishantha M Jayasuriya, Karim Rizwan Nathani, Konstantinos Katsos, Laura A Machlab, Graham W Johnson, Brett A Freedman, Mohamad Bydon\",\"doi\":\"10.3171/2025.1.SPINE24900\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>This study aimed to develop an artificial intelligence (AI) model for automatically detecting Hounsfield unit (HU) values at the L1 vertebra in preoperative thoracolumbar CT scans. This model serves as a screening tool for osteoporosis in patients undergoing spine surgery, offering an alternative to traditional bone mineral density measurement methods like dual-energy x-ray absorptiometry.</p><p><strong>Methods: </strong>The authors utilized two CT scan datasets, comprising 501 images, which were split into training, validation, and test subsets. The nnU-Net framework was used for segmentation, followed by an algorithm to calculate HU values from the L1 vertebra. The model's performance was validated against manual HU calculations by expert raters on 56 CT scans. Statistical measures included the Dice coefficient, Pearson correlation coefficient, intraclass correlation coefficient (ICC), and Bland-Altman plots to assess the agreement between AI and human-derived HU measurements.</p><p><strong>Results: </strong>The AI model achieved a high Dice coefficient of 0.91 for vertebral segmentation. The Pearson correlation coefficient between AI-derived HU and human-derived HU values was 0.96, indicating strong agreement. ICC values for interrater reliability were 0.95 and 0.94 for raters 1 and 2, respectively. The mean difference between AI and human HU values was 7.0 HU, with limits of agreement ranging from -21.1 to 35.2 HU. A paired t-test showed no significant difference between AI and human measurements (p = 0.21).</p><p><strong>Conclusions: </strong>The AI model demonstrated strong agreement with human experts in measuring HU values, validating its potential as a reliable tool for automated osteoporosis screening in spine surgery patients. This approach can enhance preoperative risk assessment and perioperative bone health optimization. Future research should focus on external validation and inclusion of diverse patient demographics to ensure broader applicability.</p>\",\"PeriodicalId\":16562,\"journal\":{\"name\":\"Journal of neurosurgery. Spine\",\"volume\":\" \",\"pages\":\"1-8\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-04-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of neurosurgery. Spine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3171/2025.1.SPINE24900\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/1 0:00:00\",\"PubModel\":\"Print\",\"JCR\":\"Q2\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neurosurgery. Spine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3171/2025.1.SPINE24900","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/1 0:00:00","PubModel":"Print","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:本研究旨在建立一种人工智能(AI)模型,用于在术前胸腰椎CT扫描中自动检测L1椎体的Hounsfield unit (HU)值。该模型可作为脊柱手术患者骨质疏松症的筛查工具,为传统的骨密度测量方法(如双能x线吸收仪)提供了一种替代方法。方法:作者利用两个CT扫描数据集,包括501张图像,分为训练子集、验证子集和测试子集。使用nnU-Net框架进行分割,然后使用算法从L1椎体计算HU值。在56次CT扫描中,该模型的性能与专家评估人员的手动HU计算进行了验证。统计测量包括Dice系数、Pearson相关系数、类内相关系数(ICC)和Bland-Altman图,以评估人工智能与人类衍生的HU测量值之间的一致性。结果:人工智能模型在椎体分割上获得了0.91的高Dice系数。人工智能得到的HU值与人类得到的HU值的Pearson相关系数为0.96,表明高度一致。评价者间信度的ICC值分别为0.95和0.94的评价者1和2。人工智能与人类HU值的平均差异为7.0 HU,一致性范围为-21.1至35.2 HU。配对t检验显示人工智能与人类测量结果无显著差异(p = 0.21)。结论:人工智能模型在测量HU值方面与人类专家表现出强烈的一致性,验证了其作为脊柱手术患者骨质疏松症自动筛查的可靠工具的潜力。该方法可提高术前风险评估和围手术期骨健康优化。未来的研究应侧重于外部验证和纳入不同的患者人口统计数据,以确保更广泛的适用性。
Artificial intelligence image analysis for Hounsfield units in preoperative thoracolumbar CT scans: an automated screening for osteoporosis in patients undergoing spine surgery.
Objective: This study aimed to develop an artificial intelligence (AI) model for automatically detecting Hounsfield unit (HU) values at the L1 vertebra in preoperative thoracolumbar CT scans. This model serves as a screening tool for osteoporosis in patients undergoing spine surgery, offering an alternative to traditional bone mineral density measurement methods like dual-energy x-ray absorptiometry.
Methods: The authors utilized two CT scan datasets, comprising 501 images, which were split into training, validation, and test subsets. The nnU-Net framework was used for segmentation, followed by an algorithm to calculate HU values from the L1 vertebra. The model's performance was validated against manual HU calculations by expert raters on 56 CT scans. Statistical measures included the Dice coefficient, Pearson correlation coefficient, intraclass correlation coefficient (ICC), and Bland-Altman plots to assess the agreement between AI and human-derived HU measurements.
Results: The AI model achieved a high Dice coefficient of 0.91 for vertebral segmentation. The Pearson correlation coefficient between AI-derived HU and human-derived HU values was 0.96, indicating strong agreement. ICC values for interrater reliability were 0.95 and 0.94 for raters 1 and 2, respectively. The mean difference between AI and human HU values was 7.0 HU, with limits of agreement ranging from -21.1 to 35.2 HU. A paired t-test showed no significant difference between AI and human measurements (p = 0.21).
Conclusions: The AI model demonstrated strong agreement with human experts in measuring HU values, validating its potential as a reliable tool for automated osteoporosis screening in spine surgery patients. This approach can enhance preoperative risk assessment and perioperative bone health optimization. Future research should focus on external validation and inclusion of diverse patient demographics to ensure broader applicability.
期刊介绍:
Primarily publish original works in neurosurgery but also include studies in clinical neurophysiology, organic neurology, ophthalmology, radiology, pathology, and molecular biology.