使用复合损失函数的可变SegNet从术前和术后x射线图像中自动测量下肢角度

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-01-17 DOI:10.1002/ima.70027

Iyyakutty Dheivya, Gurunathan Saravana Kumar

{"title":"使用复合损失函数的可变SegNet从术前和术后x射线图像中自动测量下肢角度","authors":"Iyyakutty Dheivya, Gurunathan Saravana Kumar","doi":"10.1002/ima.70027","DOIUrl":null,"url":null,"abstract":"<div>\n \n This work envisages developing an automated computer workflow to locate the landmarks like knee center, tibial plateau, tibial and femoral axis to measure Femur-Tibia Angle (FTA), Medial Proximal Tibial Angle (MPTA), and Hip Knee Ankle Angle (HKAA) from the pre- and post-operative x-rays. In this work, we propose a variant of semantic segmentation model (vSegNet) for the segmentation of the knee and tibia gap for extracting important features used in the automated workflow. Since femur tibia gap is a small region as compared to the complete x-ray image, it poses severe class imbalance issue. Using a combination of the Dice coefficient and Hausdorff distance as a compound loss function, the proposed neural network model shows better segmentation performance as compared to state-of-the-art segmentation models like U-Net, SegNet (with and without VGG16 pre-trained weights), VGG16, MobileNetV2, Pretrained DeepLabv3+ (Resnet18 weights), and Pretrained FCN (VGG16 weights) and different loss functions. We subsequently propose computer methods for feature recognition and prediction of landmarks at femur, tibial and knee center, the side of the fibula and, subsequently, the various knee joint angles. An analysis of sensitivity of segmentation accuracy on the accuracy of predicted angles further substantiate the efficacy of the proposed methods. Dice score of U-Net, Pretrained SegNet, SegNet, VGG16, MobileNetV2, Pretrained DeepLabv3+, Pretrained FCN, vSegNet with cross-entropy loss function and vSegNet with compound loss function are observed as <math>\n <semantics>\n <mrow>\n <mn>0.083</mn>\n <mo>±</mo>\n <mn>0.04</mn>\n </mrow>\n <annotation>$$ 0.083\\pm 0.04 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.51</mn>\n <mo>±</mo>\n <mn>0.16</mn>\n </mrow>\n <annotation>$$ 0.51\\pm 0.16 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.66</mn>\n <mo>±</mo>\n <mn>0.20</mn>\n </mrow>\n <annotation>$$ 0.66\\pm 0.20 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.61</mn>\n <mo>±</mo>\n <mn>0.15</mn>\n </mrow>\n <annotation>$$ 0.61\\pm 0.15 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.17</mn>\n <mo>±</mo>\n <mn>0.16</mn>\n </mrow>\n <annotation>$$ 0.17\\pm 0.16 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.61</mn>\n <mo>±</mo>\n <mn>0.22</mn>\n </mrow>\n <annotation>$$ 0.61\\pm 0.22 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.504</mn>\n <mo>±</mo>\n <mn>0.13</mn>\n </mrow>\n <annotation>$$ 0.504\\pm 0.13 $$</annotation>\n </semantics></math>, <math>\n <semantics>\n <mrow>\n <mn>0.77</mn>\n <mo>±</mo>\n <mn>0.08</mn>\n </mrow>\n <annotation>$$ 0.77\\pm 0.08 $$</annotation>\n </semantics></math> and <math>\n <semantics>\n <mrow>\n <mn>0.95</mn>\n <mo>±</mo>\n <mn>0.02</mn>\n </mrow>\n <annotation>$$ 0.95\\pm 0.02 $$</annotation>\n </semantics></math> respectively. Using the proposed network vSegNet and the automated workflow, we obtained an Intraclass correlation of 0.999, 0.994, and 0.997 for the FTA, MPTA and HKAA measurements, respectively, with the ground truth. FTA, MPTA, and HKAA measurements using the proposed automatic pipeline positively correlated with the expert's measurement. The proposed vSegNet with compound loss function handles the challenges posed by class imbalance and obtains the best results as compared to other networks and loss functions tested in the work and also in comparison with contemporary works described in literature.\n </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic Measurement of Lower Limb Angles From Pre- and Post-Operative X-Ray Images Using a Variant SegNet With Compound Loss Function\",\"authors\":\"Iyyakutty Dheivya, Gurunathan Saravana Kumar\",\"doi\":\"10.1002/ima.70027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n This work envisages developing an automated computer workflow to locate the landmarks like knee center, tibial plateau, tibial and femoral axis to measure Femur-Tibia Angle (FTA), Medial Proximal Tibial Angle (MPTA), and Hip Knee Ankle Angle (HKAA) from the pre- and post-operative x-rays. In this work, we propose a variant of semantic segmentation model (vSegNet) for the segmentation of the knee and tibia gap for extracting important features used in the automated workflow. Since femur tibia gap is a small region as compared to the complete x-ray image, it poses severe class imbalance issue. Using a combination of the Dice coefficient and Hausdorff distance as a compound loss function, the proposed neural network model shows better segmentation performance as compared to state-of-the-art segmentation models like U-Net, SegNet (with and without VGG16 pre-trained weights), VGG16, MobileNetV2, Pretrained DeepLabv3+ (Resnet18 weights), and Pretrained FCN (VGG16 weights) and different loss functions. We subsequently propose computer methods for feature recognition and prediction of landmarks at femur, tibial and knee center, the side of the fibula and, subsequently, the various knee joint angles. An analysis of sensitivity of segmentation accuracy on the accuracy of predicted angles further substantiate the efficacy of the proposed methods. Dice score of U-Net, Pretrained SegNet, SegNet, VGG16, MobileNetV2, Pretrained DeepLabv3+, Pretrained FCN, vSegNet with cross-entropy loss function and vSegNet with compound loss function are observed as <math>\\n <semantics>\\n <mrow>\\n <mn>0.083</mn>\\n <mo>±</mo>\\n <mn>0.04</mn>\\n </mrow>\\n <annotation>$$ 0.083\\\\pm 0.04 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.51</mn>\\n <mo>±</mo>\\n <mn>0.16</mn>\\n </mrow>\\n <annotation>$$ 0.51\\\\pm 0.16 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.66</mn>\\n <mo>±</mo>\\n <mn>0.20</mn>\\n </mrow>\\n <annotation>$$ 0.66\\\\pm 0.20 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.61</mn>\\n <mo>±</mo>\\n <mn>0.15</mn>\\n </mrow>\\n <annotation>$$ 0.61\\\\pm 0.15 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.17</mn>\\n <mo>±</mo>\\n <mn>0.16</mn>\\n </mrow>\\n <annotation>$$ 0.17\\\\pm 0.16 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.61</mn>\\n <mo>±</mo>\\n <mn>0.22</mn>\\n </mrow>\\n <annotation>$$ 0.61\\\\pm 0.22 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.504</mn>\\n <mo>±</mo>\\n <mn>0.13</mn>\\n </mrow>\\n <annotation>$$ 0.504\\\\pm 0.13 $$</annotation>\\n </semantics></math>, <math>\\n <semantics>\\n <mrow>\\n <mn>0.77</mn>\\n <mo>±</mo>\\n <mn>0.08</mn>\\n </mrow>\\n <annotation>$$ 0.77\\\\pm 0.08 $$</annotation>\\n </semantics></math> and <math>\\n <semantics>\\n <mrow>\\n <mn>0.95</mn>\\n <mo>±</mo>\\n <mn>0.02</mn>\\n </mrow>\\n <annotation>$$ 0.95\\\\pm 0.02 $$</annotation>\\n </semantics></math> respectively. Using the proposed network vSegNet and the automated workflow, we obtained an Intraclass correlation of 0.999, 0.994, and 0.997 for the FTA, MPTA and HKAA measurements, respectively, with the ground truth. FTA, MPTA, and HKAA measurements using the proposed automatic pipeline positively correlated with the expert's measurement. The proposed vSegNet with compound loss function handles the challenges posed by class imbalance and obtains the best results as compared to other networks and loss functions tested in the work and also in comparison with contemporary works described in literature.\\n </div>\",\"PeriodicalId\":14027,\"journal\":{\"name\":\"International Journal of Imaging Systems and Technology\",\"volume\":\"35 1\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-01-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Imaging Systems and Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ima.70027\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70027","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

这项工作设想开发一种自动化的计算机工作流程，以定位膝关节中心、胫骨平台、胫骨和股轴等地标，并从术前和术后的x射线中测量股骨-胫骨角（FTA）、胫骨内侧近端角（MPTA）和髋关节-膝关节角（HKAA）。在这项工作中，我们提出了一种语义分割模型（vSegNet）的变体，用于膝关节和胫骨间隙的分割，以提取自动化工作流程中使用的重要特征。由于股骨胫骨间隙相对于完整的x线图像是一个很小的区域，它造成了严重的类不平衡问题。结合Dice系数和Hausdorff距离作为复合损失函数，与U-Net、SegNet（有和没有VGG16预训练权值）、VGG16、MobileNetV2、预训练DeepLabv3+ （Resnet18权值）、预训练FCN （VGG16权值）和不同损失函数等最先进的分割模型相比，所提出的神经网络模型具有更好的分割性能。我们随后提出了特征识别和预测股骨、胫骨和膝关节中心、腓骨侧面以及随后的各种膝关节角度的标志的计算机方法。分析了分割精度对预测角度精度的敏感性，进一步验证了所提方法的有效性。U-Net、预训练的SegNet、SegNet、VGG16、MobileNetV2、预训练的DeepLabv3+、预训练的FCN、带交叉熵损失函数的vSegNet和带复合损失函数的vSegNet的骰子得分分别为0.083±0.04 $$ 0.083\pm 0.04 $$。0.51±0.16 $$ 0.51\pm 0.16 $$, 0.66±0.20 $$ 0.66\pm 0.20 $$, 0.61±0.15 $$ 0.61\pm 0.15 $$，0.17±0.16 $$ 0.17\pm 0.16 $$, 0.61±0.22 $$ 0.61\pm 0.22 $$, 0.504±0.13 $$ 0.504\pm 0.13 $$，分别为0.77±0.08 $$ 0.77\pm 0.08 $$和0.95±0.02 $$ 0.95\pm 0.02 $$。使用所提出的网络vSegNet和自动化工作流，我们分别获得了FTA， MPTA和HKAA测量值与地面真实值的类内相关性为0.999,0.994和0.997。FTA、MPTA和HKAA测量使用拟议的自动管道与专家的测量呈正相关。所提出的具有复合损失函数的vSegNet处理了类不平衡带来的挑战，与工作中测试的其他网络和损失函数以及与文献中描述的当代作品相比，获得了最好的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic Measurement of Lower Limb Angles From Pre- and Post-Operative X-Ray Images Using a Variant SegNet With Compound Loss Function

This work envisages developing an automated computer workflow to locate the landmarks like knee center, tibial plateau, tibial and femoral axis to measure Femur-Tibia Angle (FTA), Medial Proximal Tibial Angle (MPTA), and Hip Knee Ankle Angle (HKAA) from the pre- and post-operative x-rays. In this work, we propose a variant of semantic segmentation model (vSegNet) for the segmentation of the knee and tibia gap for extracting important features used in the automated workflow. Since femur tibia gap is a small region as compared to the complete x-ray image, it poses severe class imbalance issue. Using a combination of the Dice coefficient and Hausdorff distance as a compound loss function, the proposed neural network model shows better segmentation performance as compared to state-of-the-art segmentation models like U-Net, SegNet (with and without VGG16 pre-trained weights), VGG16, MobileNetV2, Pretrained DeepLabv3+ (Resnet18 weights), and Pretrained FCN (VGG16 weights) and different loss functions. We subsequently propose computer methods for feature recognition and prediction of landmarks at femur, tibial and knee center, the side of the fibula and, subsequently, the various knee joint angles. An analysis of sensitivity of segmentation accuracy on the accuracy of predicted angles further substantiate the efficacy of the proposed methods. Dice score of U-Net, Pretrained SegNet, SegNet, VGG16, MobileNetV2, Pretrained DeepLabv3+, Pretrained FCN, vSegNet with cross-entropy loss function and vSegNet with compound loss function are observed as $0.083 \pm 0.04$ , $0.51 \pm 0.16$ , $0.66 \pm 0.20$ , $0.61 \pm 0.15$ , $0.17 \pm 0.16$ , $0.61 \pm 0.22$ , $0.504 \pm 0.13$ , $0.77 \pm 0.08$ and $0.95 \pm 0.02$ respectively. Using the proposed network vSegNet and the automated workflow, we obtained an Intraclass correlation of 0.999, 0.994, and 0.997 for the FTA, MPTA and HKAA measurements, respectively, with the ground truth. FTA, MPTA, and HKAA measurements using the proposed automatic pipeline positively correlated with the expert's measurement. The proposed vSegNet with compound loss function handles the challenges posed by class imbalance and obtains the best results as compared to other networks and loss functions tested in the work and also in comparison with contemporary works described in literature.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.