四维肝脏超声标志物标记的观察者间和观察者内变异分析。

IF 1.7 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Journal of Medical Imaging Pub Date : 2025-09-01 Epub Date: 2025-06-30 DOI:10.1117/1.JMI.12.5.051807

Daniel Wulff, Floris Ernst

{"title":"四维肝脏超声标志物标记的观察者间和观察者内变异分析。","authors":"Daniel Wulff, Floris Ernst","doi":"10.1117/1.JMI.12.5.051807","DOIUrl":null,"url":null,"abstract":"Purpose: Four-dimensional (4D) ultrasound imaging is widely used in clinics for diagnostics and therapy guidance. Accurate target tracking in 4D ultrasound is crucial for autonomous therapy guidance systems, such as radiotherapy, where precise tumor localization ensures effective treatment. Supervised deep learning approaches rely on reliable ground truth, making accurate labels essential. We investigate the reliability of expert-labeled ground truth data by evaluating intra- and inter-observer variability in landmark labeling for 4D ultrasound imaging in the liver.Approach: Eight 4D liver ultrasound sequences were labeled by eight expert observers, each labeling eight landmarks three times. Intra- and inter-observer variability was quantified, and observer survey and motion analysis were conducted to determine factors influencing labeling accuracy, such as ultrasound artifacts and motion amplitude.Results: The mean intra-observer variability ranged from <math><mrow><mn>1.58</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>0.90</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> to <math><mrow><mn>2.05</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.22</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> depending on the observer. The inter-observer variability for the two observer groups was <math><mrow><mn>2.68</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.69</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> and <math><mrow><mn>3.06</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.74</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> . The observer survey and motion analysis revealed that ultrasound artifacts significantly affected labeling accuracy due to limited landmark visibility, whereas motion amplitude had no measurable effect. Our measured mean landmark motion was <math><mrow><mn>11.56</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>5.86</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> .Conclusions: We highlight variability in expert-labeled ground truth data for 4D ultrasound imaging and identify ultrasound artifacts as a major source of labeling inaccuracies. These findings underscore the importance of addressing observer variability and artifact-related challenges to improve the reliability of ground truth data for evaluating target tracking algorithms in 4D ultrasound applications.","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 5","pages":"051807"},"PeriodicalIF":1.7000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12207815/pdf/","citationCount":"0","resultStr":"{\"title\":\"Analysis of intra- and inter-observer variability in 4D liver ultrasound landmark labeling.\",\"authors\":\"Daniel Wulff, Floris Ernst\",\"doi\":\"10.1117/1.JMI.12.5.051807\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: Four-dimensional (4D) ultrasound imaging is widely used in clinics for diagnostics and therapy guidance. Accurate target tracking in 4D ultrasound is crucial for autonomous therapy guidance systems, such as radiotherapy, where precise tumor localization ensures effective treatment. Supervised deep learning approaches rely on reliable ground truth, making accurate labels essential. We investigate the reliability of expert-labeled ground truth data by evaluating intra- and inter-observer variability in landmark labeling for 4D ultrasound imaging in the liver.Approach: Eight 4D liver ultrasound sequences were labeled by eight expert observers, each labeling eight landmarks three times. Intra- and inter-observer variability was quantified, and observer survey and motion analysis were conducted to determine factors influencing labeling accuracy, such as ultrasound artifacts and motion amplitude.Results: The mean intra-observer variability ranged from <math><mrow><mn>1.58</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>0.90</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> to <math><mrow><mn>2.05</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.22</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> depending on the observer. The inter-observer variability for the two observer groups was <math><mrow><mn>2.68</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.69</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> and <math><mrow><mn>3.06</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>1.74</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> . The observer survey and motion analysis revealed that ultrasound artifacts significantly affected labeling accuracy due to limited landmark visibility, whereas motion amplitude had no measurable effect. Our measured mean landmark motion was <math><mrow><mn>11.56</mn> <mtext> </mtext> <mi>mm</mi> <mo>±</mo> <mn>5.86</mn> <mtext> </mtext> <mi>mm</mi></mrow> </math> .Conclusions: We highlight variability in expert-labeled ground truth data for 4D ultrasound imaging and identify ultrasound artifacts as a major source of labeling inaccuracies. These findings underscore the importance of addressing observer variability and artifact-related challenges to improve the reliability of ground truth data for evaluating target tracking algorithms in 4D ultrasound applications.\",\"PeriodicalId\":47707,\"journal\":{\"name\":\"Journal of Medical Imaging\",\"volume\":\"12 5\",\"pages\":\"051807\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12207815/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Medical Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1117/1.JMI.12.5.051807\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.5.051807","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

目的：四维超声成像广泛应用于临床诊断和治疗指导。在四维超声中精确的目标跟踪对于自主治疗引导系统至关重要，例如放射治疗，其中精确的肿瘤定位确保有效治疗。有监督的深度学习方法依赖于可靠的真实情况，因此准确的标签至关重要。我们通过评估肝脏四维超声成像中地标标记的观察者内部和观察者之间的可变性来研究专家标记的真实数据的可靠性。方法：8个4D肝脏超声序列由8名专家观察者标记，每个标记3次，标记8个地标。量化观察者内部和观察者之间的可变性，并进行观察者调查和运动分析，以确定影响标记准确性的因素，如超声伪影和运动幅度。结果：根据观察者的不同，观察者内部的平均变异范围为1.58 mm±0.90 mm至2.05 mm±1.22 mm。两个观察组的观察者间变异分别为2.68 mm±1.69 mm和3.06 mm±1.74 mm。观察者调查和运动分析显示，由于地标可见性有限，超声伪影显著影响标记准确性，而运动幅度没有可测量的影响。我们测量到的平均地标运动为11.56 mm±5.86 mm。结论：我们强调了专家标记的四维超声成像地面真值数据的可变性，并确定超声伪影是标记不准确的主要来源。这些发现强调了解决观测者可变性和伪影相关挑战的重要性，以提高评估4D超声应用中目标跟踪算法的地面真实数据的可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analysis of intra- and inter-observer variability in 4D liver ultrasound landmark labeling.

Purpose: Four-dimensional (4D) ultrasound imaging is widely used in clinics for diagnostics and therapy guidance. Accurate target tracking in 4D ultrasound is crucial for autonomous therapy guidance systems, such as radiotherapy, where precise tumor localization ensures effective treatment. Supervised deep learning approaches rely on reliable ground truth, making accurate labels essential. We investigate the reliability of expert-labeled ground truth data by evaluating intra- and inter-observer variability in landmark labeling for 4D ultrasound imaging in the liver.

Approach: Eight 4D liver ultrasound sequences were labeled by eight expert observers, each labeling eight landmarks three times. Intra- and inter-observer variability was quantified, and observer survey and motion analysis were conducted to determine factors influencing labeling accuracy, such as ultrasound artifacts and motion amplitude.

Results: The mean intra-observer variability ranged from $1.58 mm \pm 0.90 mm$ to $2.05 mm \pm 1.22 mm$ depending on the observer. The inter-observer variability for the two observer groups was $2.68 mm \pm 1.69 mm$ and $3.06 mm \pm 1.74 mm$ . The observer survey and motion analysis revealed that ultrasound artifacts significantly affected labeling accuracy due to limited landmark visibility, whereas motion amplitude had no measurable effect. Our measured mean landmark motion was $11.56 mm \pm 5.86 mm$ .

Conclusions: We highlight variability in expert-labeled ground truth data for 4D ultrasound imaging and identify ultrasound artifacts as a major source of labeling inaccuracies. These findings underscore the importance of addressing observer variability and artifact-related challenges to improve the reliability of ground truth data for evaluating target tracking algorithms in 4D ultrasound applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.10

自引率

4.20%

发文量

期刊介绍： JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.