Alison Deslandes, Daniel Petashvili, Hu Wang, Gustavo Carneiro, Jodie Avery, George Condous, Mathew Leonardi, M. Louise Hull, Hsiang-Ting Chen
{"title":"基于多注释器标签的阴道超声图像质量自动评估的新型机器学习模型的开发","authors":"Alison Deslandes, Daniel Petashvili, Hu Wang, Gustavo Carnerio, Jodie Avery, George Condous, Mathew Leonardi, M. Louise Hull, Hsiang-Ting Chen","doi":"10.1002/ajum.70026","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Objectives</h3>\n \n <p>Accurate diagnosis of pathology from ultrasound images is reliant upon images of a suitable diagnostic quality being acquired. This study aimed to create a novel machine learning model to automatically assess transvaginal ultrasound (TVUS) image quality for gynaecological ultrasound.</p>\n </section>\n \n <section>\n \n <h3> Method</h3>\n \n <p>Six imaging professionals (two sonographers, two gynaecological sonologists and two radiologists) assigned a quality score to 150 TVUS images from 50 cases (50 uterus images and 100 ovary images). Images were given a score of 1–4 (1—reject/image inaccurate, 2—poor quality, 3—suboptimal quality or 4—optimal quality). As variation existed between the scores assigned by the labellers, we framed this problem as a multi-annotator noisy label problem. To address this, a new machine learning architecture was developed, combining a weighted ensemble algorithm to estimate consensus labels and a multi-axis vision transformer (MaxViT) to handle noisy labels, improving model accuracy in predicting image quality. Forty cases (120 images) were used for model training, while the remaining 10 cases (30 images) were reserved as a test set for model evaluation.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The novel machine learning architecture we created was able to successfully determine image quality with a validation accuracy of 80% and a macro average recall of 77%. This significantly improved upon the 57% accuracy of the baseline machine learning method (ResNet50). The MaxViTs were able to outperform human performance in most cases, with an accuracy of 80% surpassing four of the six human labellers.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This novel machine learning model offers an automated method of assessing the quality of TVUS images. The tool has the potential to provide real-time feedback to those performing TVUS, reduce the need for repeated imaging, and improve the diagnosis of gynaecological pathology.</p>\n </section>\n </div>","PeriodicalId":36517,"journal":{"name":"Australasian Journal of Ultrasound in Medicine","volume":"28 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ajum.70026","citationCount":"0","resultStr":"{\"title\":\"Development of a Novel Machine Learning Model for Automatic Assessment of Quality of Transvaginal Ultrasound Images From Multi-Annotator Labels\",\"authors\":\"Alison Deslandes, Daniel Petashvili, Hu Wang, Gustavo Carnerio, Jodie Avery, George Condous, Mathew Leonardi, M. Louise Hull, Hsiang-Ting Chen\",\"doi\":\"10.1002/ajum.70026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Objectives</h3>\\n \\n <p>Accurate diagnosis of pathology from ultrasound images is reliant upon images of a suitable diagnostic quality being acquired. 
This study aimed to create a novel machine learning model to automatically assess transvaginal ultrasound (TVUS) image quality for gynaecological ultrasound.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Method</h3>\\n \\n <p>Six imaging professionals (two sonographers, two gynaecological sonologists and two radiologists) assigned a quality score to 150 TVUS images from 50 cases (50 uterus images and 100 ovary images). Images were given a score of 1–4 (1—reject/image inaccurate, 2—poor quality, 3—suboptimal quality or 4—optimal quality). As variation existed between the scores assigned by the labellers, we framed this problem as a multi-annotator noisy label problem. To address this, a new machine learning architecture was developed, combining a weighted ensemble algorithm to estimate consensus labels and a multi-axis vision transformer (MaxViT) to handle noisy labels, improving model accuracy in predicting image quality. Forty cases (120 images) were used for model training, while the remaining 10 cases (30 images) were reserved as a test set for model evaluation.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>The novel machine learning architecture we created was able to successfully determine image quality with a validation accuracy of 80% and a macro average recall of 77%. This significantly improved upon the 57% accuracy of the baseline machine learning method (ResNet50). The MaxViTs were able to outperform human performance in most cases, with an accuracy of 80% surpassing four of the six human labellers.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>This novel machine learning model offers an automated method of assessing the quality of TVUS images. The tool has the potential to provide real-time feedback to those performing TVUS, reduce the need for repeated imaging, and improve the diagnosis of gynaecological pathology.</p>\\n </section>\\n </div>\",\"PeriodicalId\":36517,\"journal\":{\"name\":\"Australasian Journal of Ultrasound in Medicine\",\"volume\":\"28 4\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ajum.70026\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Australasian Journal of Ultrasound in Medicine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ajum.70026\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australasian Journal of Ultrasound in Medicine","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ajum.70026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
Development of a Novel Machine Learning Model for Automatic Assessment of Quality of Transvaginal Ultrasound Images From Multi-Annotator Labels
Objectives
Accurate diagnosis of pathology from ultrasound images relies on acquiring images of suitable diagnostic quality. This study aimed to create a novel machine learning model to automatically assess transvaginal ultrasound (TVUS) image quality for gynaecological ultrasound.
Method
Six imaging professionals (two sonographers, two gynaecological sonologists and two radiologists) assigned a quality score to 150 TVUS images from 50 cases (50 uterus images and 100 ovary images). Each image was scored from 1 to 4 (1 = reject/image inaccurate, 2 = poor quality, 3 = suboptimal quality, 4 = optimal quality). Because the labellers' scores varied, we framed the task as a multi-annotator noisy label problem. To address this, a new machine learning architecture was developed that combines a weighted ensemble algorithm to estimate consensus labels with a multi-axis vision transformer (MaxViT) trained to handle the noisy labels, improving accuracy in predicting image quality. Forty cases (120 images) were used for model training, and the remaining 10 cases (30 images) were reserved as a test set for model evaluation.
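The abstract does not specify the weighted ensemble algorithm, but the Python sketch below illustrates one simple way a multi-annotator consensus step can work: each annotator is weighted by their agreement with a plain majority vote, and a weighted vote then produces a single quality label per image. The weighting scheme, function name and synthetic scores are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of weighted consensus label estimation from multiple annotators.
# Assumption: weights are derived from each annotator's agreement with a simple
# majority vote; the paper's actual weighted ensemble may differ.
import numpy as np

def estimate_consensus(labels: np.ndarray, n_classes: int = 4) -> np.ndarray:
    """labels: (n_images, n_annotators) array of quality scores in {1, ..., n_classes}."""
    # One-hot encode each annotator's scores: (n_images, n_annotators, n_classes)
    one_hot = np.eye(n_classes)[labels - 1]

    # Unweighted majority vote as an initial consensus estimate
    majority = one_hot.sum(axis=1).argmax(axis=1) + 1

    # Weight each annotator by how often they agree with the majority vote
    agreement = (labels == majority[:, None]).mean(axis=0)   # shape: (n_annotators,)
    weights = agreement / agreement.sum()

    # Weighted vote: sum one-hot scores scaled by each annotator's weight
    weighted_votes = (one_hot * weights[None, :, None]).sum(axis=1)
    return weighted_votes.argmax(axis=1) + 1                 # consensus scores in 1..n_classes

# Example with synthetic data: 150 images scored 1-4 by 6 annotators
rng = np.random.default_rng(0)
scores = rng.integers(1, 5, size=(150, 6))
consensus = estimate_consensus(scores)
print(consensus[:10])
```

In such a scheme, the consensus labels would serve as training targets for the MaxViT image classifier, with the noisy-label handling addressing residual disagreement.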
Results
The novel machine learning architecture successfully determined image quality with a validation accuracy of 80% and a macro-average recall of 77%, a significant improvement on the 57% accuracy of the baseline machine learning method (ResNet50). The MaxViT models also outperformed humans in most cases, with their 80% accuracy surpassing four of the six human labellers.
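For clarity on the two reported metrics: macro-average recall averages the per-class recall so that each quality grade counts equally regardless of how many test images fall into it, which matters when the grades are imbalanced. The snippet below shows how these metrics are typically computed with scikit-learn; the labels are made up for illustration, not the study's data.

```python
# Illustration of the reported metrics (accuracy and macro-average recall)
# using scikit-learn on synthetic labels, not the study data.
from sklearn.metrics import accuracy_score, recall_score

y_true = [4, 3, 3, 2, 4, 1, 3, 4, 2, 3]   # consensus quality scores (1-4)
y_pred = [4, 3, 2, 2, 4, 1, 3, 4, 3, 3]   # hypothetical model predictions

accuracy = accuracy_score(y_true, y_pred)                     # overall fraction correct
macro_recall = recall_score(y_true, y_pred, average="macro")  # recall per class, then averaged

print(f"accuracy: {accuracy:.2f}, macro-average recall: {macro_recall:.2f}")
```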
Conclusions
This novel machine learning model offers an automated method of assessing the quality of TVUS images. The tool has the potential to provide real-time feedback to those performing TVUS, reduce the need for repeated imaging, and improve the diagnosis of gynaecological pathology.