基于组织病理学图像的外部验证机器学习模型在女性乳腺癌诊断、分类、预后或治疗效果预测方面的表现:系统综述

Q2 Medicine
Ricardo Gonzalez , Peyman Nejat , Ashirbani Saha , Clinton J.V. Campbell , Andrew P. Norgan , Cynthia Lokker
{"title":"基于组织病理学图像的外部验证机器学习模型在女性乳腺癌诊断、分类、预后或治疗效果预测方面的表现:系统综述","authors":"Ricardo Gonzalez ,&nbsp;Peyman Nejat ,&nbsp;Ashirbani Saha ,&nbsp;Clinton J.V. Campbell ,&nbsp;Andrew P. Norgan ,&nbsp;Cynthia Lokker","doi":"10.1016/j.jpi.2023.100348","DOIUrl":null,"url":null,"abstract":"<div><p>Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer. A systematic search of MEDLINE, EMBASE, CINAHL, IEEE, MICCAI, and SPIE conferences was performed for studies published between January 2010 and February 2022. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was employed, and the results were narratively described. Of the 2011 non-duplicated citations, 8 journal articles and 2 conference proceedings met inclusion criteria. Three studies externally validated ML models for diagnosis, 4 for classification, 2 for prognosis, and 1 for both classification and prognosis. Most studies used Convolutional Neural Networks and one used logistic regression algorithms. For diagnostic/classification models, the most common performance metrics reported in the EV were accuracy and area under the curve, which were greater than 87% and 90%, respectively, using pathologists' annotations/diagnoses as ground truth. The hazard ratios in the EV of prognostic ML models were between 1.7 (95% CI, 1.2–2.6) and 1.8 (95% CI, 1.3–2.7) to predict distant disease-free survival; 1.91 (95% CI, 1.11–3.29) for recurrence, and between 0.09 (95% CI, 0.01–0.70) and 0.65 (95% CI, 0.43–0.98) for overall survival, using clinical data as ground truth. Despite EV being an important step before the clinical application of a ML model, it hasn't been performed routinely. The large variability in the training/validation datasets, methods, performance metrics, and reported information limited the comparison of the models and the analysis of their results. Increasing the availability of validation datasets and implementing standardized methods and reporting protocols may facilitate future analyses.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2153353923001621/pdfft?md5=065ed1f82e40c99cefb1eb56d5945b9c&pid=1-s2.0-S2153353923001621-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Performance of externally validated machine learning models based on histopathology images for the diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer: A systematic review\",\"authors\":\"Ricardo Gonzalez ,&nbsp;Peyman Nejat ,&nbsp;Ashirbani Saha ,&nbsp;Clinton J.V. Campbell ,&nbsp;Andrew P. Norgan ,&nbsp;Cynthia Lokker\",\"doi\":\"10.1016/j.jpi.2023.100348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer. A systematic search of MEDLINE, EMBASE, CINAHL, IEEE, MICCAI, and SPIE conferences was performed for studies published between January 2010 and February 2022. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was employed, and the results were narratively described. Of the 2011 non-duplicated citations, 8 journal articles and 2 conference proceedings met inclusion criteria. Three studies externally validated ML models for diagnosis, 4 for classification, 2 for prognosis, and 1 for both classification and prognosis. Most studies used Convolutional Neural Networks and one used logistic regression algorithms. For diagnostic/classification models, the most common performance metrics reported in the EV were accuracy and area under the curve, which were greater than 87% and 90%, respectively, using pathologists' annotations/diagnoses as ground truth. The hazard ratios in the EV of prognostic ML models were between 1.7 (95% CI, 1.2–2.6) and 1.8 (95% CI, 1.3–2.7) to predict distant disease-free survival; 1.91 (95% CI, 1.11–3.29) for recurrence, and between 0.09 (95% CI, 0.01–0.70) and 0.65 (95% CI, 0.43–0.98) for overall survival, using clinical data as ground truth. Despite EV being an important step before the clinical application of a ML model, it hasn't been performed routinely. The large variability in the training/validation datasets, methods, performance metrics, and reported information limited the comparison of the models and the analysis of their results. Increasing the availability of validation datasets and implementing standardized methods and reporting protocols may facilitate future analyses.</p></div>\",\"PeriodicalId\":37769,\"journal\":{\"name\":\"Journal of Pathology Informatics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2153353923001621/pdfft?md5=065ed1f82e40c99cefb1eb56d5945b9c&pid=1-s2.0-S2153353923001621-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Pathology Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2153353923001621\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pathology Informatics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2153353923001621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

利用各种类型的数据为乳腺癌开发了大量机器学习(ML)模型。ML 模型成功的外部验证(EV)是其通用性的重要证据。本系统综述旨在评估经过外部验证的基于组织病理学图像的机器学习模型在女性乳腺癌诊断、分类、预后或治疗效果预测方面的性能。我们对 2010 年 1 月至 2022 年 2 月期间发表的研究进行了系统检索,包括 MEDLINE、EMBASE、CINAHL、IEEE、MICCAI 和 SPIE 会议。采用了预测模型偏倚风险评估工具(PROBAST),并对结果进行了叙述性描述。在 2011 年的非重复引用中,有 8 篇期刊论文和 2 篇会议论文集符合纳入标准。3 项研究对用于诊断的 ML 模型进行了外部验证,4 项用于分类,2 项用于预后,1 项用于分类和预后。大多数研究使用了卷积神经网络,一项研究使用了逻辑回归算法。对于诊断/分类模型,EV 报告中最常见的性能指标是准确率和曲线下面积,以病理学家的注释/诊断为基本事实,准确率和曲线下面积分别大于 87% 和 90%。使用临床数据作为基本真相,预后 ML 模型 EV 中预测远期无病生存的危险比介于 1.7(95% CI,1.2-2.6)和 1.8(95% CI,1.3-2.7)之间;预测复发的危险比介于 1.91(95% CI,1.11-3.29)之间;预测总生存的危险比介于 0.09(95% CI,0.01-0.70)和 0.65(95% CI,0.43-0.98)之间。尽管EV是ML模型临床应用前的一个重要步骤,但它还没有被常规化。训练/验证数据集、方法、性能指标和报告信息的巨大差异限制了模型的比较和结果分析。增加验证数据集的可用性并实施标准化的方法和报告协议可能会促进未来的分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Performance of externally validated machine learning models based on histopathology images for the diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer: A systematic review

Numerous machine learning (ML) models have been developed for breast cancer using various types of data. Successful external validation (EV) of ML models is important evidence of their generalizability. The aim of this systematic review was to assess the performance of externally validated ML models based on histopathology images for diagnosis, classification, prognosis, or treatment outcome prediction in female breast cancer. A systematic search of MEDLINE, EMBASE, CINAHL, IEEE, MICCAI, and SPIE conferences was performed for studies published between January 2010 and February 2022. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was employed, and the results were narratively described. Of the 2011 non-duplicated citations, 8 journal articles and 2 conference proceedings met inclusion criteria. Three studies externally validated ML models for diagnosis, 4 for classification, 2 for prognosis, and 1 for both classification and prognosis. Most studies used Convolutional Neural Networks and one used logistic regression algorithms. For diagnostic/classification models, the most common performance metrics reported in the EV were accuracy and area under the curve, which were greater than 87% and 90%, respectively, using pathologists' annotations/diagnoses as ground truth. The hazard ratios in the EV of prognostic ML models were between 1.7 (95% CI, 1.2–2.6) and 1.8 (95% CI, 1.3–2.7) to predict distant disease-free survival; 1.91 (95% CI, 1.11–3.29) for recurrence, and between 0.09 (95% CI, 0.01–0.70) and 0.65 (95% CI, 0.43–0.98) for overall survival, using clinical data as ground truth. Despite EV being an important step before the clinical application of a ML model, it hasn't been performed routinely. The large variability in the training/validation datasets, methods, performance metrics, and reported information limited the comparison of the models and the analysis of their results. Increasing the availability of validation datasets and implementing standardized methods and reporting protocols may facilitate future analyses.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Pathology Informatics
Journal of Pathology Informatics Medicine-Pathology and Forensic Medicine
CiteScore
3.70
自引率
0.00%
发文量
2
审稿时长
18 weeks
期刊介绍: The Journal of Pathology Informatics (JPI) is an open access peer-reviewed journal dedicated to the advancement of pathology informatics. This is the official journal of the Association for Pathology Informatics (API). The journal aims to publish broadly about pathology informatics and freely disseminate all articles worldwide. This journal is of interest to pathologists, informaticians, academics, researchers, health IT specialists, information officers, IT staff, vendors, and anyone with an interest in informatics. We encourage submissions from anyone with an interest in the field of pathology informatics. We publish all types of papers related to pathology informatics including original research articles, technical notes, reviews, viewpoints, commentaries, editorials, symposia, meeting abstracts, book reviews, and correspondence to the editors. All submissions are subject to rigorous peer review by the well-regarded editorial board and by expert referees in appropriate specialties.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信