人工智能在免疫染色全片图像上自动泛器官联合PD-L1评分的临床评价

C. Bossard , C. Magois , H. Roussel , F. Thomas , B. Cormier , A. Collin , V. Lemerle , I. Chokri , L. Lambros , F. Jossic , J.-F. Jazeron , C. Eymerit-Morin , A. Dhouibi , N. Labaied , A. Mensah , B. Gourdin , F. Leclair , D. Pommeret , Y. Salhi , M. Cecchini , J. Chetritt
{"title":"人工智能在免疫染色全片图像上自动泛器官联合PD-L1评分的临床评价","authors":"C. Bossard ,&nbsp;C. Magois ,&nbsp;H. Roussel ,&nbsp;F. Thomas ,&nbsp;B. Cormier ,&nbsp;A. Collin ,&nbsp;V. Lemerle ,&nbsp;I. Chokri ,&nbsp;L. Lambros ,&nbsp;F. Jossic ,&nbsp;J.-F. Jazeron ,&nbsp;C. Eymerit-Morin ,&nbsp;A. Dhouibi ,&nbsp;N. Labaied ,&nbsp;A. Mensah ,&nbsp;B. Gourdin ,&nbsp;F. Leclair ,&nbsp;D. Pommeret ,&nbsp;Y. Salhi ,&nbsp;M. Cecchini ,&nbsp;J. Chetritt","doi":"10.1016/j.esmorw.2025.100181","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Programmed death-ligand 1 (PD-L1) inhibitors have shown remarkable results in oncology; however, many patients fail to respond, highlighting the need for reliable assessment of PD-L1 expression for patient selection. PD-L1 scoring, especially the combined positive score (CPS), is hindered by inter- and intraobserver variability, complex staining patterns, and technical discrepancies, all of which can impact therapeutic decisions. Artificial intelligence (AI) offers a solution by standardizing PD-L1 evaluation. This study evaluates a PD-L1 CPS AI, designed for reproducible and robust PD-L1 scoring across various tumor types and conditions.</div></div><div><h3>Materials and methods</h3><div>AI performance was validated on 142 samples spanning multiple tumor types (gastrointestinal, head and neck, breast, and uterine cervix) and sourced from four centers, reflecting diverse staining protocols. Routine scores were available. A gold standard was established through independent retrospective scoring by three senior pathologists enabling the assessment of variability. The scoring process was followed by collegial discussions to resolve discordant cases and ensure medical consensus. After a washout period, cases were reassessed with AI assistance. AI and routine manual scores were compared with the gold standard using organ-specific cut-offs.</div></div><div><h3>Results</h3><div>AI assistance improved interobserver agreement among pathologists, increasing intraclass correlation coefficient (ICC) from 62% to 74%, with a particularly pronounced effect in challenging cases with CPS &lt; 20 (<em>n</em> = 91), where ICC improved from 19% to 62%, underscoring the value of AI in reducing variability near clinical decision thresholds. Based on clinical cut-offs, AI-based scoring outperformed routine manual scoring in accuracy (88% versus 75%) and sensitivity (96% versus 78%), while maintaining a comparable positive predictive value (88% versus 87%), indicating an improved ability to detect true-positive cases.</div></div><div><h3>Conclusions</h3><div>This study highlights the potential of an AI-driven tool—DiaKwant PD-L1 algorithm—to improve PD-L1 scoring accuracy and reduce observer variability, particularly near clinical thresholds, across various solid carcinomas, independently of pre-analytical and digitization platforms. Its integration into clinical workflows could enhance efficiency and optimize patient eligibility for immunotherapy.</div></div>","PeriodicalId":100491,"journal":{"name":"ESMO Real World Data and Digital Oncology","volume":"10 ","pages":"Article 100181"},"PeriodicalIF":0.0000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clinical evaluation of an automated pan-organ combined PD-L1 scoring using artificial intelligence on immunostained whole-slide images\",\"authors\":\"C. Bossard ,&nbsp;C. Magois ,&nbsp;H. Roussel ,&nbsp;F. Thomas ,&nbsp;B. Cormier ,&nbsp;A. Collin ,&nbsp;V. Lemerle ,&nbsp;I. Chokri ,&nbsp;L. Lambros ,&nbsp;F. Jossic ,&nbsp;J.-F. Jazeron ,&nbsp;C. Eymerit-Morin ,&nbsp;A. Dhouibi ,&nbsp;N. Labaied ,&nbsp;A. Mensah ,&nbsp;B. Gourdin ,&nbsp;F. Leclair ,&nbsp;D. Pommeret ,&nbsp;Y. Salhi ,&nbsp;M. Cecchini ,&nbsp;J. Chetritt\",\"doi\":\"10.1016/j.esmorw.2025.100181\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Programmed death-ligand 1 (PD-L1) inhibitors have shown remarkable results in oncology; however, many patients fail to respond, highlighting the need for reliable assessment of PD-L1 expression for patient selection. PD-L1 scoring, especially the combined positive score (CPS), is hindered by inter- and intraobserver variability, complex staining patterns, and technical discrepancies, all of which can impact therapeutic decisions. Artificial intelligence (AI) offers a solution by standardizing PD-L1 evaluation. This study evaluates a PD-L1 CPS AI, designed for reproducible and robust PD-L1 scoring across various tumor types and conditions.</div></div><div><h3>Materials and methods</h3><div>AI performance was validated on 142 samples spanning multiple tumor types (gastrointestinal, head and neck, breast, and uterine cervix) and sourced from four centers, reflecting diverse staining protocols. Routine scores were available. A gold standard was established through independent retrospective scoring by three senior pathologists enabling the assessment of variability. The scoring process was followed by collegial discussions to resolve discordant cases and ensure medical consensus. After a washout period, cases were reassessed with AI assistance. AI and routine manual scores were compared with the gold standard using organ-specific cut-offs.</div></div><div><h3>Results</h3><div>AI assistance improved interobserver agreement among pathologists, increasing intraclass correlation coefficient (ICC) from 62% to 74%, with a particularly pronounced effect in challenging cases with CPS &lt; 20 (<em>n</em> = 91), where ICC improved from 19% to 62%, underscoring the value of AI in reducing variability near clinical decision thresholds. Based on clinical cut-offs, AI-based scoring outperformed routine manual scoring in accuracy (88% versus 75%) and sensitivity (96% versus 78%), while maintaining a comparable positive predictive value (88% versus 87%), indicating an improved ability to detect true-positive cases.</div></div><div><h3>Conclusions</h3><div>This study highlights the potential of an AI-driven tool—DiaKwant PD-L1 algorithm—to improve PD-L1 scoring accuracy and reduce observer variability, particularly near clinical thresholds, across various solid carcinomas, independently of pre-analytical and digitization platforms. Its integration into clinical workflows could enhance efficiency and optimize patient eligibility for immunotherapy.</div></div>\",\"PeriodicalId\":100491,\"journal\":{\"name\":\"ESMO Real World Data and Digital Oncology\",\"volume\":\"10 \",\"pages\":\"Article 100181\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ESMO Real World Data and Digital Oncology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2949820125000700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESMO Real World Data and Digital Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949820125000700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

程序性死亡配体1 (PD-L1)抑制剂在肿瘤学中显示出显著的效果;然而,许多患者没有反应,因此需要可靠的PD-L1表达评估来选择患者。PD-L1评分,特别是联合阳性评分(CPS),受到观察者之间和观察者内部的变异性、复杂的染色模式和技术差异的阻碍,所有这些都会影响治疗决策。人工智能(AI)通过标准化PD-L1评价提供了解决方案。本研究评估了PD-L1 CPS AI,该AI设计用于在各种肿瘤类型和条件下可重复且稳健的PD-L1评分。材料和方法在来自四个中心的142个样本上验证了sai的性能,这些样本涵盖多种肿瘤类型(胃肠道、头颈部、乳房和子宫颈),反映了不同的染色方案。常规评分是可用的。金标准是由三位高级病理学家通过独立的回顾性评分建立的,从而能够评估变异性。评分过程之后是学院讨论,以解决不一致的情况,并确保医学共识。在洗脱期后,用人工智能辅助对病例进行重新评估。人工智能和常规人工评分与使用器官特异性切断的金标准进行比较。结果AI辅助提高了病理学家之间的观察者之间的一致性,将类内相关系数(ICC)从62%增加到74%,在具有挑战性的CPS病例中(n = 91)尤其显著,ICC从19%提高到62%,强调了AI在减少接近临床决策阈值的变异性方面的价值。基于临床临界值,基于人工智能的评分在准确性(88%对75%)和敏感性(96%对78%)方面优于常规人工评分,同时保持相当的阳性预测值(88%对87%),表明检测真阳性病例的能力有所提高。本研究强调了ai驱动工具diakwant PD-L1算法的潜力,可以提高PD-L1评分的准确性,并减少观察者的可变性,特别是在接近临床阈值的情况下,跨越各种实体癌,独立于分析前和数字化平台。将其整合到临床工作流程中可以提高效率并优化患者对免疫治疗的资格。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Clinical evaluation of an automated pan-organ combined PD-L1 scoring using artificial intelligence on immunostained whole-slide images

Background

Programmed death-ligand 1 (PD-L1) inhibitors have shown remarkable results in oncology; however, many patients fail to respond, highlighting the need for reliable assessment of PD-L1 expression for patient selection. PD-L1 scoring, especially the combined positive score (CPS), is hindered by inter- and intraobserver variability, complex staining patterns, and technical discrepancies, all of which can impact therapeutic decisions. Artificial intelligence (AI) offers a solution by standardizing PD-L1 evaluation. This study evaluates a PD-L1 CPS AI, designed for reproducible and robust PD-L1 scoring across various tumor types and conditions.

Materials and methods

AI performance was validated on 142 samples spanning multiple tumor types (gastrointestinal, head and neck, breast, and uterine cervix) and sourced from four centers, reflecting diverse staining protocols. Routine scores were available. A gold standard was established through independent retrospective scoring by three senior pathologists enabling the assessment of variability. The scoring process was followed by collegial discussions to resolve discordant cases and ensure medical consensus. After a washout period, cases were reassessed with AI assistance. AI and routine manual scores were compared with the gold standard using organ-specific cut-offs.

Results

AI assistance improved interobserver agreement among pathologists, increasing intraclass correlation coefficient (ICC) from 62% to 74%, with a particularly pronounced effect in challenging cases with CPS < 20 (n = 91), where ICC improved from 19% to 62%, underscoring the value of AI in reducing variability near clinical decision thresholds. Based on clinical cut-offs, AI-based scoring outperformed routine manual scoring in accuracy (88% versus 75%) and sensitivity (96% versus 78%), while maintaining a comparable positive predictive value (88% versus 87%), indicating an improved ability to detect true-positive cases.

Conclusions

This study highlights the potential of an AI-driven tool—DiaKwant PD-L1 algorithm—to improve PD-L1 scoring accuracy and reduce observer variability, particularly near clinical thresholds, across various solid carcinomas, independently of pre-analytical and digitization platforms. Its integration into clinical workflows could enhance efficiency and optimize patient eligibility for immunotherapy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信