用于评估机器学习/人工智能模型的管道，以量化 PD-L1 免疫组化。

IF 5.1 2区医学 Q1 MEDICINE, RESEARCH & EXPERIMENTAL

Laboratory Investigation Pub Date : 2024-04-26 DOI:10.1016/j.labinv.2024.102070

Beatrice S. Knudsen , Alok Jadhav , Lindsey J. Perry , Jeppe Thagaard , Georgios Deftereos , Jian Ying , Ben J. Brintz , Wei Zhang

{"title":"用于评估机器学习/人工智能模型的管道，以量化 PD-L1 免疫组化。","authors":"Beatrice S. Knudsen , Alok Jadhav , Lindsey J. Perry , Jeppe Thagaard , Georgios Deftereos , Jian Ying , Ben J. Brintz , Wei Zhang","doi":"10.1016/j.labinv.2024.102070","DOIUrl":null,"url":null,"abstract":"<div><p>Immunohistochemistry (IHC) is used to guide treatment decisions in multiple cancer types. For treatment with checkpoint inhibitors, programmed death ligand 1 (PD-L1) IHC is used as a companion diagnostic. However, the scoring of PD-L1 is complicated by its expression in cancer and immune cells. Separation of cancer and noncancer regions is needed to calculate tumor proportion scores (TPS) of PD-L1, which is based on the percentage of PD-L1-positive cancer cells. Evaluation of PD-L1 expression requires highly experienced pathologists and is often challenging and time-consuming. Here, we used a multi-institutional cohort of 77 lung cancer cases stained centrally with the PD-L1 22C3 clone. We developed a 4-step pipeline for measuring TPS that includes the coregistration of hematoxylin and eosin, PD-L1, and negative control (NC) digital slides for exclusion of necrosis, segmentation of cancer regions, and quantification of PD-L1+ cells. As cancer segmentation is a challenging step for TPS generation, we trained DeepLab V3 in the Visiopharm software package to outline cancer regions in PD-L1 and NC images and evaluated the model performance by mean intersection over union (mIoU) against manual outlines. Only 14 cases were required to accomplish a mIoU of 0.82 for cancer segmentation in hematoxylin-stained NC cases. For PD-L1-stained slides, a model trained on PD-L1 tiles augmented by registered NC tiles achieved a mIoU of 0.79. In segmented cancer regions from whole slide images, the digital TPS achieved an accuracy of 75% against the manual TPS scores from the pathology report. Major reasons for algorithmic inaccuracies include the inclusion of immune cells in cancer outlines and poor nuclear segmentation of cancer cells. Our transparent and stepwise approach and performance metrics can be applied to any IHC assay to provide pathologists with important insights on when to apply and how to evaluate commercial automated IHC scoring systems.</p></div>","PeriodicalId":17930,"journal":{"name":"Laboratory Investigation","volume":"104 6","pages":"Article 102070"},"PeriodicalIF":5.1000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Pipeline for Evaluation of Machine Learning/Artificial Intelligence Models to Quantify Programmed Death Ligand 1 Immunohistochemistry\",\"authors\":\"Beatrice S. Knudsen , Alok Jadhav , Lindsey J. Perry , Jeppe Thagaard , Georgios Deftereos , Jian Ying , Ben J. Brintz , Wei Zhang\",\"doi\":\"10.1016/j.labinv.2024.102070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Immunohistochemistry (IHC) is used to guide treatment decisions in multiple cancer types. For treatment with checkpoint inhibitors, programmed death ligand 1 (PD-L1) IHC is used as a companion diagnostic. However, the scoring of PD-L1 is complicated by its expression in cancer and immune cells. Separation of cancer and noncancer regions is needed to calculate tumor proportion scores (TPS) of PD-L1, which is based on the percentage of PD-L1-positive cancer cells. Evaluation of PD-L1 expression requires highly experienced pathologists and is often challenging and time-consuming. Here, we used a multi-institutional cohort of 77 lung cancer cases stained centrally with the PD-L1 22C3 clone. We developed a 4-step pipeline for measuring TPS that includes the coregistration of hematoxylin and eosin, PD-L1, and negative control (NC) digital slides for exclusion of necrosis, segmentation of cancer regions, and quantification of PD-L1+ cells. As cancer segmentation is a challenging step for TPS generation, we trained DeepLab V3 in the Visiopharm software package to outline cancer regions in PD-L1 and NC images and evaluated the model performance by mean intersection over union (mIoU) against manual outlines. Only 14 cases were required to accomplish a mIoU of 0.82 for cancer segmentation in hematoxylin-stained NC cases. For PD-L1-stained slides, a model trained on PD-L1 tiles augmented by registered NC tiles achieved a mIoU of 0.79. In segmented cancer regions from whole slide images, the digital TPS achieved an accuracy of 75% against the manual TPS scores from the pathology report. Major reasons for algorithmic inaccuracies include the inclusion of immune cells in cancer outlines and poor nuclear segmentation of cancer cells. Our transparent and stepwise approach and performance metrics can be applied to any IHC assay to provide pathologists with important insights on when to apply and how to evaluate commercial automated IHC scoring systems.</p></div>\",\"PeriodicalId\":17930,\"journal\":{\"name\":\"Laboratory Investigation\",\"volume\":\"104 6\",\"pages\":\"Article 102070\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2024-04-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Laboratory Investigation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0023683724017483\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Laboratory Investigation","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0023683724017483","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

摘要

免疫组化（IHC）用于指导多种癌症类型的治疗决策。在使用检查点抑制剂治疗时，PD-L1 IHC 被用作辅助诊断。然而，PD-L1 在癌细胞和免疫细胞中的表达使其评分变得复杂。要计算 PD-L1 的肿瘤比例分数（TPS），需要将癌症区域和非癌症区域分开，TPS 是基于 PD-L1 阳性癌细胞的百分比。评估 PD-L1 的表达需要经验丰富的病理学家，通常具有挑战性且耗时较长。在这里，我们使用了一个由 77 例肺癌病例组成的多机构队列，这些病例都用 PD-L1 22C3 克隆进行了集中染色。我们开发了一种四步测量 TPS 的方法，包括共同登记 H&E、PD-L1 和阴性对照 (NC) 数字切片以排除坏死、分割癌症区域和量化 PD-L1+ 细胞。由于癌症分割是生成 TPS 的一个具有挑战性的步骤，我们对 Visiopharm 软件包中的 DeepLab V3 进行了训练，以勾勒出 PD-L1 和阴性对照（NC）图像中的癌症区域，并通过平均交集大于联合（mIoU）评估了模型性能与人工勾勒的对比。在苏木精染色的 NC 病例中，只需要 14 个病例就能达到 0.82 的癌症分割 mIoU。对于 PD-L1 染色玻片，在 PD-L1 片上训练的模型通过注册 NC 片进行增强，mIoU 达到 0.79。在从整张切片图像中分割癌症区域时，数字 TPS 的准确率达到了病理报告中人工 TPS 评分的 75%。算法不准确的主要原因包括癌症轮廓中包含免疫细胞以及癌细胞核分割不佳。我们透明、循序渐进的方法和性能指标可应用于任何 IHC 检测，为病理学家提供了何时应用和如何评估商用自动 IHC 评分系统的重要见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Pipeline for Evaluation of Machine Learning/Artificial Intelligence Models to Quantify Programmed Death Ligand 1 Immunohistochemistry

Immunohistochemistry (IHC) is used to guide treatment decisions in multiple cancer types. For treatment with checkpoint inhibitors, programmed death ligand 1 (PD-L1) IHC is used as a companion diagnostic. However, the scoring of PD-L1 is complicated by its expression in cancer and immune cells. Separation of cancer and noncancer regions is needed to calculate tumor proportion scores (TPS) of PD-L1, which is based on the percentage of PD-L1-positive cancer cells. Evaluation of PD-L1 expression requires highly experienced pathologists and is often challenging and time-consuming. Here, we used a multi-institutional cohort of 77 lung cancer cases stained centrally with the PD-L1 22C3 clone. We developed a 4-step pipeline for measuring TPS that includes the coregistration of hematoxylin and eosin, PD-L1, and negative control (NC) digital slides for exclusion of necrosis, segmentation of cancer regions, and quantification of PD-L1+ cells. As cancer segmentation is a challenging step for TPS generation, we trained DeepLab V3 in the Visiopharm software package to outline cancer regions in PD-L1 and NC images and evaluated the model performance by mean intersection over union (mIoU) against manual outlines. Only 14 cases were required to accomplish a mIoU of 0.82 for cancer segmentation in hematoxylin-stained NC cases. For PD-L1-stained slides, a model trained on PD-L1 tiles augmented by registered NC tiles achieved a mIoU of 0.79. In segmented cancer regions from whole slide images, the digital TPS achieved an accuracy of 75% against the manual TPS scores from the pathology report. Major reasons for algorithmic inaccuracies include the inclusion of immune cells in cancer outlines and poor nuclear segmentation of cancer cells. Our transparent and stepwise approach and performance metrics can be applied to any IHC assay to provide pathologists with important insights on when to apply and how to evaluate commercial automated IHC scoring systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Laboratory Investigation 医学-病理学

CiteScore

8.30

自引率

0.00%

发文量

125

审稿时长

2 months

期刊介绍： Laboratory Investigation is an international journal owned by the United States and Canadian Academy of Pathology. Laboratory Investigation offers prompt publication of high-quality original research in all biomedical disciplines relating to the understanding of human disease and the application of new methods to the diagnosis of disease. Both human and experimental studies are welcome.