Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein
{"title":"2423:利用数字组织病理学对儿童肉瘤进行自动分类","authors":"Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein","doi":"10.1158/1538-7445.am2025-2423","DOIUrl":null,"url":null,"abstract":"Introduction: Pediatric sarcomas are challenging to accurately classify due to their rarity and the wide diversity of subtypes. The process requires highly specialized pathologists as well as molecular and genetic testing that is expensive, takes time, and is not universally available. Deep neural network models (DNNs) trained on histopathology slides can reduce the time and cost to diagnosis and attenuate disparities in care based on geographical location and socioeconomic status. Here, we demonstrate the efficacy of automated image analysis for assigning sarcoma diagnoses across centers by identifying Ewing Sarcoma (ES), distinguishing rhabdomyosarcoma (RMS) vs non-rhabdomyosarcoma soft tissue sarcomas (NRSTS), as well as classifying alveolar, embryonal, and spindle cell RMS subtypes. Methods: Images were collected from Massachusetts General Hospital, Yale School of Medicine, St Jude Children’s, and the Children’s Oncology Group. A total of 691 images were collected across all centers, including 9 different sarcoma subtypes. Images were harmonized using our published STQ pipeline including focus checking, resolution standardization, and stain normalization. Image tiling and feature extraction was performed comparing multiple deep learning backbones (CTransPath, UNI, CONCH). Tile-level features were transposed to whole slide-level representations using our published SAMPLER method in which each feature is represented as the vector of decile values of its distribution across all tiles. Resulting feature sets were fed into logistic regression models for sarcoma classification tasks. We benchmark this approach against transformer based multi-head self-attention models trained on a V100 GPU. Results: We are able to distinguish ES from all other sarcoma types with an AUROC of 0.966. In the task of NRSTS v. alveolar RMS v. embryonal RMS we achieve an AUROC of 0.939. Restricting to RMS subtypes, we distinguish alveolar from embryonal with an AUROC of 0.95. Also, despite uneven sample representation, we obtain an AUROC of 0.88 for alveolar v. embryonal v. spindle type RMS. Finally, we have developed spatially-resolved attention maps, which provide interpretability for the regions of a slide that contain malignant cells. Conclusion: To our knowledge, we have amassed the largest multicenter pediatric sarcoma imaging dataset with broad representation across subtypes, anatomical locations, race, and sex. Our pipeline allows for further integration of images from any center, which may be co-analyzed to attenuate center-specific batch effects. Our classification accuracies are state of the art for multiple tasks fundamental to clinical sarcoma pathology, including novel multiclass distinction among three different RMS subtypes. Importantly, our pipeline and SAMPLER model can be run with minimal computational requirements, allowing for broad accessibility. Citation Format: Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein. Automated classification of pediatric sarcoma using digital histopathology [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2025; Part 1 (Regular s); 2025 Apr 25-30; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2025;85(8_Suppl_1): nr 2423.","PeriodicalId":9441,"journal":{"name":"Cancer research","volume":"4 1","pages":""},"PeriodicalIF":16.6000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Abstract 2423: Automated classification of pediatric sarcoma using digital histopathology\",\"authors\":\"Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein\",\"doi\":\"10.1158/1538-7445.am2025-2423\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Pediatric sarcomas are challenging to accurately classify due to their rarity and the wide diversity of subtypes. The process requires highly specialized pathologists as well as molecular and genetic testing that is expensive, takes time, and is not universally available. Deep neural network models (DNNs) trained on histopathology slides can reduce the time and cost to diagnosis and attenuate disparities in care based on geographical location and socioeconomic status. Here, we demonstrate the efficacy of automated image analysis for assigning sarcoma diagnoses across centers by identifying Ewing Sarcoma (ES), distinguishing rhabdomyosarcoma (RMS) vs non-rhabdomyosarcoma soft tissue sarcomas (NRSTS), as well as classifying alveolar, embryonal, and spindle cell RMS subtypes. Methods: Images were collected from Massachusetts General Hospital, Yale School of Medicine, St Jude Children’s, and the Children’s Oncology Group. A total of 691 images were collected across all centers, including 9 different sarcoma subtypes. Images were harmonized using our published STQ pipeline including focus checking, resolution standardization, and stain normalization. Image tiling and feature extraction was performed comparing multiple deep learning backbones (CTransPath, UNI, CONCH). Tile-level features were transposed to whole slide-level representations using our published SAMPLER method in which each feature is represented as the vector of decile values of its distribution across all tiles. Resulting feature sets were fed into logistic regression models for sarcoma classification tasks. We benchmark this approach against transformer based multi-head self-attention models trained on a V100 GPU. Results: We are able to distinguish ES from all other sarcoma types with an AUROC of 0.966. In the task of NRSTS v. alveolar RMS v. embryonal RMS we achieve an AUROC of 0.939. Restricting to RMS subtypes, we distinguish alveolar from embryonal with an AUROC of 0.95. Also, despite uneven sample representation, we obtain an AUROC of 0.88 for alveolar v. embryonal v. spindle type RMS. Finally, we have developed spatially-resolved attention maps, which provide interpretability for the regions of a slide that contain malignant cells. Conclusion: To our knowledge, we have amassed the largest multicenter pediatric sarcoma imaging dataset with broad representation across subtypes, anatomical locations, race, and sex. Our pipeline allows for further integration of images from any center, which may be co-analyzed to attenuate center-specific batch effects. Our classification accuracies are state of the art for multiple tasks fundamental to clinical sarcoma pathology, including novel multiclass distinction among three different RMS subtypes. Importantly, our pipeline and SAMPLER model can be run with minimal computational requirements, allowing for broad accessibility. Citation Format: Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein. Automated classification of pediatric sarcoma using digital histopathology [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2025; Part 1 (Regular s); 2025 Apr 25-30; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2025;85(8_Suppl_1): nr 2423.\",\"PeriodicalId\":9441,\"journal\":{\"name\":\"Cancer research\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":16.6000,\"publicationDate\":\"2025-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cancer research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1158/1538-7445.am2025-2423\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1158/1538-7445.am2025-2423","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
儿童肉瘤由于其罕见性和亚型的广泛多样性而难以准确分类。这一过程需要高度专业化的病理学家以及分子和基因检测,这是昂贵的,耗时的,并不是普遍可用的。在组织病理学载玻片上训练的深度神经网络模型(dnn)可以减少诊断的时间和成本,并缩小基于地理位置和社会经济地位的护理差异。在这里,我们通过识别Ewing肉瘤(ES),区分横纹肌肉瘤(RMS)和非横纹肌肉瘤软组织肉瘤(NRSTS),以及分类肺泡、胚胎和梭形细胞RMS亚型,证明了自动图像分析在分配肉瘤诊断中的有效性。方法:图像收集自马萨诸塞州总医院、耶鲁大学医学院、圣裘德儿童医院和儿童肿瘤组。所有中心共收集了691张图像,包括9种不同的肉瘤亚型。使用我们发布的STQ管道对图像进行协调,包括焦点检查、分辨率标准化和染色归一化。对多个深度学习骨干(CTransPath, UNI, CONCH)进行图像平铺和特征提取比较。使用我们发布的SAMPLER方法,将瓦片级特征转换为整个幻灯片级表示,其中每个特征都表示为其分布在所有瓦片上的十分位数向量。得到的特征集被输入到用于肉瘤分类任务的逻辑回归模型中。我们将这种方法与在V100 GPU上训练的基于变压器的多头自关注模型进行基准测试。结果:我们能够将ES与所有其他类型的肉瘤区分开来,AUROC为0.966。在NRSTS、肺泡RMS和胚胎RMS的比较中,我们获得了0.939的AUROC。在RMS亚型中,我们区分肺泡型和胚胎型,AUROC为0.95。此外,尽管样本代表性不均匀,我们获得了肺泡型、胚胎型和纺锤体型RMS的AUROC为0.88。最后,我们开发了空间分辨注意力图,它为幻灯片中含有恶性细胞的区域提供了可解释性。结论:据我们所知,我们已经积累了最大的多中心儿童肉瘤成像数据集,具有广泛的亚型、解剖位置、种族和性别代表性。我们的流水线允许进一步整合来自任何中心的图像,这些图像可能会被共同分析,以减弱中心特定的批处理效果。我们的分类准确性是最先进的,用于临床肉瘤病理基础的多项任务,包括三种不同RMS亚型之间的新型多类别区分。重要的是,我们的管道和SAMPLER模型可以以最小的计算需求运行,允许广泛的可访问性。引用格式:Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. chrisson - lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein。数字组织病理学在儿童肉瘤自动分类中的应用[摘要]。摘自:《2025年美国癌症研究协会年会论文集》;第1部分(常规);2025年4月25日至30日;费城(PA): AACR;中国生物医学工程学报,2015;31(5):444 - 444。
Abstract 2423: Automated classification of pediatric sarcoma using digital histopathology
Introduction: Pediatric sarcomas are challenging to accurately classify due to their rarity and the wide diversity of subtypes. The process requires highly specialized pathologists as well as molecular and genetic testing that is expensive, takes time, and is not universally available. Deep neural network models (DNNs) trained on histopathology slides can reduce the time and cost to diagnosis and attenuate disparities in care based on geographical location and socioeconomic status. Here, we demonstrate the efficacy of automated image analysis for assigning sarcoma diagnoses across centers by identifying Ewing Sarcoma (ES), distinguishing rhabdomyosarcoma (RMS) vs non-rhabdomyosarcoma soft tissue sarcomas (NRSTS), as well as classifying alveolar, embryonal, and spindle cell RMS subtypes. Methods: Images were collected from Massachusetts General Hospital, Yale School of Medicine, St Jude Children’s, and the Children’s Oncology Group. A total of 691 images were collected across all centers, including 9 different sarcoma subtypes. Images were harmonized using our published STQ pipeline including focus checking, resolution standardization, and stain normalization. Image tiling and feature extraction was performed comparing multiple deep learning backbones (CTransPath, UNI, CONCH). Tile-level features were transposed to whole slide-level representations using our published SAMPLER method in which each feature is represented as the vector of decile values of its distribution across all tiles. Resulting feature sets were fed into logistic regression models for sarcoma classification tasks. We benchmark this approach against transformer based multi-head self-attention models trained on a V100 GPU. Results: We are able to distinguish ES from all other sarcoma types with an AUROC of 0.966. In the task of NRSTS v. alveolar RMS v. embryonal RMS we achieve an AUROC of 0.939. Restricting to RMS subtypes, we distinguish alveolar from embryonal with an AUROC of 0.95. Also, despite uneven sample representation, we obtain an AUROC of 0.88 for alveolar v. embryonal v. spindle type RMS. Finally, we have developed spatially-resolved attention maps, which provide interpretability for the regions of a slide that contain malignant cells. Conclusion: To our knowledge, we have amassed the largest multicenter pediatric sarcoma imaging dataset with broad representation across subtypes, anatomical locations, race, and sex. Our pipeline allows for further integration of images from any center, which may be co-analyzed to attenuate center-specific batch effects. Our classification accuracies are state of the art for multiple tasks fundamental to clinical sarcoma pathology, including novel multiclass distinction among three different RMS subtypes. Importantly, our pipeline and SAMPLER model can be run with minimal computational requirements, allowing for broad accessibility. Citation Format: Adam Thiesen, Sergii Domanskyi, Ali Foroughi pour, Jingyan Zhang, Todd B. Sheridan, Steven B. Neuhauser, Alyssa Stetson, Katelyn Dannheim, Danielle B. Cameron, Shawn Ahn, Hao Wu, Emily R. Christison-Lagay, Carol J. Bult, Jeffrey H. Chuang, Jill C. Rubinstein. Automated classification of pediatric sarcoma using digital histopathology [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2025; Part 1 (Regular s); 2025 Apr 25-30; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2025;85(8_Suppl_1): nr 2423.
期刊介绍:
Cancer Research, published by the American Association for Cancer Research (AACR), is a journal that focuses on impactful original studies, reviews, and opinion pieces relevant to the broad cancer research community. Manuscripts that present conceptual or technological advances leading to insights into cancer biology are particularly sought after. The journal also places emphasis on convergence science, which involves bridging multiple distinct areas of cancer research.
With primary subsections including Cancer Biology, Cancer Immunology, Cancer Metabolism and Molecular Mechanisms, Translational Cancer Biology, Cancer Landscapes, and Convergence Science, Cancer Research has a comprehensive scope. It is published twice a month and has one volume per year, with a print ISSN of 0008-5472 and an online ISSN of 1538-7445.
Cancer Research is abstracted and/or indexed in various databases and platforms, including BIOSIS Previews (R) Database, MEDLINE, Current Contents/Life Sciences, Current Contents/Clinical Medicine, Science Citation Index, Scopus, and Web of Science.