An initial game-theoretic assessment of enhanced tissue preparation and imaging protocols for improved deep learning inference of spatial transcriptomics from tissue morphology.
Michael Y Fatemi, Yunrui Lu, Alos B Diallo, Gokul Srinivasan, Zarif L Azher, Brock C Christensen, Lucas A Salas, Gregory J Tsongalis, Scott M Palisoul, Laurent Perreard, Fred W Kolling, Louis J Vaickus, Joshua J Levy
{"title":"An initial game-theoretic assessment of enhanced tissue preparation and imaging protocols for improved deep learning inference of spatial transcriptomics from tissue morphology.","authors":"Michael Y Fatemi, Yunrui Lu, Alos B Diallo, Gokul Srinivasan, Zarif L Azher, Brock C Christensen, Lucas A Salas, Gregory J Tsongalis, Scott M Palisoul, Laurent Perreard, Fred W Kolling, Louis J Vaickus, Joshua J Levy","doi":"10.1093/bib/bbae476","DOIUrl":null,"url":null,"abstract":"<p><p>The application of deep learning to spatial transcriptomics (ST) can reveal relationships between gene expression and tissue architecture. Prior work has demonstrated that inferring gene expression from tissue histomorphology can discern these spatial molecular markers to enable population scale studies, reducing the fiscal barriers associated with large-scale spatial profiling. However, while most improvements in algorithmic performance have focused on improving model architectures, little is known about how the quality of tissue preparation and imaging can affect deep learning model training for spatial inference from morphology and its potential for widespread clinical adoption. Prior studies for ST inference from histology typically utilize manually stained frozen sections with imaging on non-clinical grade scanners. Training such models on ST cohorts is also costly. We hypothesize that adopting tissue processing and imaging practices that mirror standards for clinical implementation (permanent sections, automated tissue staining, and clinical grade scanning) can significantly improve model performance. An enhanced specimen processing and imaging protocol was developed for deep learning-based ST inference from morphology. This protocol featured the Visium CytAssist assay to permit automated hematoxylin and eosin staining (e.g. Leica Bond), 40×-resolution imaging, and joining of multiple patients' tissue sections per capture area prior to ST profiling. Using a cohort of 13 pathologic T Stage-III stage colorectal cancer patients, we compared the performance of models trained on slide prepared using enhanced versus traditional (i.e. manual staining and low-resolution imaging) protocols. Leveraging Inceptionv3 neural networks, we predicted gene expression across serial, histologically-matched tissue sections using whole slide images (WSI) from both protocols. The data Shapley was used to quantify and compare marginal performance gains on a patient-by-patient basis attributed to using the enhanced protocol versus the actual costs of spatial profiling. Findings indicate that training and validating on WSI acquired through the enhanced protocol as opposed to the traditional method resulted in improved performance at lower fiscal cost. In the realm of ST, the enhancement of deep learning architectures frequently captures the spotlight; however, the significance of specimen processing and imaging is often understated. This research, informed through a game-theoretic lens, underscores the substantial impact that specimen preparation/imaging can have on spatial transcriptomic inference from morphology. It is essential to integrate such optimized processing protocols to facilitate the identification of prognostic markers at a larger scale.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"25 6","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11452536/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbae476","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
The application of deep learning to spatial transcriptomics (ST) can reveal relationships between gene expression and tissue architecture. Prior work has demonstrated that inferring gene expression from tissue histomorphology can discern these spatial molecular markers to enable population scale studies, reducing the fiscal barriers associated with large-scale spatial profiling. However, while most improvements in algorithmic performance have focused on improving model architectures, little is known about how the quality of tissue preparation and imaging can affect deep learning model training for spatial inference from morphology and its potential for widespread clinical adoption. Prior studies for ST inference from histology typically utilize manually stained frozen sections with imaging on non-clinical grade scanners. Training such models on ST cohorts is also costly. We hypothesize that adopting tissue processing and imaging practices that mirror standards for clinical implementation (permanent sections, automated tissue staining, and clinical grade scanning) can significantly improve model performance. An enhanced specimen processing and imaging protocol was developed for deep learning-based ST inference from morphology. This protocol featured the Visium CytAssist assay to permit automated hematoxylin and eosin staining (e.g. Leica Bond), 40×-resolution imaging, and joining of multiple patients' tissue sections per capture area prior to ST profiling. Using a cohort of 13 pathologic T Stage-III stage colorectal cancer patients, we compared the performance of models trained on slide prepared using enhanced versus traditional (i.e. manual staining and low-resolution imaging) protocols. Leveraging Inceptionv3 neural networks, we predicted gene expression across serial, histologically-matched tissue sections using whole slide images (WSI) from both protocols. The data Shapley was used to quantify and compare marginal performance gains on a patient-by-patient basis attributed to using the enhanced protocol versus the actual costs of spatial profiling. Findings indicate that training and validating on WSI acquired through the enhanced protocol as opposed to the traditional method resulted in improved performance at lower fiscal cost. In the realm of ST, the enhancement of deep learning architectures frequently captures the spotlight; however, the significance of specimen processing and imaging is often understated. This research, informed through a game-theoretic lens, underscores the substantial impact that specimen preparation/imaging can have on spatial transcriptomic inference from morphology. It is essential to integrate such optimized processing protocols to facilitate the identification of prognostic markers at a larger scale.
将深度学习应用于空间转录组学(ST)可以揭示基因表达与组织结构之间的关系。之前的工作已经证明,从组织形态学推断基因表达可以发现这些空间分子标记,从而实现群体规模的研究,减少与大规模空间剖析相关的财政障碍。然而,算法性能的提高大多集中在模型架构的改进上,而对于组织制备和成像质量如何影响形态学空间推断的深度学习模型训练及其在临床上广泛应用的潜力却知之甚少。之前从组织学推断 ST 的研究通常使用人工染色的冷冻切片,并在非临床级扫描仪上进行成像。在 ST 队列上训练此类模型的成本也很高。我们假设,采用符合临床实施标准的组织处理和成像方法(永久切片、自动组织染色和临床级扫描)可以显著提高模型的性能。我们开发了一种增强型标本处理和成像方案,用于基于深度学习的ST形态推断。该方案以 Visium CytAssist 检测为特色,允许自动苏木精和伊红染色(如 Leica Bond)、40× 分辨率成像,并在 ST 分析之前将每个捕获区域的多个患者组织切片连接起来。我们使用一组 13 例病理 T-III 期结直肠癌患者,比较了在使用增强和传统(即手动染色和低分辨率成像)方案制备的切片上训练的模型的性能。利用 Inceptionv3 神经网络,我们使用这两种方案的全切片图像(WSI)预测了连续的、组织学上匹配的组织切片上的基因表达。数据 Shapley 用于量化和比较每个患者因使用增强型方案而获得的边际性能收益与空间剖析的实际成本。研究结果表明,与传统方法相比,通过增强型方案获得的 WSI 进行培训和验证,能以较低的财务成本提高性能。在 ST 领域,深度学习架构的增强经常成为焦点;然而,标本处理和成像的重要性往往被低估。这项研究通过博弈论的视角,强调了标本制备/成像对从形态学进行空间转录组推断的重大影响。必须整合这种优化的处理方案,以促进更大规模的预后标志物鉴定。
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.