{"title":"整合组织病理微环境和EHR表型的可解释的多模态人工智能模型用于乳腺癌种系基因检测。","authors":"Zijian Yang, Changyuan Guo, Jiayi Li, Yalun Li, Lei Zhong, Pengming Pu, Tongxuan Shang, Lin Cong, Yongxin Zhou, Guangdong Qiao, Ziqi Jia, Hengyi Xu, Heng Cao, Yansong Huang, Tianyi Liu, Jian Liang, Jiang Wu, Dongxu Ma, Yuchen Liu, Ruijie Zhou, Xiang Wang, Jianming Ying, Meng Zhou, Jiaqi Liu","doi":"10.1002/advs.202502833","DOIUrl":null,"url":null,"abstract":"<p><p>Genetic testing for pathogenic germline variants is critical for the personalized management of high-risk breast cancers, guiding targeted therapies and cascade testing for at-risk families. In this study, MAIGGT (Multimodal Artificial Intelligence Germline Genetic Testing) is proposed, a deep learning framework that integrates histopathological microenvironment features from whole-slide images with clinical phenotypes from electronic health records for precise prescreening of germline BRCA1/2 mutations. Leveraging a multi-scale Transformer-based deep generative architecture, MAIGGT employs a cross-modal latent representation unification mechanism to capture complementary biological insights from multimodal data. MAIGGT is rigorously validated across three independent cohorts and demonstrated robust performance with areas under receiver operating characteristic curves of 0.925 (95% CI 0.868 - 0.982), 0.845 (95% CI 0.779 - 0.911), and 0.833 (0.788 - 0.878), outperforming single-modality models. Mechanistic interpretability analyses revealed that BRCA1/2-mutated associated tumors may exhibit distinct microenvironment patterns, including increased inflammatory cell infiltration, stromal proliferation and necrosis, and nuclear heterogeneity. By bridging digital pathology with clinical phenotypes, MAIGGT establishes a new paradigm for cost-effective, scalable, and biologically interpretable prescreening of hereditary breast cancer, with the potential to significantly improve the accessibility of genetic testing in routine clinical practice.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e02833"},"PeriodicalIF":14.3000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Explainable Multimodal Artificial Intelligence Model Integrating Histopathological Microenvironment and EHR Phenotypes for Germline Genetic Testing in Breast Cancer.\",\"authors\":\"Zijian Yang, Changyuan Guo, Jiayi Li, Yalun Li, Lei Zhong, Pengming Pu, Tongxuan Shang, Lin Cong, Yongxin Zhou, Guangdong Qiao, Ziqi Jia, Hengyi Xu, Heng Cao, Yansong Huang, Tianyi Liu, Jian Liang, Jiang Wu, Dongxu Ma, Yuchen Liu, Ruijie Zhou, Xiang Wang, Jianming Ying, Meng Zhou, Jiaqi Liu\",\"doi\":\"10.1002/advs.202502833\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Genetic testing for pathogenic germline variants is critical for the personalized management of high-risk breast cancers, guiding targeted therapies and cascade testing for at-risk families. In this study, MAIGGT (Multimodal Artificial Intelligence Germline Genetic Testing) is proposed, a deep learning framework that integrates histopathological microenvironment features from whole-slide images with clinical phenotypes from electronic health records for precise prescreening of germline BRCA1/2 mutations. Leveraging a multi-scale Transformer-based deep generative architecture, MAIGGT employs a cross-modal latent representation unification mechanism to capture complementary biological insights from multimodal data. MAIGGT is rigorously validated across three independent cohorts and demonstrated robust performance with areas under receiver operating characteristic curves of 0.925 (95% CI 0.868 - 0.982), 0.845 (95% CI 0.779 - 0.911), and 0.833 (0.788 - 0.878), outperforming single-modality models. Mechanistic interpretability analyses revealed that BRCA1/2-mutated associated tumors may exhibit distinct microenvironment patterns, including increased inflammatory cell infiltration, stromal proliferation and necrosis, and nuclear heterogeneity. By bridging digital pathology with clinical phenotypes, MAIGGT establishes a new paradigm for cost-effective, scalable, and biologically interpretable prescreening of hereditary breast cancer, with the potential to significantly improve the accessibility of genetic testing in routine clinical practice.</p>\",\"PeriodicalId\":117,\"journal\":{\"name\":\"Advanced Science\",\"volume\":\" \",\"pages\":\"e02833\"},\"PeriodicalIF\":14.3000,\"publicationDate\":\"2025-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Science\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://doi.org/10.1002/advs.202502833\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202502833","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
致病种系变异的基因检测对于高危乳腺癌的个性化管理、指导靶向治疗和高危家庭的级联检测至关重要。在这项研究中,提出了MAIGGT(多模式人工智能种系基因检测),这是一个深度学习框架,将来自全幻灯片图像的组织病理学微环境特征与来自电子健康记录的临床表型相结合,用于精确筛选种系BRCA1/2突变。利用基于多尺度transformer的深度生成架构,MAIGGT采用跨模态潜在表示统一机制从多模态数据中捕获互补的生物学见解。MAIGGT在三个独立队列中进行了严格验证,在受试者工作特征曲线下的面积为0.925 (95% CI 0.868 - 0.982)、0.845 (95% CI 0.779 - 0.911)和0.833(0.788 - 0.878),表现出稳健的性能,优于单模态模型。机制可解释性分析显示,brca1 /2突变相关肿瘤可能表现出不同的微环境模式,包括炎症细胞浸润增加、间质增殖和坏死以及核异质性。通过将数字病理学与临床表型连接起来,MAIGGT建立了一个具有成本效益、可扩展和生物学可解释的遗传性乳腺癌预筛查的新范例,具有显著提高常规临床实践中基因检测可及性的潜力。
An Explainable Multimodal Artificial Intelligence Model Integrating Histopathological Microenvironment and EHR Phenotypes for Germline Genetic Testing in Breast Cancer.
Genetic testing for pathogenic germline variants is critical for the personalized management of high-risk breast cancers, guiding targeted therapies and cascade testing for at-risk families. In this study, MAIGGT (Multimodal Artificial Intelligence Germline Genetic Testing) is proposed, a deep learning framework that integrates histopathological microenvironment features from whole-slide images with clinical phenotypes from electronic health records for precise prescreening of germline BRCA1/2 mutations. Leveraging a multi-scale Transformer-based deep generative architecture, MAIGGT employs a cross-modal latent representation unification mechanism to capture complementary biological insights from multimodal data. MAIGGT is rigorously validated across three independent cohorts and demonstrated robust performance with areas under receiver operating characteristic curves of 0.925 (95% CI 0.868 - 0.982), 0.845 (95% CI 0.779 - 0.911), and 0.833 (0.788 - 0.878), outperforming single-modality models. Mechanistic interpretability analyses revealed that BRCA1/2-mutated associated tumors may exhibit distinct microenvironment patterns, including increased inflammatory cell infiltration, stromal proliferation and necrosis, and nuclear heterogeneity. By bridging digital pathology with clinical phenotypes, MAIGGT establishes a new paradigm for cost-effective, scalable, and biologically interpretable prescreening of hereditary breast cancer, with the potential to significantly improve the accessibility of genetic testing in routine clinical practice.
期刊介绍:
Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.