乳腺癌基因表达的综合元分析

Ifeanyichukwu O Nwosu, Stephen R Piccolo
{"title":"乳腺癌基因表达的综合元分析","authors":"Ifeanyichukwu O Nwosu, Stephen R Piccolo","doi":"10.1101/2024.08.30.610515","DOIUrl":null,"url":null,"abstract":"Background: Triple-negative breast cancers (TNBC) occur more frequently in African Americans and are associated with worse outcomes when compared to other subtypes of breast cancer. These cancers lack expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) and have limited treatment options. To shed light on mechanisms behind these differences and suggest novel treatments, we used a meta-analytic approach to identify gene expression differences in breast tumors for people with self-reported African or European ancestry; additionally, we compared gene expression levels based on ER, PR, HER2 and TNBC status. Methods: After gathering and standardizing gene expression data and metadata from 106 datasets (representing 27,000 samples), we identified genes that were expressed differently between these groups via random-effects meta-analyses. To evaluate the robustness of these gene lists, we devised a novel computational methodology that uses cross validation and classification. We also computed overlaps between the most significant genes and known signaling pathways. Results: Using a false discovery rate threshold of 0.05, we identified genes that are known to play a significant role in their respective breast cancer subtypes (e.g., ESR1 for ER status and ERBB2 for HER2 status), thus confirming the validity of our findings. We also discovered genes that have not been reported previously and may be new targets for breast cancer therapy. GATA3, CA12, TBC1D9, XBP1 and FOXA1 were among the most significant genes for ER, PR, and TNBC. However, none of these genes overlapped with HER2 status, supporting prior research that HER2 tumors are mechanistically different from endocrine breast cancers. The genes identified from the race meta-analysis-including DNAJC15, HLA-DPA1, STAP2, CEP68, MOGS-have not been associated previously with race-specific breast-cancer outcomes, highlighting a potential area of further research. Conclusions: We have carried out a large meta-analysis of breast cancer gene expression data, identifying novel genes that may serve as potential biomarkers for breast cancer in diverse populations. We have also developed a computational method that identifies gene sets small enough to be analyzed and explored in future studies. This method has the potential to be applied to other cancers.","PeriodicalId":501233,"journal":{"name":"bioRxiv - Cancer Biology","volume":"64 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comprehensive Meta-Analysis of Breast Cancer Gene Expression\",\"authors\":\"Ifeanyichukwu O Nwosu, Stephen R Piccolo\",\"doi\":\"10.1101/2024.08.30.610515\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Triple-negative breast cancers (TNBC) occur more frequently in African Americans and are associated with worse outcomes when compared to other subtypes of breast cancer. These cancers lack expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) and have limited treatment options. To shed light on mechanisms behind these differences and suggest novel treatments, we used a meta-analytic approach to identify gene expression differences in breast tumors for people with self-reported African or European ancestry; additionally, we compared gene expression levels based on ER, PR, HER2 and TNBC status. Methods: After gathering and standardizing gene expression data and metadata from 106 datasets (representing 27,000 samples), we identified genes that were expressed differently between these groups via random-effects meta-analyses. To evaluate the robustness of these gene lists, we devised a novel computational methodology that uses cross validation and classification. We also computed overlaps between the most significant genes and known signaling pathways. Results: Using a false discovery rate threshold of 0.05, we identified genes that are known to play a significant role in their respective breast cancer subtypes (e.g., ESR1 for ER status and ERBB2 for HER2 status), thus confirming the validity of our findings. We also discovered genes that have not been reported previously and may be new targets for breast cancer therapy. GATA3, CA12, TBC1D9, XBP1 and FOXA1 were among the most significant genes for ER, PR, and TNBC. However, none of these genes overlapped with HER2 status, supporting prior research that HER2 tumors are mechanistically different from endocrine breast cancers. The genes identified from the race meta-analysis-including DNAJC15, HLA-DPA1, STAP2, CEP68, MOGS-have not been associated previously with race-specific breast-cancer outcomes, highlighting a potential area of further research. Conclusions: We have carried out a large meta-analysis of breast cancer gene expression data, identifying novel genes that may serve as potential biomarkers for breast cancer in diverse populations. We have also developed a computational method that identifies gene sets small enough to be analyzed and explored in future studies. This method has the potential to be applied to other cancers.\",\"PeriodicalId\":501233,\"journal\":{\"name\":\"bioRxiv - Cancer Biology\",\"volume\":\"64 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv - Cancer Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.08.30.610515\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv - Cancer Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.30.610515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:三阴性乳腺癌(TNBC)在非裔美国人中的发病率较高,与其他亚型乳腺癌相比,其治疗效果较差。这些癌症缺乏雌激素受体(ER)、孕激素受体(PR)和人表皮生长因子受体 2(HER2)的表达,治疗方案有限。为了揭示这些差异背后的机制并提出新的治疗方法,我们采用了一种荟萃分析方法来确定自称具有非洲或欧洲血统的人乳腺肿瘤中的基因表达差异;此外,我们还比较了基于ER、PR、HER2和TNBC状态的基因表达水平。研究方法从 106 个数据集(代表 27,000 个样本)中收集基因表达数据和元数据并对其进行标准化处理后,我们通过随机效应荟萃分析确定了这些群体间表达不同的基因。为了评估这些基因列表的稳健性,我们设计了一种使用交叉验证和分类的新型计算方法。我们还计算了最重要基因与已知信号通路之间的重叠。研究结果利用 0.05 的错误发现率阈值,我们发现了已知在各自乳腺癌亚型中起重要作用的基因(例如,ESR1 代表 ER 状态,ERBB2 代表 HER2 状态),从而证实了我们研究结果的有效性。我们还发现了以前未曾报道过的基因,它们可能成为乳腺癌治疗的新靶点。GATA3、CA12、TBC1D9、XBP1 和 FOXA1 是对 ER、PR 和 TNBC 最重要的基因。然而,这些基因都没有与 HER2 状态重叠,这支持了之前的研究,即 HER2 肿瘤与内分泌乳腺癌在机理上是不同的。从种族荟萃分析中发现的基因--包括DNAJC15、HLA-DPA1、STAP2、CEP68和MOGS--以前从未与种族特异性乳腺癌结果相关联,这凸显了进一步研究的潜在领域。结论:我们对乳腺癌基因表达数据进行了大规模的荟萃分析,发现了可作为不同人群乳腺癌潜在生物标志物的新基因。我们还开发了一种计算方法,可以识别足够小的基因集,以便在未来的研究中进行分析和探索。这种方法有可能应用于其他癌症。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Comprehensive Meta-Analysis of Breast Cancer Gene Expression
Background: Triple-negative breast cancers (TNBC) occur more frequently in African Americans and are associated with worse outcomes when compared to other subtypes of breast cancer. These cancers lack expression of estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER2) and have limited treatment options. To shed light on mechanisms behind these differences and suggest novel treatments, we used a meta-analytic approach to identify gene expression differences in breast tumors for people with self-reported African or European ancestry; additionally, we compared gene expression levels based on ER, PR, HER2 and TNBC status. Methods: After gathering and standardizing gene expression data and metadata from 106 datasets (representing 27,000 samples), we identified genes that were expressed differently between these groups via random-effects meta-analyses. To evaluate the robustness of these gene lists, we devised a novel computational methodology that uses cross validation and classification. We also computed overlaps between the most significant genes and known signaling pathways. Results: Using a false discovery rate threshold of 0.05, we identified genes that are known to play a significant role in their respective breast cancer subtypes (e.g., ESR1 for ER status and ERBB2 for HER2 status), thus confirming the validity of our findings. We also discovered genes that have not been reported previously and may be new targets for breast cancer therapy. GATA3, CA12, TBC1D9, XBP1 and FOXA1 were among the most significant genes for ER, PR, and TNBC. However, none of these genes overlapped with HER2 status, supporting prior research that HER2 tumors are mechanistically different from endocrine breast cancers. The genes identified from the race meta-analysis-including DNAJC15, HLA-DPA1, STAP2, CEP68, MOGS-have not been associated previously with race-specific breast-cancer outcomes, highlighting a potential area of further research. Conclusions: We have carried out a large meta-analysis of breast cancer gene expression data, identifying novel genes that may serve as potential biomarkers for breast cancer in diverse populations. We have also developed a computational method that identifies gene sets small enough to be analyzed and explored in future studies. This method has the potential to be applied to other cancers.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信