Unsupervised single-cell analysis in triple-negative breast cancer: A case study

A. Athreya, Alan J. Gaglio, Z. Kalbarczyk, R. Iyer, J. Cairns, Krishna R. Kalari, R. Weinshilboum, Liewei Wang
{"title":"Unsupervised single-cell analysis in triple-negative breast cancer: A case study","authors":"A. Athreya, Alan J. Gaglio, Z. Kalbarczyk, R. Iyer, J. Cairns, Krishna R. Kalari, R. Weinshilboum, Liewei Wang","doi":"10.1109/BIBM.2016.7822581","DOIUrl":null,"url":null,"abstract":"This paper demonstrates an unsupervised learning approach to identify genes with significant differential expression across single-cell subpopulations induced by therapeutic treatment. Identifying this set of genes makes it possible to use well-established bioinformatics approaches such as pathway analysis to establish their biological relevance. Then, a biologist can use his/her prior knowledge to investigate in the laboratory, a few particular candidates among the subset of genes overlapping with relevant pathways. Due to the large size of the human genome and limitations in cost and skilled resources, biologists benefit from analytical methods combined with pathway analysis to design laboratory experiments focusing on only a few significant genes. As an example, we show how model-based unsupervised methods can identify a small set of genes (1% of the genome) that have significant differential expression in single-cells and are also highly correlated to pathways (p-value < 1E − 7) with anticancer effects driven by the antidiabetic drug metformin. Further analysis of genes on these relevant pathways reveal three candidate genes previously implicated in several anticancer mechanisms in other cancers, not driven by metformin. Identification of these genes can help biologists and clinicians design laboratory experiments to establish the molecular mechanisms of metformin in triple-negative breast cancer. In a domain where there is no prior knowledge of small biologically significant data, we demonstrate that careful data-driven methods can infer such significant small data to explain biological mechanisms.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2016.7822581","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

This paper demonstrates an unsupervised learning approach to identify genes with significant differential expression across single-cell subpopulations induced by therapeutic treatment. Identifying this set of genes makes it possible to use well-established bioinformatics approaches such as pathway analysis to establish their biological relevance. Then, a biologist can use his/her prior knowledge to investigate in the laboratory, a few particular candidates among the subset of genes overlapping with relevant pathways. Due to the large size of the human genome and limitations in cost and skilled resources, biologists benefit from analytical methods combined with pathway analysis to design laboratory experiments focusing on only a few significant genes. As an example, we show how model-based unsupervised methods can identify a small set of genes (1% of the genome) that have significant differential expression in single-cells and are also highly correlated to pathways (p-value < 1E − 7) with anticancer effects driven by the antidiabetic drug metformin. Further analysis of genes on these relevant pathways reveal three candidate genes previously implicated in several anticancer mechanisms in other cancers, not driven by metformin. Identification of these genes can help biologists and clinicians design laboratory experiments to establish the molecular mechanisms of metformin in triple-negative breast cancer. In a domain where there is no prior knowledge of small biologically significant data, we demonstrate that careful data-driven methods can infer such significant small data to explain biological mechanisms.
无监督的单细胞分析在三阴性乳腺癌:一个案例研究
本文展示了一种无监督学习方法来识别治疗性治疗诱导的单细胞亚群中显著差异表达的基因。识别这组基因使得使用成熟的生物信息学方法(如通路分析)来确定它们的生物学相关性成为可能。然后,生物学家可以利用他/她的先验知识在实验室中进行调查,在与相关途径重叠的基因子集中找到一些特定的候选基因。由于人类基因组的庞大规模以及成本和技术资源的限制,生物学家受益于分析方法与途径分析相结合,以设计仅关注少数重要基因的实验室实验。作为一个例子,我们展示了基于模型的无监督方法如何识别一小组基因(基因组的1%),这些基因在单细胞中具有显著的差异表达,并且与抗糖尿病药物二甲双胍驱动的抗癌作用通路高度相关(p值< 1E−7)。对这些相关通路上基因的进一步分析揭示了三个候选基因先前与其他癌症的几种抗癌机制有关,而不是由二甲双胍驱动的。这些基因的鉴定可以帮助生物学家和临床医生设计实验室实验,以建立二甲双胍在三阴性乳腺癌中的分子机制。在一个没有重要的小生物数据先验知识的领域,我们证明了谨慎的数据驱动方法可以推断出如此重要的小数据来解释生物机制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信