Anna-Lisa Doebley, Hanna Liao, C. Kikawa, Eden Cruikshank, Minjeong Ko, A. C. Hoge, Joseph B Hiatt, N. Sarkar, V. Adalsteinsson, P. Polak, D. MacPherson, P. Nelson, H. Parsons, D. Stover, G. Ha
{"title":"摘要:LB022: Griffin:一种利用无细胞DNA超低通全基因组测序进行核小体分析和乳腺癌亚型预测的方法","authors":"Anna-Lisa Doebley, Hanna Liao, C. Kikawa, Eden Cruikshank, Minjeong Ko, A. C. Hoge, Joseph B Hiatt, N. Sarkar, V. Adalsteinsson, P. Polak, D. MacPherson, P. Nelson, H. Parsons, D. Stover, G. Ha","doi":"10.1158/1538-7445.AM2021-LB022","DOIUrl":null,"url":null,"abstract":"Background: Cell-free DNA (cfDNA) is released from dying cells, including tumor cells, and can be isolated from peripheral blood for studying cancer. In the bloodstream, cfDNA is protected from degradation by nucleosomes and other DNA binding proteins, leading to a coverage pattern that reflects the genomic organization in the cells-of-origin. Recent work has shown that it is possible to use this pattern to predict gene and transcription factor activity in cancer cells. This is known as nucleosome profiling. Breast cancer is among the most common causes of cancer, accounting for 23% of cancer diagnoses and 14% of cancer-related deaths among women worldwide. Targeted therapy is guided by tumor subtype, including the expression of three key receptors: ER, PR and HER2. Typically, subtyping involves a tumor biopsy and immunohistochemistry. However, in late-stage cancer, surgical biopsies for disease monitoring are difficult to obtain. Accurate subtype determination is critical to address hormone subtype switches during metastasis or treatment resistance. cfDNA offers an alternative, non-invasive method for identifying tumor subtypes through nucleosome profiling and, to the best of our knowledge, has not been shown for breast cancer. Methods: We developed a method, called Griffin, to examine nucleosome protection and genome accessibility by quantifying cfDNA fragments around accessible sites. Unlike previous methods, Griffin uses fragment length-based GC correction to remove GC biases that obscure signals. We used ATAC-seq data from TCGA to identify differentially accessible sites between ER positive and negative breast cancers. We developed a machine learning classifier that predicts ER subtype based upon the signals at these differentially accessible sites. Results: We then tested Griffin by examining differentially accessible sites in ultra-low pass sequencing (ULP-WGS, 0.1X) of several hundred cfDNA samples from patients with ER positive or negative breast cancer. We found that overall, differential sites were more accessible in the cfDNA of their respective subtypes. Additionally, we found that site accessibility within patient cfDNA samples was correlated to the cfDNA tumor fraction. We built and tested a prediction model with cross-validation, which revealed an accuracy of >80% for correctly classifying tumor status as ER positive or negative from this ULP-WGS dataset. Conclusion: This study has several novel aspects compared to prior nucleosome profiling approaches. First, we use fragment-based GC correction which reduces sample variability and allows us to observe previously obscured signals. Second, we demonstrated that signals are correlated to tumor fraction. And finally, we applied this method to cost-effective and scalable ULP-WGS of breast cancer and demonstrated the ability to predict breast cancer ER subtype in these samples. Citation Format: Anna-Lisa Doebley, Hanna Liao, Caroline Kikawa, Eden Cruikshank, Minjeong Ko, Anna Hoge, Joseph Hiatt, Navonil De Sarkar, Viktor A. Adalsteinsson, Paz Polak, David MacPherson, Peter S. Nelson, Heather A. Parsons, Daniel Stover, Gavin Ha. Griffin: A method for nucleosome profiling and breast cancer subtype prediction from ultra-low pass whole genome sequencing of cell-free DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr LB022.","PeriodicalId":73617,"journal":{"name":"Journal of bioinformatics and systems biology : Open access","volume":"43 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Abstract LB022: Griffin: A method for nucleosome profiling and breast cancer subtype prediction from ultra-low pass whole genome sequencing of cell-free DNA\",\"authors\":\"Anna-Lisa Doebley, Hanna Liao, C. Kikawa, Eden Cruikshank, Minjeong Ko, A. C. Hoge, Joseph B Hiatt, N. Sarkar, V. Adalsteinsson, P. Polak, D. MacPherson, P. Nelson, H. Parsons, D. Stover, G. Ha\",\"doi\":\"10.1158/1538-7445.AM2021-LB022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Cell-free DNA (cfDNA) is released from dying cells, including tumor cells, and can be isolated from peripheral blood for studying cancer. In the bloodstream, cfDNA is protected from degradation by nucleosomes and other DNA binding proteins, leading to a coverage pattern that reflects the genomic organization in the cells-of-origin. Recent work has shown that it is possible to use this pattern to predict gene and transcription factor activity in cancer cells. This is known as nucleosome profiling. Breast cancer is among the most common causes of cancer, accounting for 23% of cancer diagnoses and 14% of cancer-related deaths among women worldwide. Targeted therapy is guided by tumor subtype, including the expression of three key receptors: ER, PR and HER2. Typically, subtyping involves a tumor biopsy and immunohistochemistry. However, in late-stage cancer, surgical biopsies for disease monitoring are difficult to obtain. Accurate subtype determination is critical to address hormone subtype switches during metastasis or treatment resistance. cfDNA offers an alternative, non-invasive method for identifying tumor subtypes through nucleosome profiling and, to the best of our knowledge, has not been shown for breast cancer. Methods: We developed a method, called Griffin, to examine nucleosome protection and genome accessibility by quantifying cfDNA fragments around accessible sites. Unlike previous methods, Griffin uses fragment length-based GC correction to remove GC biases that obscure signals. We used ATAC-seq data from TCGA to identify differentially accessible sites between ER positive and negative breast cancers. We developed a machine learning classifier that predicts ER subtype based upon the signals at these differentially accessible sites. Results: We then tested Griffin by examining differentially accessible sites in ultra-low pass sequencing (ULP-WGS, 0.1X) of several hundred cfDNA samples from patients with ER positive or negative breast cancer. We found that overall, differential sites were more accessible in the cfDNA of their respective subtypes. Additionally, we found that site accessibility within patient cfDNA samples was correlated to the cfDNA tumor fraction. We built and tested a prediction model with cross-validation, which revealed an accuracy of >80% for correctly classifying tumor status as ER positive or negative from this ULP-WGS dataset. Conclusion: This study has several novel aspects compared to prior nucleosome profiling approaches. First, we use fragment-based GC correction which reduces sample variability and allows us to observe previously obscured signals. Second, we demonstrated that signals are correlated to tumor fraction. And finally, we applied this method to cost-effective and scalable ULP-WGS of breast cancer and demonstrated the ability to predict breast cancer ER subtype in these samples. Citation Format: Anna-Lisa Doebley, Hanna Liao, Caroline Kikawa, Eden Cruikshank, Minjeong Ko, Anna Hoge, Joseph Hiatt, Navonil De Sarkar, Viktor A. Adalsteinsson, Paz Polak, David MacPherson, Peter S. Nelson, Heather A. Parsons, Daniel Stover, Gavin Ha. Griffin: A method for nucleosome profiling and breast cancer subtype prediction from ultra-low pass whole genome sequencing of cell-free DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr LB022.\",\"PeriodicalId\":73617,\"journal\":{\"name\":\"Journal of bioinformatics and systems biology : Open access\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of bioinformatics and systems biology : Open access\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1158/1538-7445.AM2021-LB022\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of bioinformatics and systems biology : Open access","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1158/1538-7445.AM2021-LB022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
背景:游离DNA (Cell-free DNA, cfDNA)是从包括肿瘤细胞在内的垂死细胞中释放出来的,可以从外周血中分离出来用于研究癌症。在血液中,cfDNA受到核小体和其他DNA结合蛋白的保护,不被降解,从而形成一种反映起源细胞基因组组织的覆盖模式。最近的研究表明,利用这种模式来预测癌细胞中的基因和转录因子活性是可能的。这被称为核小体分析。乳腺癌是最常见的癌症原因之一,占全球女性癌症诊断的23%和癌症相关死亡的14%。靶向治疗以肿瘤亚型为指导,包括三种关键受体:ER、PR和HER2的表达。通常,分型包括肿瘤活检和免疫组织化学。然而,在晚期癌症中,很难获得用于疾病监测的手术活检。准确的亚型测定对于解决转移或治疗抵抗期间的激素亚型转换至关重要。cfDNA提供了一种替代的、非侵入性的方法,通过核小体分析来识别肿瘤亚型,据我们所知,cfDNA还没有被用于乳腺癌。方法:我们开发了一种称为Griffin的方法,通过定量可达位点周围的cfDNA片段来检测核小体保护和基因组可达性。与以前的方法不同,Griffin使用基于片段长度的GC校正来去除模糊信号的GC偏差。我们使用来自TCGA的ATAC-seq数据来确定ER阳性和阴性乳腺癌之间可访问的差异位点。我们开发了一种机器学习分类器,可以根据这些不同可访问位置的信号预测ER亚型。结果:我们随后通过对来自ER阳性或阴性乳腺癌患者的数百个cfDNA样本进行超低通过测序(ULP-WGS, 0.1X)检查差异可达位点来测试Griffin。我们发现,总的来说,在各自亚型的cfDNA中,差异位点更容易被访问。此外,我们发现患者cfDNA样本中的位点可及性与cfDNA肿瘤分数相关。我们建立并测试了一个交叉验证的预测模型,该模型显示,从ULP-WGS数据集中正确分类肿瘤状态为ER阳性或阴性的准确率>80%。结论:与之前的核小体分析方法相比,这项研究有几个新的方面。首先,我们使用基于片段的GC校正,这减少了样本可变性,使我们能够观察到以前模糊的信号。其次,我们证明了信号与肿瘤分数相关。最后,我们将该方法应用于具有成本效益和可扩展的乳腺癌ULP-WGS,并证明了在这些样本中预测乳腺癌ER亚型的能力。引文格式:Anna- lisa Doebley, Hanna Liao, Caroline Kikawa, Eden Cruikshank, Minjeong Ko, Anna Hoge, Joseph Hiatt, Navonil De Sarkar, Viktor A. Adalsteinsson, Paz Polak, David MacPherson, Peter S. Nelson, Heather A. Parsons, Daniel Stover, Gavin Ha。Griffin:一种利用无细胞DNA超低通全基因组测序进行核小体谱分析和乳腺癌亚型预测的方法[摘要]。见:美国癌症研究协会2021年年会论文集;2021年4月10日至15日和5月17日至21日。费城(PA): AACR;癌症杂志,2021;81(13 -增刊):摘要nr LB022。
Abstract LB022: Griffin: A method for nucleosome profiling and breast cancer subtype prediction from ultra-low pass whole genome sequencing of cell-free DNA
Background: Cell-free DNA (cfDNA) is released from dying cells, including tumor cells, and can be isolated from peripheral blood for studying cancer. In the bloodstream, cfDNA is protected from degradation by nucleosomes and other DNA binding proteins, leading to a coverage pattern that reflects the genomic organization in the cells-of-origin. Recent work has shown that it is possible to use this pattern to predict gene and transcription factor activity in cancer cells. This is known as nucleosome profiling. Breast cancer is among the most common causes of cancer, accounting for 23% of cancer diagnoses and 14% of cancer-related deaths among women worldwide. Targeted therapy is guided by tumor subtype, including the expression of three key receptors: ER, PR and HER2. Typically, subtyping involves a tumor biopsy and immunohistochemistry. However, in late-stage cancer, surgical biopsies for disease monitoring are difficult to obtain. Accurate subtype determination is critical to address hormone subtype switches during metastasis or treatment resistance. cfDNA offers an alternative, non-invasive method for identifying tumor subtypes through nucleosome profiling and, to the best of our knowledge, has not been shown for breast cancer. Methods: We developed a method, called Griffin, to examine nucleosome protection and genome accessibility by quantifying cfDNA fragments around accessible sites. Unlike previous methods, Griffin uses fragment length-based GC correction to remove GC biases that obscure signals. We used ATAC-seq data from TCGA to identify differentially accessible sites between ER positive and negative breast cancers. We developed a machine learning classifier that predicts ER subtype based upon the signals at these differentially accessible sites. Results: We then tested Griffin by examining differentially accessible sites in ultra-low pass sequencing (ULP-WGS, 0.1X) of several hundred cfDNA samples from patients with ER positive or negative breast cancer. We found that overall, differential sites were more accessible in the cfDNA of their respective subtypes. Additionally, we found that site accessibility within patient cfDNA samples was correlated to the cfDNA tumor fraction. We built and tested a prediction model with cross-validation, which revealed an accuracy of >80% for correctly classifying tumor status as ER positive or negative from this ULP-WGS dataset. Conclusion: This study has several novel aspects compared to prior nucleosome profiling approaches. First, we use fragment-based GC correction which reduces sample variability and allows us to observe previously obscured signals. Second, we demonstrated that signals are correlated to tumor fraction. And finally, we applied this method to cost-effective and scalable ULP-WGS of breast cancer and demonstrated the ability to predict breast cancer ER subtype in these samples. Citation Format: Anna-Lisa Doebley, Hanna Liao, Caroline Kikawa, Eden Cruikshank, Minjeong Ko, Anna Hoge, Joseph Hiatt, Navonil De Sarkar, Viktor A. Adalsteinsson, Paz Polak, David MacPherson, Peter S. Nelson, Heather A. Parsons, Daniel Stover, Gavin Ha. Griffin: A method for nucleosome profiling and breast cancer subtype prediction from ultra-low pass whole genome sequencing of cell-free DNA [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr LB022.