Clinical and Multiomic Features Differentiate Young Black and White Breast Cancer Cohorts Derived by Machine Learning Approaches
Kawther Abdilleh, Boris Aguilar, George Acquaah-Mensah
Clinical Breast Cancer (Journal Article), published 2024-11-28. DOI: 10.1016/j.clbc.2024.11.015 (https://doi.org/10.1016/j.clbc.2024.11.015)
Abstract
Background: There are documented differences in breast cancer (BrCA) presentation and outcomes between Black and White patients. In addition to molecular factors, socioeconomic, racial, and clinical factors contribute to disparities in outcomes for women in the United States. Using machine learning and unsupervised biclustering methods within a multiomics framework, we sought to shed light on the biological and clinical underpinnings of the observed differences between Black and White BrCA patients.
Materials and methods: We examined The Cancer Genome Atlas BrCA samples from stage II patients aged 50 or younger who are Black (BAA50) or White (W50) (n = 139 patients; 36 BAA50 and 103 W50). These patients were chosen because marked differences in survival were observed in an earlier study. A variety of multiomic data sets were analyzed to further characterize the clinical and molecular disparities.
Results: We coupled RNAseq data with protein-protein interaction data and BrCA-specific protein co-expression network data to identify 2 novel biclusters. These biclusters are significantly associated with clinical features including race, number of lymph nodes involved with disease, estrogen receptor status, progesterone receptor status, and menopausal status. We also found differentially mutated genes and, using DNA methylation data, differentially methylated genes. Machine learning algorithms trained on the differential methylation values of driver genes successfully predicted the bicluster assignment of each sample.
Conclusion: These results demonstrate a significant association between bicluster membership and the BAA50 and W50 cohorts, indicating that these biclusters accurately stratify the cohorts.
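The abstract does not say which statistical test supported the reported association between bicluster membership and cohort. As a minimal sketch, assuming a 2x2 table of cohort (BAA50 or W50) against bicluster (1 or 2), one common choice is a two-sided Fisher exact test. The function name and the counts below are hypothetical placeholders for illustration, not data or results from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows could be cohorts and columns biclusters; this layout is an
    assumption for illustration, not the authors' stated method.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell,
        # with all row and column totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of every table no more likely than the observed one.
    # The small tolerance guards against floating-point ties.
    return sum(
        p
        for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


if __name__ == "__main__":
    # Placeholder counts only; they are NOT taken from the paper.
    toy_table = (8, 2, 1, 5)
    print(fisher_exact_two_sided(*toy_table))
```

A small p-value from such a test would indicate that bicluster membership and cohort are not independent. It does not by itself measure how accurately the biclusters separate the cohorts.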
About the journal:
Clinical Breast Cancer is a peer-reviewed bimonthly journal that publishes original articles describing various aspects of clinical and translational research of breast cancer. Clinical Breast Cancer is devoted to articles on detection, diagnosis, prevention, and treatment of breast cancer. The main emphasis is on recent scientific developments in all areas related to breast cancer. Specific areas of interest include clinical research reports from various therapeutic modalities, cancer genetics, drug sensitivity and resistance, novel imaging, tumor genomics, biomarkers, and chemoprevention strategies.