Abstract LB410: Precision oral cancer screening and diagnostic solution using DNA methylation and machine learning to stratify high-risk lesions in saliva from patients of mixed ancestry
Ashley Ramos-Lopez, Amanda Garcia Negron, Guie Beeu Guerrero Hunt, Adhi Guerrero-Thillet, Carolina Zambrano Rabanal, Paola Quiñonez Mendez, Andrea Lopez-Marrero, Alvaro Gutierrez, Fernando Zamuner, Bruce J. Trock, Wayne Koch, Mariana Brait, David Sidransky, Rafael Guerrero-Preston
{"title":"Abstract LB410: Precision oral cancer screening and diagnostic solution using DNA methylation and machine learning to stratify high-risk lesions in saliva from patients of mixed ancestry","authors":"Ashley Ramos-Lopez, Amanda Garcia Negron, Guie Beeu Guerrero Hunt, Adhi Guerrero-Thillet, Carolina Zambrano Rabanal, Paola Quiñonez Mendez, Andrea Lopez-Marrero, Alvaro Gutierrez, Fernando Zamuner, Bruce J. Trock, Wayne Koch, Mariana Brait, David Sidransky, Rafael Guerrero-Preston","doi":"10.1158/1538-7445.am2025-lb410","DOIUrl":null,"url":null,"abstract":"Analysis of quantitative methylation specific PCR (qMSP) data for diagnosis and early detection of cancer has consisted of summarizing singleplex or multiplex DNA data in a cumulative methylation index, followed by threshold analyses. Recently, a novel clustering algorithm was used to examine digital PCR data, prior to downstream analysis. However, clustering of Real-Time qMSP data has not been used by laboratories developing DNA methylation biomarkers for the oral cancer screening and diagnostic space. We created a precision DNA methylation algorithm to quantify Differentially Methylated Promoters (DMPs) with Real-Time PCR instruments, combined with machine learning, for discovery and validation of head and neck squamous cell carcinoma (HNSCC) early detection, diagnosis, and prognostication targets. Analytic validation of PAX1, PAX5, ZIC4, PLCB1, and HHIP was performed to develop a qMSP protocol for clinical samples. The performance of the six singleplex reactions was tested in 307 oral cancer tissue and 55 normal uvulopalatopharyngoplasty (UPPP) samples from a mixed ancestry cohort (40% Black) obtained from the Johns Hopkins School of Medicine Head and Neck Cancer Tumor Bank. An R script for automated analysis of qMSP data was developed to import, process, and analyze multiple qMSP raw data files exported from Applied Biosystems SDS or DA2 software packages. The workflow includes data preprocessing; filtering by quality control metrics, such as CT and PCR efficiency; normalizing against a control gene (Bactin), and visualizing results through boxplots. A precision DNA methylation algorithm was then developed to perform unsupervised hierarchical clustering of singleplex qMSP observations for five genes, center the data, calculate the distance between all samples, determine the variance explained by each Principal Component (PC), set a cutoff DNA methylation value that maximizes performance for each gene and identify the best model fit. Logistic regression, Linear Discriminant Analysis, Loess, K nearest neighbor, and Random Forrest models, as well as an ensemble of all five models were then trained to model the relationship between test samples and PAX1, PAX5, ZIC4, PLCB1, HHIP DMPs. Model performance was compared based on accuracy and logistic regression was used for downstream analyses. The discriminatory ability of the five genes was evaluated using the Receiver Operator Characteristic (ROC) curve and Area Under the Curve (AUC) analyses. The best performance was obtained when using all five genes (PAX1, PAX5, ZIC4, PLCB1, HHIP): 94% Sensitivity, 96% Specificity, 97% Positive Predictive Value, 91% Negative Predictive Value, correctly classifying 95% with an AUC = 0.99. We also found HHIP fully discriminated between normal and tumor samples in a smaller subset of saliva samples (n=73). The normalized tissue-saliva Inter Quartile Range (IQR) ratio of HHIP DNA methylation was 98%. These results warrant to be validated in a larger cohort. RealTime PCR based tests have shown to be cost effective and scalable. The exceptional discriminatory power between normal and cancer taken into the context of post COVID excess installed Real Time PCR instruments, hold a promise of improving oral cancer early detection and diagnostic pipelines worldwide. Citation Format: Ashley Ramos-Lopez, Amanda Garcia Negron, Guie Beeu Guerrero Hunt, Adhi Guerrero-Thillet, Carolina Zambrano Rabanal, Paola Quiñonez Mendez, Andrea Lopez-Marrero, Alvaro Gutierrez, Fernando Zamuner, Bruce J. Trock, Wayne Koch, Mariana Brait, David Sidransky, Rafael Guerrero-Preston. Precision oral cancer screening and diagnostic solution using DNA methylation and machine learning to stratify high-risk lesions in saliva from patients of mixed ancestry [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2025; Part 2 (Late-Breaking, Clinical Trial, and Invited s); 2025 Apr 25-30; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2025;85(8_Suppl_2): nr LB410.","PeriodicalId":9441,"journal":{"name":"Cancer research","volume":"15 1","pages":""},"PeriodicalIF":12.5000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1158/1538-7445.am2025-lb410","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Analysis of quantitative methylation specific PCR (qMSP) data for diagnosis and early detection of cancer has consisted of summarizing singleplex or multiplex DNA data in a cumulative methylation index, followed by threshold analyses. Recently, a novel clustering algorithm was used to examine digital PCR data, prior to downstream analysis. However, clustering of Real-Time qMSP data has not been used by laboratories developing DNA methylation biomarkers for the oral cancer screening and diagnostic space. We created a precision DNA methylation algorithm to quantify Differentially Methylated Promoters (DMPs) with Real-Time PCR instruments, combined with machine learning, for discovery and validation of head and neck squamous cell carcinoma (HNSCC) early detection, diagnosis, and prognostication targets. Analytic validation of PAX1, PAX5, ZIC4, PLCB1, and HHIP was performed to develop a qMSP protocol for clinical samples. The performance of the six singleplex reactions was tested in 307 oral cancer tissue and 55 normal uvulopalatopharyngoplasty (UPPP) samples from a mixed ancestry cohort (40% Black) obtained from the Johns Hopkins School of Medicine Head and Neck Cancer Tumor Bank. An R script for automated analysis of qMSP data was developed to import, process, and analyze multiple qMSP raw data files exported from Applied Biosystems SDS or DA2 software packages. The workflow includes data preprocessing; filtering by quality control metrics, such as CT and PCR efficiency; normalizing against a control gene (Bactin), and visualizing results through boxplots. A precision DNA methylation algorithm was then developed to perform unsupervised hierarchical clustering of singleplex qMSP observations for five genes, center the data, calculate the distance between all samples, determine the variance explained by each Principal Component (PC), set a cutoff DNA methylation value that maximizes performance for each gene and identify the best model fit. Logistic regression, Linear Discriminant Analysis, Loess, K nearest neighbor, and Random Forrest models, as well as an ensemble of all five models were then trained to model the relationship between test samples and PAX1, PAX5, ZIC4, PLCB1, HHIP DMPs. Model performance was compared based on accuracy and logistic regression was used for downstream analyses. The discriminatory ability of the five genes was evaluated using the Receiver Operator Characteristic (ROC) curve and Area Under the Curve (AUC) analyses. The best performance was obtained when using all five genes (PAX1, PAX5, ZIC4, PLCB1, HHIP): 94% Sensitivity, 96% Specificity, 97% Positive Predictive Value, 91% Negative Predictive Value, correctly classifying 95% with an AUC = 0.99. We also found HHIP fully discriminated between normal and tumor samples in a smaller subset of saliva samples (n=73). The normalized tissue-saliva Inter Quartile Range (IQR) ratio of HHIP DNA methylation was 98%. These results warrant to be validated in a larger cohort. RealTime PCR based tests have shown to be cost effective and scalable. The exceptional discriminatory power between normal and cancer taken into the context of post COVID excess installed Real Time PCR instruments, hold a promise of improving oral cancer early detection and diagnostic pipelines worldwide. Citation Format: Ashley Ramos-Lopez, Amanda Garcia Negron, Guie Beeu Guerrero Hunt, Adhi Guerrero-Thillet, Carolina Zambrano Rabanal, Paola Quiñonez Mendez, Andrea Lopez-Marrero, Alvaro Gutierrez, Fernando Zamuner, Bruce J. Trock, Wayne Koch, Mariana Brait, David Sidransky, Rafael Guerrero-Preston. Precision oral cancer screening and diagnostic solution using DNA methylation and machine learning to stratify high-risk lesions in saliva from patients of mixed ancestry [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2025; Part 2 (Late-Breaking, Clinical Trial, and Invited s); 2025 Apr 25-30; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2025;85(8_Suppl_2): nr LB410.
期刊介绍:
Cancer Research, published by the American Association for Cancer Research (AACR), is a journal that focuses on impactful original studies, reviews, and opinion pieces relevant to the broad cancer research community. Manuscripts that present conceptual or technological advances leading to insights into cancer biology are particularly sought after. The journal also places emphasis on convergence science, which involves bridging multiple distinct areas of cancer research.
With primary subsections including Cancer Biology, Cancer Immunology, Cancer Metabolism and Molecular Mechanisms, Translational Cancer Biology, Cancer Landscapes, and Convergence Science, Cancer Research has a comprehensive scope. It is published twice a month and has one volume per year, with a print ISSN of 0008-5472 and an online ISSN of 1538-7445.
Cancer Research is abstracted and/or indexed in various databases and platforms, including BIOSIS Previews (R) Database, MEDLINE, Current Contents/Life Sciences, Current Contents/Clinical Medicine, Science Citation Index, Scopus, and Web of Science.