从宫颈刮痧样本中探索卵巢癌潜在的甲基化标志物。

IF 3.8 3区 医学 Q2 GENETICS & HEREDITY
Ju-Yin Lien, Lu Ann Hii, Po-Hsuan Su, Lin-Yu Chen, Kuo-Chang Wen, Hung-Cheng Lai, Yu-Chao Wang
{"title":"从宫颈刮痧样本中探索卵巢癌潜在的甲基化标志物。","authors":"Ju-Yin Lien, Lu Ann Hii, Po-Hsuan Su, Lin-Yu Chen, Kuo-Chang Wen, Hung-Cheng Lai, Yu-Chao Wang","doi":"10.1186/s40246-025-00763-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Ovarian cancer has the highest mortality rate among gynecological cancers, making early detection crucial, as the five-year survival rate drops from 92% with early-stage diagnosis compared to 31% with late-stage diagnosis. Current diagnostic methods such as histopathological examination and detection of cancer antigen 125 and human epididymis protein 4 biomarkers are either invasive or lack specificity and sensitivity. However, the Papanicolaou (Pap) test, which is widely used for cervical cancer screening, shows the potential for detecting ovarian cancer by identifying tumor DNA in cervical scrapings. Since aberrant DNA methylation patterns are linked to cancer progression, DNA methylation offers a promising avenue for early diagnosis. Therefore, this study aimed to develop a methylation-based machine-learning model to stratify patients with ovarian cancer from the cervical scraping samples collected via Pap test.</p><p><strong>Results: </strong>Cervical scrapings were collected by gynecologists using conventional Pap smears. In total, 160 samples were collected: 95 normal, 37 benign, and 28 malignant. Methylation data were generated using the Illumina Infinium MethylationEPIC BeadChip array, which contains approximately 850,000 CpG loci. Methylation data were initially divided into training and testing sets in a 3:1 ratio comprising 120 and 40 samples, respectively. A two-step methylation-based model was trained using the training data for classification: a principal component analysis (PCA) model, consisting of 30 features, to classify samples as normal or tumor; then a gradient boosting model, containing 16 features, to further stratify tumor samples as benign or malignant. The two-step model achieved an accuracy of 0.88 and an F1-score of 0.86 on the testing data. Furthermore, an over-representation analysis was conducted to explore the functions associated with genes mapped from differentially methylated positions (DMPs) in comparisons between normal and tumor samples, as well as between benign and malignant samples. These results suggest that DMPs may be associated with olfactory transduction when comparing normal versus tumor samples, and immune regulation when comparing benign and malignant samples.</p><p><strong>Conclusions: </strong>Our two-step model shows promise for predicting ovarian cancer and suggests that cervical scrapings may be a viable alternative for sample collection during screening.</p>","PeriodicalId":13183,"journal":{"name":"Human Genomics","volume":"19 1","pages":"56"},"PeriodicalIF":3.8000,"publicationDate":"2025-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12085859/pdf/","citationCount":"0","resultStr":"{\"title\":\"Exploring potential methylation markers for ovarian cancer from cervical scraping samples.\",\"authors\":\"Ju-Yin Lien, Lu Ann Hii, Po-Hsuan Su, Lin-Yu Chen, Kuo-Chang Wen, Hung-Cheng Lai, Yu-Chao Wang\",\"doi\":\"10.1186/s40246-025-00763-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Ovarian cancer has the highest mortality rate among gynecological cancers, making early detection crucial, as the five-year survival rate drops from 92% with early-stage diagnosis compared to 31% with late-stage diagnosis. Current diagnostic methods such as histopathological examination and detection of cancer antigen 125 and human epididymis protein 4 biomarkers are either invasive or lack specificity and sensitivity. However, the Papanicolaou (Pap) test, which is widely used for cervical cancer screening, shows the potential for detecting ovarian cancer by identifying tumor DNA in cervical scrapings. Since aberrant DNA methylation patterns are linked to cancer progression, DNA methylation offers a promising avenue for early diagnosis. Therefore, this study aimed to develop a methylation-based machine-learning model to stratify patients with ovarian cancer from the cervical scraping samples collected via Pap test.</p><p><strong>Results: </strong>Cervical scrapings were collected by gynecologists using conventional Pap smears. In total, 160 samples were collected: 95 normal, 37 benign, and 28 malignant. Methylation data were generated using the Illumina Infinium MethylationEPIC BeadChip array, which contains approximately 850,000 CpG loci. Methylation data were initially divided into training and testing sets in a 3:1 ratio comprising 120 and 40 samples, respectively. A two-step methylation-based model was trained using the training data for classification: a principal component analysis (PCA) model, consisting of 30 features, to classify samples as normal or tumor; then a gradient boosting model, containing 16 features, to further stratify tumor samples as benign or malignant. The two-step model achieved an accuracy of 0.88 and an F1-score of 0.86 on the testing data. Furthermore, an over-representation analysis was conducted to explore the functions associated with genes mapped from differentially methylated positions (DMPs) in comparisons between normal and tumor samples, as well as between benign and malignant samples. These results suggest that DMPs may be associated with olfactory transduction when comparing normal versus tumor samples, and immune regulation when comparing benign and malignant samples.</p><p><strong>Conclusions: </strong>Our two-step model shows promise for predicting ovarian cancer and suggests that cervical scrapings may be a viable alternative for sample collection during screening.</p>\",\"PeriodicalId\":13183,\"journal\":{\"name\":\"Human Genomics\",\"volume\":\"19 1\",\"pages\":\"56\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-05-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12085859/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Human Genomics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s40246-025-00763-4\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Genomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40246-025-00763-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

背景:卵巢癌在妇科癌症中死亡率最高,因此早期检测至关重要,因为早期诊断的五年生存率从92%下降到晚期诊断的31%。目前的诊断方法,如组织病理学检查和检测癌症抗原125和人附睾蛋白4生物标志物,要么是侵入性的,要么缺乏特异性和敏感性。然而,广泛用于宫颈癌筛查的巴氏试验(Pap)显示出通过识别宫颈刮痕中的肿瘤DNA来检测卵巢癌的潜力。由于异常的DNA甲基化模式与癌症进展有关,DNA甲基化为早期诊断提供了一个有希望的途径。因此,本研究旨在开发一种基于甲基化的机器学习模型,从通过巴氏试验收集的宫颈刮痧样本中对卵巢癌患者进行分层。结果:妇科医生使用常规巴氏涂片收集宫颈刮痕。共收集了160个样本:正常95个,良性37个,恶性28个。甲基化数据使用Illumina Infinium MethylationEPIC BeadChip阵列生成,该阵列包含约85万个CpG位点。甲基化数据最初以3:1的比例分为训练集和测试集,分别包含120和40个样本。使用训练数据训练基于两步甲基化的模型进行分类:主成分分析(PCA)模型,由30个特征组成,用于将样本分类为正常或肿瘤;然后建立包含16个特征的梯度增强模型,进一步将肿瘤样本划分为良性或恶性。两步模型在测试数据上的准确率为0.88,f1得分为0.86。此外,在正常和肿瘤样本之间以及良性和恶性样本之间的比较中,进行了过度代表性分析,以探索从差异甲基化位置(dmp)定位的基因相关功能。这些结果表明,在比较正常和肿瘤样本时,dmp可能与嗅觉转导有关,在比较良性和恶性样本时,dmp可能与免疫调节有关。结论:我们的两步模型显示了预测卵巢癌的希望,并表明宫颈刮痕可能是筛查期间样本收集的可行替代方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Exploring potential methylation markers for ovarian cancer from cervical scraping samples.

Background: Ovarian cancer has the highest mortality rate among gynecological cancers, making early detection crucial, as the five-year survival rate drops from 92% with early-stage diagnosis compared to 31% with late-stage diagnosis. Current diagnostic methods such as histopathological examination and detection of cancer antigen 125 and human epididymis protein 4 biomarkers are either invasive or lack specificity and sensitivity. However, the Papanicolaou (Pap) test, which is widely used for cervical cancer screening, shows the potential for detecting ovarian cancer by identifying tumor DNA in cervical scrapings. Since aberrant DNA methylation patterns are linked to cancer progression, DNA methylation offers a promising avenue for early diagnosis. Therefore, this study aimed to develop a methylation-based machine-learning model to stratify patients with ovarian cancer from the cervical scraping samples collected via Pap test.

Results: Cervical scrapings were collected by gynecologists using conventional Pap smears. In total, 160 samples were collected: 95 normal, 37 benign, and 28 malignant. Methylation data were generated using the Illumina Infinium MethylationEPIC BeadChip array, which contains approximately 850,000 CpG loci. Methylation data were initially divided into training and testing sets in a 3:1 ratio comprising 120 and 40 samples, respectively. A two-step methylation-based model was trained using the training data for classification: a principal component analysis (PCA) model, consisting of 30 features, to classify samples as normal or tumor; then a gradient boosting model, containing 16 features, to further stratify tumor samples as benign or malignant. The two-step model achieved an accuracy of 0.88 and an F1-score of 0.86 on the testing data. Furthermore, an over-representation analysis was conducted to explore the functions associated with genes mapped from differentially methylated positions (DMPs) in comparisons between normal and tumor samples, as well as between benign and malignant samples. These results suggest that DMPs may be associated with olfactory transduction when comparing normal versus tumor samples, and immune regulation when comparing benign and malignant samples.

Conclusions: Our two-step model shows promise for predicting ovarian cancer and suggests that cervical scrapings may be a viable alternative for sample collection during screening.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Human Genomics
Human Genomics GENETICS & HEREDITY-
CiteScore
6.00
自引率
2.20%
发文量
55
审稿时长
11 weeks
期刊介绍: Human Genomics is a peer-reviewed, open access, online journal that focuses on the application of genomic analysis in all aspects of human health and disease, as well as genomic analysis of drug efficacy and safety, and comparative genomics. Topics covered by the journal include, but are not limited to: pharmacogenomics, genome-wide association studies, genome-wide sequencing, exome sequencing, next-generation deep-sequencing, functional genomics, epigenomics, translational genomics, expression profiling, proteomics, bioinformatics, animal models, statistical genetics, genetic epidemiology, human population genetics and comparative genomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信