预测甲状腺乳头状癌淋巴结转移的特征选择和分类技术

IF 2 4区 化学 Q3 CHEMISTRY, MULTIDISCIPLINARY
Dan Wu, Zhuang Yan, Guoliang Liao, Lin Han, Ke Chen, Cheng Li, Zhan Hua, Jiangli Lin
{"title":"预测甲状腺乳头状癌淋巴结转移的特征选择和分类技术","authors":"Dan Wu, Zhuang Yan, Guoliang Liao, Lin Han, Ke Chen, Cheng Li, Zhan Hua, Jiangli Lin","doi":"10.1142/s2737416524400064","DOIUrl":null,"url":null,"abstract":"Papillary thyroid carcinoma (PTC) is typically an indolent cancer, yet a minority of cases develop lymph node metastasis. Due to the unclear mechanisms of lymph node metastasis, a considerable number of patients undergo unnecessary surgeries. Currently, the identification of key genetic biomarkers in high-dimensional data presents a significant challenge, thereby limiting research progress in this area. Here, we proposed a hybrid filter-wrapper feature selection strategy for core factor detection and developed MethyAE, a metastasis prediction model based on DNA methylation, utilizing an end-to-end learning auto-encoder. 46 methylated CpG sites were successfully identified as crucial biomarkers for lymph node metastasis. Leveraging 447 PTC samples from the Cancer Genome Atlas (221 with metastasis, 226 without), the MethyAE model achieves 88.9% accuracy and a recall rate of 88.6% in predicting lymph node metastasis, outperforming commonly used machine learning methods like logistic regression and random forest. Furthermore, the MethyAE model exhibits favorable performance in DNA methylation data from colon cancer, bladder cancer, and breast cancer. To the best of our knowledge, this is the first attempt to predict PTC lymph node metastasis through DNA methylation, offering pivotal decision-making criteria for avoiding unnecessary surgeries and selecting appropriate treatment plans for a substantial cohort of PTC patients.","PeriodicalId":15603,"journal":{"name":"Journal of Computational Biophysics and Chemistry","volume":null,"pages":null},"PeriodicalIF":2.0000,"publicationDate":"2024-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Feature Selection and Classification Technique for Predicting Lymph Node Metastasis of Papillary Thyroid Carcinoma\",\"authors\":\"Dan Wu, Zhuang Yan, Guoliang Liao, Lin Han, Ke Chen, Cheng Li, Zhan Hua, Jiangli Lin\",\"doi\":\"10.1142/s2737416524400064\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Papillary thyroid carcinoma (PTC) is typically an indolent cancer, yet a minority of cases develop lymph node metastasis. Due to the unclear mechanisms of lymph node metastasis, a considerable number of patients undergo unnecessary surgeries. Currently, the identification of key genetic biomarkers in high-dimensional data presents a significant challenge, thereby limiting research progress in this area. Here, we proposed a hybrid filter-wrapper feature selection strategy for core factor detection and developed MethyAE, a metastasis prediction model based on DNA methylation, utilizing an end-to-end learning auto-encoder. 46 methylated CpG sites were successfully identified as crucial biomarkers for lymph node metastasis. Leveraging 447 PTC samples from the Cancer Genome Atlas (221 with metastasis, 226 without), the MethyAE model achieves 88.9% accuracy and a recall rate of 88.6% in predicting lymph node metastasis, outperforming commonly used machine learning methods like logistic regression and random forest. Furthermore, the MethyAE model exhibits favorable performance in DNA methylation data from colon cancer, bladder cancer, and breast cancer. To the best of our knowledge, this is the first attempt to predict PTC lymph node metastasis through DNA methylation, offering pivotal decision-making criteria for avoiding unnecessary surgeries and selecting appropriate treatment plans for a substantial cohort of PTC patients.\",\"PeriodicalId\":15603,\"journal\":{\"name\":\"Journal of Computational Biophysics and Chemistry\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2024-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computational Biophysics and Chemistry\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1142/s2737416524400064\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Biophysics and Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s2737416524400064","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

甲状腺乳头状癌(PTC)是一种典型的隐匿性癌症,但少数病例会发生淋巴结转移。由于淋巴结转移的机制尚不明确,相当多的患者接受了不必要的手术。目前,在高维数据中识别关键基因生物标志物是一项重大挑战,从而限制了这一领域的研究进展。在此,我们提出了一种用于核心因子检测的混合滤波器-包装器特征选择策略,并利用端到端学习自动编码器开发了基于DNA甲基化的转移预测模型MethyAE。46 个甲基化 CpG 位点被成功鉴定为淋巴结转移的关键生物标志物。利用癌症基因组图谱中的 447 个 PTC 样本(221 个有转移,226 个无转移),MethyAE 模型在预测淋巴结转移方面达到了 88.9% 的准确率和 88.6% 的召回率,优于逻辑回归和随机森林等常用机器学习方法。此外,MethyAE 模型在结肠癌、膀胱癌和乳腺癌的 DNA 甲基化数据中表现出良好的性能。据我们所知,这是首次尝试通过DNA甲基化预测PTC淋巴结转移,为大量PTC患者避免不必要的手术和选择合适的治疗方案提供了关键的决策标准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Feature Selection and Classification Technique for Predicting Lymph Node Metastasis of Papillary Thyroid Carcinoma
Papillary thyroid carcinoma (PTC) is typically an indolent cancer, yet a minority of cases develop lymph node metastasis. Due to the unclear mechanisms of lymph node metastasis, a considerable number of patients undergo unnecessary surgeries. Currently, the identification of key genetic biomarkers in high-dimensional data presents a significant challenge, thereby limiting research progress in this area. Here, we proposed a hybrid filter-wrapper feature selection strategy for core factor detection and developed MethyAE, a metastasis prediction model based on DNA methylation, utilizing an end-to-end learning auto-encoder. 46 methylated CpG sites were successfully identified as crucial biomarkers for lymph node metastasis. Leveraging 447 PTC samples from the Cancer Genome Atlas (221 with metastasis, 226 without), the MethyAE model achieves 88.9% accuracy and a recall rate of 88.6% in predicting lymph node metastasis, outperforming commonly used machine learning methods like logistic regression and random forest. Furthermore, the MethyAE model exhibits favorable performance in DNA methylation data from colon cancer, bladder cancer, and breast cancer. To the best of our knowledge, this is the first attempt to predict PTC lymph node metastasis through DNA methylation, offering pivotal decision-making criteria for avoiding unnecessary surgeries and selecting appropriate treatment plans for a substantial cohort of PTC patients.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.60
自引率
9.10%
发文量
62
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信