鉴定甲状腺乳头状癌的潜在生物标记物。

IF 3 3区 医学 Q2 ENDOCRINOLOGY & METABOLISM
Sabire Kilicarslan, Meliha Merve Hiz-Cicekliyurt
{"title":"鉴定甲状腺乳头状癌的潜在生物标记物。","authors":"Sabire Kilicarslan, Meliha Merve Hiz-Cicekliyurt","doi":"10.1007/s12020-024-04068-9","DOIUrl":null,"url":null,"abstract":"<p><p>Papillary thyroid cancer (PTC) is the predominant form of malignant tumor affecting the thyroid gland.</p><p><strong>Aim: </strong>This study aimed to identify candidate biomarkers for papillary thyroid carcinoma using an integrative analysis of bioinformatics and machine learning (ML).</p><p><strong>Material and method: </strong>The PTC datasets GSE6004, GSE3467, and GSE33630 (species: Homo sapiens) were downloaded from NCBI and analyzed using the limma package to obtain DEGs. Once DEGs were identified, GO and KEGG enrichment analyses were performed as the first step in the bioinformatics process. Subsequently, a protein-protein interaction (PPI) network was constructed according to the common genes in bioinformatics and machine learning using STRING to elucidate the important genes involved in PTC pathogenesis. In machine learning, finding genes entails feature selection to identify the key genes that distinguish biological states. Hybrid feature selection will be used for this. In the second step, the original data sets were preprocessed to detect and correct missing and noisy data; after that, all data were merged. Following performing Linear and Discriminative Hybrid Feature Selection (LDHFS) on the processed dataset, machine learning algorithms such as Random Forest (RF), Naive Bayes (NB), and Support Vector Machines (SVM) are utilized.</p><p><strong>Results: </strong>Bioinformatics and machine learning analyses indicate that the genes RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5 are integral to the progression of thyroid cancer. This study attained the highest accuracy utilizing the RF algorithm, achieving an accuracy rate of 94.62%, a Kappa value of 91.36%, and an AUC value of 96.13%. These results offer additional evidence and confirmation for the genetic alterations of these genes. These findings may accelerate the development of prospective therapeutic and diagnostic methods in future research.</p><p><strong>Conclusions: </strong>Bioinformatics and machine learning techniques identified the common genes \"RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5\" as PTC biomarkers, providing novel reference markers for the diagnosis and treatment of PTC patients. The model is anticipated to possess significant predictive value and assist in the early diagnosis and screening of clinical PTC. These insights enhance the field of PTC management and offer guidance for future research.</p>","PeriodicalId":49211,"journal":{"name":"Endocrine","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identification of potential biomarkers of papillary thyroid carcinoma.\",\"authors\":\"Sabire Kilicarslan, Meliha Merve Hiz-Cicekliyurt\",\"doi\":\"10.1007/s12020-024-04068-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Papillary thyroid cancer (PTC) is the predominant form of malignant tumor affecting the thyroid gland.</p><p><strong>Aim: </strong>This study aimed to identify candidate biomarkers for papillary thyroid carcinoma using an integrative analysis of bioinformatics and machine learning (ML).</p><p><strong>Material and method: </strong>The PTC datasets GSE6004, GSE3467, and GSE33630 (species: Homo sapiens) were downloaded from NCBI and analyzed using the limma package to obtain DEGs. Once DEGs were identified, GO and KEGG enrichment analyses were performed as the first step in the bioinformatics process. Subsequently, a protein-protein interaction (PPI) network was constructed according to the common genes in bioinformatics and machine learning using STRING to elucidate the important genes involved in PTC pathogenesis. In machine learning, finding genes entails feature selection to identify the key genes that distinguish biological states. Hybrid feature selection will be used for this. In the second step, the original data sets were preprocessed to detect and correct missing and noisy data; after that, all data were merged. Following performing Linear and Discriminative Hybrid Feature Selection (LDHFS) on the processed dataset, machine learning algorithms such as Random Forest (RF), Naive Bayes (NB), and Support Vector Machines (SVM) are utilized.</p><p><strong>Results: </strong>Bioinformatics and machine learning analyses indicate that the genes RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5 are integral to the progression of thyroid cancer. This study attained the highest accuracy utilizing the RF algorithm, achieving an accuracy rate of 94.62%, a Kappa value of 91.36%, and an AUC value of 96.13%. These results offer additional evidence and confirmation for the genetic alterations of these genes. These findings may accelerate the development of prospective therapeutic and diagnostic methods in future research.</p><p><strong>Conclusions: </strong>Bioinformatics and machine learning techniques identified the common genes \\\"RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5\\\" as PTC biomarkers, providing novel reference markers for the diagnosis and treatment of PTC patients. The model is anticipated to possess significant predictive value and assist in the early diagnosis and screening of clinical PTC. These insights enhance the field of PTC management and offer guidance for future research.</p>\",\"PeriodicalId\":49211,\"journal\":{\"name\":\"Endocrine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Endocrine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s12020-024-04068-9\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Endocrine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12020-024-04068-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

摘要

甲状腺乳头状癌(PTC)是影响甲状腺的主要恶性肿瘤形式。目的:本研究旨在利用生物信息学和机器学习(ML)的综合分析方法确定甲状腺乳头状癌的候选生物标志物:从NCBI下载PTC数据集GSE6004、GSE3467和GSE33630(物种:智人),并使用limma软件包进行分析以获得DEGs。确定 DEGs 后,生物信息学流程的第一步是进行 GO 和 KEGG 富集分析。随后,根据生物信息学和机器学习中的常见基因,利用 STRING 构建了蛋白质-蛋白质相互作用(PPI)网络,以阐明参与 PTC 发病机制的重要基因。在机器学习中,寻找基因需要进行特征选择,以确定区分生物状态的关键基因。为此将使用混合特征选择。第二步,对原始数据集进行预处理,以检测和纠正缺失数据和噪声数据;然后,合并所有数据。在对处理后的数据集进行线性和判别混合特征选择(LDHFS)后,使用随机森林(RF)、奈夫贝叶斯(NB)和支持向量机(SVM)等机器学习算法:生物信息学和机器学习分析表明,RXRG、CDH2、ETV5、QPCT、LRP4、FN1 和 LPAR5 基因与甲状腺癌的进展密切相关。这项研究利用 RF 算法达到了最高的准确率,准确率为 94.62%,Kappa 值为 91.36%,AUC 值为 96.13%。这些结果为这些基因的遗传改变提供了更多的证据和确认。这些发现可能会加速未来研究中前瞻性治疗和诊断方法的开发:生物信息学和机器学习技术确定了 "RXRG、CDH2、ETV5、QPCT、LRP4、FN1 和 LPAR5 "等常见基因为 PTC 生物标记物,为 PTC 患者的诊断和治疗提供了新的参考标记。预计该模型将具有重要的预测价值,有助于临床 PTC 的早期诊断和筛查。这些见解将促进 PTC 管理领域的发展,并为未来的研究提供指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of potential biomarkers of papillary thyroid carcinoma.

Papillary thyroid cancer (PTC) is the predominant form of malignant tumor affecting the thyroid gland.

Aim: This study aimed to identify candidate biomarkers for papillary thyroid carcinoma using an integrative analysis of bioinformatics and machine learning (ML).

Material and method: The PTC datasets GSE6004, GSE3467, and GSE33630 (species: Homo sapiens) were downloaded from NCBI and analyzed using the limma package to obtain DEGs. Once DEGs were identified, GO and KEGG enrichment analyses were performed as the first step in the bioinformatics process. Subsequently, a protein-protein interaction (PPI) network was constructed according to the common genes in bioinformatics and machine learning using STRING to elucidate the important genes involved in PTC pathogenesis. In machine learning, finding genes entails feature selection to identify the key genes that distinguish biological states. Hybrid feature selection will be used for this. In the second step, the original data sets were preprocessed to detect and correct missing and noisy data; after that, all data were merged. Following performing Linear and Discriminative Hybrid Feature Selection (LDHFS) on the processed dataset, machine learning algorithms such as Random Forest (RF), Naive Bayes (NB), and Support Vector Machines (SVM) are utilized.

Results: Bioinformatics and machine learning analyses indicate that the genes RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5 are integral to the progression of thyroid cancer. This study attained the highest accuracy utilizing the RF algorithm, achieving an accuracy rate of 94.62%, a Kappa value of 91.36%, and an AUC value of 96.13%. These results offer additional evidence and confirmation for the genetic alterations of these genes. These findings may accelerate the development of prospective therapeutic and diagnostic methods in future research.

Conclusions: Bioinformatics and machine learning techniques identified the common genes "RXRG, CDH2, ETV5, QPCT, LRP4, FN1, and LPAR5" as PTC biomarkers, providing novel reference markers for the diagnosis and treatment of PTC patients. The model is anticipated to possess significant predictive value and assist in the early diagnosis and screening of clinical PTC. These insights enhance the field of PTC management and offer guidance for future research.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Endocrine
Endocrine ENDOCRINOLOGY & METABOLISM-
CiteScore
6.50
自引率
5.40%
发文量
295
审稿时长
1.5 months
期刊介绍: Well-established as a major journal in today’s rapidly advancing experimental and clinical research areas, Endocrine publishes original articles devoted to basic (including molecular, cellular and physiological studies), translational and clinical research in all the different fields of endocrinology and metabolism. Articles will be accepted based on peer-reviews, priority, and editorial decision. Invited reviews, mini-reviews and viewpoints on relevant pathophysiological and clinical topics, as well as Editorials on articles appearing in the Journal, are published. Unsolicited Editorials will be evaluated by the editorial team. Outcomes of scientific meetings, as well as guidelines and position statements, may be submitted. The Journal also considers special feature articles in the field of endocrine genetics and epigenetics, as well as articles devoted to novel methods and techniques in endocrinology. Endocrine covers controversial, clinical endocrine issues. Meta-analyses on endocrine and metabolic topics are also accepted. Descriptions of single clinical cases and/or small patients studies are not published unless of exceptional interest. However, reports of novel imaging studies and endocrine side effects in single patients may be considered. Research letters and letters to the editor related or unrelated to recently published articles can be submitted. Endocrine covers leading topics in endocrinology such as neuroendocrinology, pituitary and hypothalamic peptides, thyroid physiological and clinical aspects, bone and mineral metabolism and osteoporosis, obesity, lipid and energy metabolism and food intake control, insulin, Type 1 and Type 2 diabetes, hormones of male and female reproduction, adrenal diseases pediatric and geriatric endocrinology, endocrine hypertension and endocrine oncology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信