Development and validation of machine learning models for early diagnosis and prognosis of lung adenocarcinoma using miRNA expression profiles.

IF 2.2 4区 医学 Q3 ONCOLOGY
Cancer Biomarkers Pub Date : 2025-01-01 Epub Date: 2025-04-02 DOI:10.1177/18758592241308756
Lin Lin, Yongxia Bao
{"title":"Development and validation of machine learning models for early diagnosis and prognosis of lung adenocarcinoma using miRNA expression profiles.","authors":"Lin Lin, Yongxia Bao","doi":"10.1177/18758592241308756","DOIUrl":null,"url":null,"abstract":"<p><p>ObjectiveStudy aims to develop diagnostic and prognostic models for lung adenocarcinoma (LUAD) using Machine learning(ML)algorithms, aiming to enhance clinical decision-making accuracy.MethodsData from The Cancer Genome Atlas (TCGA) for LUAD patients were split into training (n = 196) and test sets (n = 133). Feature selection (Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest (RF), and Support Vector Machine (SVM)) identified miRNAs distinguishing stage I LUAD. Six ML algorithms predicted pulmonary node classification. Model performance was evaluated using Receiver Operating Characteristic (ROC) curve, Precision-Recall (PR) curves, and Error Rates (CE). A prognostic model was constructed using Lasso Cox regression. Risk score plots were generated, and model performance was assessed using Kaplan-Meier (K-M) and time-dependent ROC curves. Functional enrichment analyses investigated miRNA function and mechanism.ResultsThe feature selection results identified five miRNA molecules as distinguishing characteristics between early-stage LUAD and adjacent non-cancerous tissues. A prognostic model using 13 miRNAs predicted poorer outcomes for patients with higher risk scores, supported by time-dependent ROC curves and a nomogram. Functional enrichment analysis identified cancer-related signaling pathways for the biomarkers.ConclusionML identified a diagnostic five-miRNA signature and a prognostic 13-miRNA model for LUAD, both robust and reliable.</p>","PeriodicalId":56320,"journal":{"name":"Cancer Biomarkers","volume":"42 1","pages":"18758592241308756"},"PeriodicalIF":2.2000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Biomarkers","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/18758592241308756","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/2 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

ObjectiveStudy aims to develop diagnostic and prognostic models for lung adenocarcinoma (LUAD) using Machine learning(ML)algorithms, aiming to enhance clinical decision-making accuracy.MethodsData from The Cancer Genome Atlas (TCGA) for LUAD patients were split into training (n = 196) and test sets (n = 133). Feature selection (Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest (RF), and Support Vector Machine (SVM)) identified miRNAs distinguishing stage I LUAD. Six ML algorithms predicted pulmonary node classification. Model performance was evaluated using Receiver Operating Characteristic (ROC) curve, Precision-Recall (PR) curves, and Error Rates (CE). A prognostic model was constructed using Lasso Cox regression. Risk score plots were generated, and model performance was assessed using Kaplan-Meier (K-M) and time-dependent ROC curves. Functional enrichment analyses investigated miRNA function and mechanism.ResultsThe feature selection results identified five miRNA molecules as distinguishing characteristics between early-stage LUAD and adjacent non-cancerous tissues. A prognostic model using 13 miRNAs predicted poorer outcomes for patients with higher risk scores, supported by time-dependent ROC curves and a nomogram. Functional enrichment analysis identified cancer-related signaling pathways for the biomarkers.ConclusionML identified a diagnostic five-miRNA signature and a prognostic 13-miRNA model for LUAD, both robust and reliable.

利用miRNA表达谱开发和验证用于肺腺癌早期诊断和预后的机器学习模型。
目的利用机器学习(ML)算法建立肺腺癌(LUAD)的诊断和预后模型,提高临床决策的准确性。方法将LUAD患者的癌症基因组图谱(TCGA)数据分为训练组(n = 196)和测试组(n = 133)。特征选择(最小绝对收缩和选择算子(LASSO)、随机森林(RF)和支持向量机(SVM))识别出区分I期LUAD的mirna。6种ML算法预测肺淋巴结分类。采用受试者工作特征(ROC)曲线、精确召回率(PR)曲线和错误率(CE)来评估模型的性能。采用Lasso Cox回归建立预后模型。生成风险评分图,并使用Kaplan-Meier (K-M)和随时间变化的ROC曲线评估模型的性能。功能富集分析研究了miRNA的功能和机制。结果特征选择结果确定了5种miRNA分子作为早期LUAD与邻近非癌性组织的区别特征。使用13个mirna的预后模型预测高风险评分较高的患者预后较差,并得到时间依赖性ROC曲线和nomogram支持。功能富集分析确定了生物标志物的癌症相关信号通路。结论ml确定了LUAD的5 - mirna诊断特征和13-miRNA预后模型,稳健可靠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cancer Biomarkers
Cancer Biomarkers ONCOLOGY-
CiteScore
5.20
自引率
3.20%
发文量
195
审稿时长
3 months
期刊介绍: Concentrating on molecular biomarkers in cancer research, Cancer Biomarkers publishes original research findings (and reviews solicited by the editor) on the subject of the identification of markers associated with the disease processes whether or not they are an integral part of the pathological lesion. The disease markers may include, but are not limited to, genomic, epigenomic, proteomics, cellular and morphologic, and genetic factors predisposing to the disease or indicating the occurrence of the disease. Manuscripts on these factors or biomarkers, either in altered forms, abnormal concentrations or with abnormal tissue distribution leading to disease causation will be accepted.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信