Doeun Lee, Chanhee Lee, Kyulhee Han, Taewan Goo, Boram Kim, Youngmin Han, Wooil Kwon, Seungyeoun Lee, Jin-Young Jang, Taesung Park
{"title":"基于血清细胞外囊泡微生物组标记的胰腺癌诊断机器学习模型。","authors":"Doeun Lee, Chanhee Lee, Kyulhee Han, Taewan Goo, Boram Kim, Youngmin Han, Wooil Kwon, Seungyeoun Lee, Jin-Young Jang, Taesung Park","doi":"10.1038/s41598-025-94183-y","DOIUrl":null,"url":null,"abstract":"<p><p>Pancreatic cancer (PC) is a fatal disease with an extremely low 5-year survival rate, mainly because of its poor detection rate in early stages. Given emerging evidence of the relationship between microbiota composition and diseases, this study aims to identify microbiome markers linked to the diagnosis of pancreatic cancer. We utilized extracellular vesicles (EVs) data obtained from blood samples of 38 pancreatic cancer patients and 51 health controls. Least absolute shrinkage and selection operator (LASSO) and stepwise method were used to obtain some candidate markers in genus and phylum levels. These markers were used to develop various machine learning models including logistic regression (LR), random forest (RF), support vector machine (SVM), and Deep Neural Network (DNN) methods. In phylum level, DNN performed best with three markers (Verrucomicrobia, Actinobacteria and Proteobacteria) selected by stepwise method with the test AUC 0.959. In genus level, DNN using 11 markers selected by LASSO (Ruminococcaceae UCG-013, Ruminiclostridium, Propionibacterium, Lachnospiraceae NK4A136 group, Corynebacterium.1, Akkermansia, Mucispirillum, Pseudomonas, Diaphorobacter, Clostridium sensu stricto 1 and Turicibacter) outperformed others with 0.961 test AUCs. These results highlight the potential of microbiome markers and prediction models in clinical studies of PC diagnosis.</p>","PeriodicalId":21811,"journal":{"name":"Scientific Reports","volume":"15 1","pages":"10995"},"PeriodicalIF":3.9000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11958759/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning models for pancreatic cancer diagnosis based on microbiome markers from serum extracellular vesicles.\",\"authors\":\"Doeun Lee, Chanhee Lee, Kyulhee Han, Taewan Goo, Boram Kim, Youngmin Han, Wooil Kwon, Seungyeoun Lee, Jin-Young Jang, Taesung Park\",\"doi\":\"10.1038/s41598-025-94183-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Pancreatic cancer (PC) is a fatal disease with an extremely low 5-year survival rate, mainly because of its poor detection rate in early stages. Given emerging evidence of the relationship between microbiota composition and diseases, this study aims to identify microbiome markers linked to the diagnosis of pancreatic cancer. We utilized extracellular vesicles (EVs) data obtained from blood samples of 38 pancreatic cancer patients and 51 health controls. Least absolute shrinkage and selection operator (LASSO) and stepwise method were used to obtain some candidate markers in genus and phylum levels. These markers were used to develop various machine learning models including logistic regression (LR), random forest (RF), support vector machine (SVM), and Deep Neural Network (DNN) methods. In phylum level, DNN performed best with three markers (Verrucomicrobia, Actinobacteria and Proteobacteria) selected by stepwise method with the test AUC 0.959. In genus level, DNN using 11 markers selected by LASSO (Ruminococcaceae UCG-013, Ruminiclostridium, Propionibacterium, Lachnospiraceae NK4A136 group, Corynebacterium.1, Akkermansia, Mucispirillum, Pseudomonas, Diaphorobacter, Clostridium sensu stricto 1 and Turicibacter) outperformed others with 0.961 test AUCs. These results highlight the potential of microbiome markers and prediction models in clinical studies of PC diagnosis.</p>\",\"PeriodicalId\":21811,\"journal\":{\"name\":\"Scientific Reports\",\"volume\":\"15 1\",\"pages\":\"10995\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11958759/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Reports\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41598-025-94183-y\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Reports","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41598-025-94183-y","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
胰腺癌(PC)是一种致命的疾病,其5年生存率极低,主要原因是其早期检出率低。鉴于微生物群组成与疾病之间关系的新证据,本研究旨在确定与胰腺癌诊断相关的微生物组标记物。我们利用了38名胰腺癌患者和51名健康对照者的血液样本中获得的细胞外囊泡(EVs)数据。采用最小绝对收缩和选择算子(LASSO)和逐步法获得了一些属和门水平的候选标记。这些标记被用于开发各种机器学习模型,包括逻辑回归(LR)、随机森林(RF)、支持向量机(SVM)和深度神经网络(DNN)方法。在门水平上,逐级筛选的3个标记(Verrucomicrobia、Actinobacteria和Proteobacteria)在DNN上表现最佳,检验AUC为0.959。在属水平上,使用LASSO选择的11个标记(Ruminococcaceae UCG-013、Ruminiclostridium、Propionibacterium、Lachnospiraceae NK4A136组、corynebacterum1、Akkermansia、Mucispirillum、Pseudomonas、Diaphorobacter、Clostridium sensu stricto 1和Turicibacter)的DNN以0.961的检验auc优于其他标记。这些结果突出了微生物组标记物和预测模型在PC诊断临床研究中的潜力。
Machine learning models for pancreatic cancer diagnosis based on microbiome markers from serum extracellular vesicles.
Pancreatic cancer (PC) is a fatal disease with an extremely low 5-year survival rate, mainly because of its poor detection rate in early stages. Given emerging evidence of the relationship between microbiota composition and diseases, this study aims to identify microbiome markers linked to the diagnosis of pancreatic cancer. We utilized extracellular vesicles (EVs) data obtained from blood samples of 38 pancreatic cancer patients and 51 health controls. Least absolute shrinkage and selection operator (LASSO) and stepwise method were used to obtain some candidate markers in genus and phylum levels. These markers were used to develop various machine learning models including logistic regression (LR), random forest (RF), support vector machine (SVM), and Deep Neural Network (DNN) methods. In phylum level, DNN performed best with three markers (Verrucomicrobia, Actinobacteria and Proteobacteria) selected by stepwise method with the test AUC 0.959. In genus level, DNN using 11 markers selected by LASSO (Ruminococcaceae UCG-013, Ruminiclostridium, Propionibacterium, Lachnospiraceae NK4A136 group, Corynebacterium.1, Akkermansia, Mucispirillum, Pseudomonas, Diaphorobacter, Clostridium sensu stricto 1 and Turicibacter) outperformed others with 0.961 test AUCs. These results highlight the potential of microbiome markers and prediction models in clinical studies of PC diagnosis.
期刊介绍:
We publish original research from all areas of the natural sciences, psychology, medicine and engineering. You can learn more about what we publish by browsing our specific scientific subject areas below or explore Scientific Reports by browsing all articles and collections.
Scientific Reports has a 2-year impact factor: 4.380 (2021), and is the 6th most-cited journal in the world, with more than 540,000 citations in 2020 (Clarivate Analytics, 2021).
•Engineering
Engineering covers all aspects of engineering, technology, and applied science. It plays a crucial role in the development of technologies to address some of the world''s biggest challenges, helping to save lives and improve the way we live.
•Physical sciences
Physical sciences are those academic disciplines that aim to uncover the underlying laws of nature — often written in the language of mathematics. It is a collective term for areas of study including astronomy, chemistry, materials science and physics.
•Earth and environmental sciences
Earth and environmental sciences cover all aspects of Earth and planetary science and broadly encompass solid Earth processes, surface and atmospheric dynamics, Earth system history, climate and climate change, marine and freshwater systems, and ecology. It also considers the interactions between humans and these systems.
•Biological sciences
Biological sciences encompass all the divisions of natural sciences examining various aspects of vital processes. The concept includes anatomy, physiology, cell biology, biochemistry and biophysics, and covers all organisms from microorganisms, animals to plants.
•Health sciences
The health sciences study health, disease and healthcare. This field of study aims to develop knowledge, interventions and technology for use in healthcare to improve the treatment of patients.