New algorithm to predict colorectal cancer based on fecal volatile organic compounds profile

IF 6.3 2区 医学 Q1 BIOLOGY
Laura Ripoll , Hector Gisbert , Iván Rubio , David Guill-Berbegal , Antonio Canals , Rodrigo Jover , Lorena Vidal
{"title":"New algorithm to predict colorectal cancer based on fecal volatile organic compounds profile","authors":"Laura Ripoll ,&nbsp;Hector Gisbert ,&nbsp;Iván Rubio ,&nbsp;David Guill-Berbegal ,&nbsp;Antonio Canals ,&nbsp;Rodrigo Jover ,&nbsp;Lorena Vidal","doi":"10.1016/j.compbiomed.2025.111093","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, an algorithm designed for the analysis of fecal samples for colorectal cancer diagnostics, utilizing the data from the advanced technique of thermal-desorption-gas chromatography-mass spectrometry (TD-GC-MS), is constructed. The algorithm performs a comprehensive analysis across the entire spectral range to identify compound patterns for differentiating among three distinct health states: colorectal cancer, colorectal adenomas and controls with normal colonoscopy. The algorithm underwent a rigorous optimization process, resulting in a sensitivity and specificity of 100 %, effectively eliminating both false positives and false negatives. During the validation phase, the algorithm demonstrated remarkable performance, with sensitivity ranging from 74 % to 68 %, specificity ranging from 58 % to 52 %, and accuracy 66 %–62 % (range across twenty randomized train-test splits). Notably, in the context of polyp samples, the algorithm obtained a sensitivity range from 54 % to 50 %, even when trained with data from only healthy individuals (i.e., controls) and cancer patients. Moreover, a detailed table of compounds and their probabilities of occurrence in cancer, adenomas, and healthy samples is provided, offering insight into the interpretability of the algorithm. This qualitative approach signals a significant advancement in diagnostic precision and promises to enhance early detection of colorectal cancer, marking a substantial contribution to the field.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"197 ","pages":"Article 111093"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525014453","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

In this study, an algorithm designed for the analysis of fecal samples for colorectal cancer diagnostics, utilizing the data from the advanced technique of thermal-desorption-gas chromatography-mass spectrometry (TD-GC-MS), is constructed. The algorithm performs a comprehensive analysis across the entire spectral range to identify compound patterns for differentiating among three distinct health states: colorectal cancer, colorectal adenomas and controls with normal colonoscopy. The algorithm underwent a rigorous optimization process, resulting in a sensitivity and specificity of 100 %, effectively eliminating both false positives and false negatives. During the validation phase, the algorithm demonstrated remarkable performance, with sensitivity ranging from 74 % to 68 %, specificity ranging from 58 % to 52 %, and accuracy 66 %–62 % (range across twenty randomized train-test splits). Notably, in the context of polyp samples, the algorithm obtained a sensitivity range from 54 % to 50 %, even when trained with data from only healthy individuals (i.e., controls) and cancer patients. Moreover, a detailed table of compounds and their probabilities of occurrence in cancer, adenomas, and healthy samples is provided, offering insight into the interpretability of the algorithm. This qualitative approach signals a significant advancement in diagnostic precision and promises to enhance early detection of colorectal cancer, marking a substantial contribution to the field.
基于粪便挥发性有机化合物谱预测结直肠癌的新算法
本研究利用先进的热解吸-气相色谱-质谱(TD-GC-MS)技术,构建了一种用于大肠癌诊断的粪便样本分析算法。该算法在整个光谱范围内进行综合分析,以确定区分三种不同健康状态的复合模式:结直肠癌、结直肠腺瘤和正常结肠镜检查的对照组。该算法经过了严格的优化过程,灵敏度和特异性达到100%,有效地消除了假阳性和假阴性。在验证阶段,该算法表现出显着的性能,灵敏度范围为74%至68%,特异性范围为58%至52%,准确率为66%至62%(范围跨越20个随机训练测试分割)。值得注意的是,在息肉样本的背景下,即使仅使用健康个体(即对照组)和癌症患者的数据进行训练,该算法也获得了54%至50%的灵敏度范围。此外,还提供了化合物的详细表及其在癌症、腺瘤和健康样本中发生的概率,从而深入了解了该算法的可解释性。这种定性方法标志着诊断精度的重大进步,并有望加强结直肠癌的早期检测,标志着对该领域的重大贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信