Machine Learning-based Prediction of the Likelihood of Colorectal Cancer Using miRNA Expression

Q3 Multidisciplinary
Aamer Sultan, Aaron Austin de Asa, Tesah Mae Guimbangunan, Ezekiel Dmitri Serapio, Allan Fellizar, P. M. Albano, Rock Christian Tomas
{"title":"Machine Learning-based Prediction of the Likelihood of Colorectal Cancer Using miRNA Expression","authors":"Aamer Sultan, Aaron Austin de Asa, Tesah Mae Guimbangunan, Ezekiel Dmitri Serapio, Allan Fellizar, P. M. Albano, Rock Christian Tomas","doi":"10.56899/152.04.12","DOIUrl":null,"url":null,"abstract":"[Background] Colorectal cancer (CRC) comprises 10% of all cancer diagnoses, making it the third most diagnosed cancer globally. Despite its prevalence, most current methods for identifying CRC lack sensitivity and consistency while being invasive and costly. Thus, this study aimed to develop artificial neural network (ANN) models that could accurately detect CRC using miRNA expressions in tissue and plasma samples. [Methods] The study used miRNA expression profiles of formalin-fixed paraffin-embedded tissue and plasma samples obtained from CRC patients and healthy controls. ANNs were trained to discriminate between CRC patients from healthy controls using the relative expression of miR-21-5p, miR-196b-5p, miR- 135b-5p, miR-92a-3p, miR-29a-3p, and miR-197-3p in colorectal tissues and blood plasma. Multivariate logistic regression (MLR) and decision tree (DT) models were used to compare the performance of the ANN models. [Results] The ANNs achieved an accuracy of 98.5 and 88.2%, a sensitivity of 90.9 and 80.4%, a specificity of 92.6 and 84.7%, and an area under the ROC curve of 0.92 and 0.83 for the plasma and tissue samples, respectively. Moreover, sensitivity analysis of the ANN models showed that miR-135b-5p and miR-92a-3p had the greatest influence in distinguishing CRC from healthy plasma and malignant from neoplasm-free colorectal tissues, respectively. However, only miR-135b-5p was significantly downregulated in both CRC plasma and malignant colorectal tissue samples. Results from the MLR and DT models support the results from the ANN sensitivity analysis. [Conclusion] Our results show that the trained ANNs were able to accurately and confidently detect CRC using the considered six miRNA expression levels in colorectal tissue and plasma samples, providing an accurate, rapid, and less-invasive approach to diagnosing CRC.","PeriodicalId":39096,"journal":{"name":"Philippine Journal of Science","volume":"205 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philippine Journal of Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56899/152.04.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

Abstract

[Background] Colorectal cancer (CRC) comprises 10% of all cancer diagnoses, making it the third most diagnosed cancer globally. Despite its prevalence, most current methods for identifying CRC lack sensitivity and consistency while being invasive and costly. Thus, this study aimed to develop artificial neural network (ANN) models that could accurately detect CRC using miRNA expressions in tissue and plasma samples. [Methods] The study used miRNA expression profiles of formalin-fixed paraffin-embedded tissue and plasma samples obtained from CRC patients and healthy controls. ANNs were trained to discriminate between CRC patients from healthy controls using the relative expression of miR-21-5p, miR-196b-5p, miR- 135b-5p, miR-92a-3p, miR-29a-3p, and miR-197-3p in colorectal tissues and blood plasma. Multivariate logistic regression (MLR) and decision tree (DT) models were used to compare the performance of the ANN models. [Results] The ANNs achieved an accuracy of 98.5 and 88.2%, a sensitivity of 90.9 and 80.4%, a specificity of 92.6 and 84.7%, and an area under the ROC curve of 0.92 and 0.83 for the plasma and tissue samples, respectively. Moreover, sensitivity analysis of the ANN models showed that miR-135b-5p and miR-92a-3p had the greatest influence in distinguishing CRC from healthy plasma and malignant from neoplasm-free colorectal tissues, respectively. However, only miR-135b-5p was significantly downregulated in both CRC plasma and malignant colorectal tissue samples. Results from the MLR and DT models support the results from the ANN sensitivity analysis. [Conclusion] Our results show that the trained ANNs were able to accurately and confidently detect CRC using the considered six miRNA expression levels in colorectal tissue and plasma samples, providing an accurate, rapid, and less-invasive approach to diagnosing CRC.
基于机器学习的miRNA表达预测结直肠癌的可能性
【背景】结直肠癌(CRC)占所有癌症诊断的10%,是全球第三大确诊癌症。尽管它很普遍,但目前大多数识别结直肠癌的方法缺乏敏感性和一致性,而且是侵入性的和昂贵的。因此,本研究旨在建立人工神经网络(ANN)模型,利用组织和血浆样本中的miRNA表达准确检测结直肠癌。[方法]研究采用福尔马林固定石蜡包埋组织和CRC患者及健康对照血浆样本的miRNA表达谱。通过在结直肠组织和血浆中miR-21-5p、miR-196b-5p、miR- 135b-5p、miR-92a-3p、miR-29a-3p和miR-197-3p的相对表达,ann被训练来区分结直肠癌患者和健康对照组。使用多元逻辑回归(MLR)和决策树(DT)模型来比较人工神经网络模型的性能。[结果]人工神经网络对血浆和组织样本的准确率分别为98.5和88.2%,灵敏度分别为90.9和80.4%,特异性分别为92.6和84.7%,ROC曲线下面积分别为0.92和0.83。此外,ANN模型的敏感性分析显示,miR-135b-5p和miR-92a-3p分别在区分CRC与健康血浆和恶性肿瘤与无肿瘤结直肠组织方面具有最大的影响。然而,在结直肠癌血浆和恶性结直肠组织样本中,只有miR-135b-5p显著下调。MLR和DT模型的结果支持了人工神经网络灵敏度分析的结果。【结论】我们的研究结果表明,经过训练的人工神经网络能够准确、自信地利用结直肠组织和血浆样本中所考虑的六种miRNA表达水平检测结直肠癌,为结直肠癌的诊断提供了一种准确、快速、微创的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Philippine Journal of Science
Philippine Journal of Science Multidisciplinary-Multidisciplinary
CiteScore
1.20
自引率
0.00%
发文量
55
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信