A systematic evaluation of big data-driven colorectal cancer studies.

Q2 Medicine
Eslam Bani Mohammad, Muayyad Ahmad
{"title":"A systematic evaluation of big data-driven colorectal cancer studies.","authors":"Eslam Bani Mohammad, Muayyad Ahmad","doi":"10.17392/1684-23","DOIUrl":null,"url":null,"abstract":"<p><p>Aim To assess machine-learning models, their methodological quality, compare their performance, and highlight their limitations. Methods The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) recommendations were applied. Electronic databases Science Direct, MEDLINE through (PubMed, Google Scholar), EBSCO, ERIC, and CINAHL were searched for the period of January 2016 to September 2023. Using a pre-designed data extraction sheet, the review data were extracted. Big data, risk assessment, colorectal cancer, and artificial intelligence were the main terms. Results Fifteen studies were included. A total of 3,057,329 colorectal cancer (CRC) health records, including those of adult patients older than 18, were used to generate the results. The curve's area under the curve ranged from 0.704 to 0.976. Logistic regression, random forests, and colon flag were often employed techniques. Overall, these trials provide a considerable and accurate CRC risk prediction. Conclusion An up-to-date summary of recent research on the use of big data in CRC prediction was given. Future research can be facilitated by the review's identification of gaps in the literature. Missing data, a lack of external validation, and the diversity of machine learning algorithms are the current obstacles. Despite having a sound mathematical definition, area under the curve application depends on the modelling context.</p>","PeriodicalId":51129,"journal":{"name":"Medicinski Glasnik","volume":"21 1","pages":"63-77"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medicinski Glasnik","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17392/1684-23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Aim To assess machine-learning models, their methodological quality, compare their performance, and highlight their limitations. Methods The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) recommendations were applied. Electronic databases Science Direct, MEDLINE through (PubMed, Google Scholar), EBSCO, ERIC, and CINAHL were searched for the period of January 2016 to September 2023. Using a pre-designed data extraction sheet, the review data were extracted. Big data, risk assessment, colorectal cancer, and artificial intelligence were the main terms. Results Fifteen studies were included. A total of 3,057,329 colorectal cancer (CRC) health records, including those of adult patients older than 18, were used to generate the results. The curve's area under the curve ranged from 0.704 to 0.976. Logistic regression, random forests, and colon flag were often employed techniques. Overall, these trials provide a considerable and accurate CRC risk prediction. Conclusion An up-to-date summary of recent research on the use of big data in CRC prediction was given. Future research can be facilitated by the review's identification of gaps in the literature. Missing data, a lack of external validation, and the diversity of machine learning algorithms are the current obstacles. Despite having a sound mathematical definition, area under the curve application depends on the modelling context.

对大数据驱动的结直肠癌研究进行系统评估。
目的 评估机器学习模型及其方法学质量,比较其性能并强调其局限性。方法 采用系统综述和元分析首选报告项目(PRISMA)建议。在 2016 年 1 月至 2023 年 9 月期间检索了 Science Direct、MEDLINE through(PubMed、Google Scholar)、EBSCO、ERIC 和 CINAHL 等电子数据库。使用预先设计的数据提取表提取了综述数据。大数据、风险评估、结直肠癌和人工智能是主要术语。结果 共纳入 15 项研究。结果共使用了 3,057,329 份结直肠癌(CRC)健康记录,其中包括 18 岁以上成年患者的健康记录。曲线下面积介于 0.704 到 0.976 之间。逻辑回归、随机森林和结肠标志是经常采用的技术。总体而言,这些试验提供了相当准确的 CRC 风险预测。结论 本文对近期有关在 CRC 预测中使用大数据的研究进行了最新总结。综述指出了文献中存在的不足,这有助于今后的研究。数据缺失、缺乏外部验证以及机器学习算法的多样性是目前的障碍。尽管有一个合理的数学定义,但曲线下面积的应用取决于建模环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medicinski Glasnik
Medicinski Glasnik 医学-医学:内科
CiteScore
2.30
自引率
0.00%
发文量
0
审稿时长
6-12 weeks
期刊介绍: Medicinski Glasnik (MG) is the official publication (two times per year) of the Medical Association of Zenica-Doboj Canton. Manuscripts that present of original basic and applied research from all fields of medicine (general and clinical practice, and basic medical sciences) are invited.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信