Akshaya Srikanth Bhagavathula, Ahmed Mourtada Al Qady, Wafa A Aldhaleei
{"title":"Diagnostic accuracy and quality of artificial intelligence models in irritable bowel syndrome: A systematic review.","authors":"Akshaya Srikanth Bhagavathula, Ahmed Mourtada Al Qady, Wafa A Aldhaleei","doi":"10.3748/wjg.v31.i23.106836","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Irritable bowel syndrome (IBS) affects approximately 9%-12% of the global population, presenting substantial diagnostic challenges due to symptom subjectivity and lack of definitive biomarkers.</p><p><strong>Aim: </strong>To systematically examine the diagnostic accuracy of artificial intelligence (AI) models applied to various biomarkers in IBS diagnosis.</p><p><strong>Methods: </strong>A comprehensive search of six databases identified 18053 articles published up to May 31, 2024. Following screening and eligibility criteria, six observational studies involving 1366 participants from the United Kingdom, China, and Japan were included. Risk of bias and reporting quality were assessed using quality assessment of diagnostic accuracy studies-2, prediction model risk of bias assessment tool-AI, and transparent reporting of a multivariable prediction model for individual prognosis or diagnosis-AI tools. Key metrics included sensitivity, specificity, accuracy, and area under the curve (AUC).</p><p><strong>Results: </strong>The included studies applied AI models such as random forests, support vector machines, and neural networks to biomarkers like fecal microbiome composition, gas chromatography data, neuroimaging features, and protease activity. Diagnostic accuracy ranged from 54% to 98% (AUC: 0.61-0.99). Models using fecal microbiome data achieved the highest performance, with one study reporting 98% sensitivity and specificity (AUC = 0.99). While most studies demonstrated high methodological quality, significant variability in datasets, biomarkers, and validation methods limited meta-analysis feasibility and generalizability.</p><p><strong>Conclusion: </strong>AI models show potential to improve IBS diagnostic accuracy by integrating complex biomarkers which will aid the development of algorithms to direct treatment strategies. However, methodological inconsistencies and limited population diversity underscore the need for standardized protocols and external validation to ensure clinical applicability.</p>","PeriodicalId":23778,"journal":{"name":"World Journal of Gastroenterology","volume":"31 23","pages":"106836"},"PeriodicalIF":4.3000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12188767/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3748/wjg.v31.i23.106836","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Irritable bowel syndrome (IBS) affects approximately 9%-12% of the global population, presenting substantial diagnostic challenges due to symptom subjectivity and lack of definitive biomarkers.
Aim: To systematically examine the diagnostic accuracy of artificial intelligence (AI) models applied to various biomarkers in IBS diagnosis.
Methods: A comprehensive search of six databases identified 18053 articles published up to May 31, 2024. Following screening and eligibility criteria, six observational studies involving 1366 participants from the United Kingdom, China, and Japan were included. Risk of bias and reporting quality were assessed using quality assessment of diagnostic accuracy studies-2, prediction model risk of bias assessment tool-AI, and transparent reporting of a multivariable prediction model for individual prognosis or diagnosis-AI tools. Key metrics included sensitivity, specificity, accuracy, and area under the curve (AUC).
Results: The included studies applied AI models such as random forests, support vector machines, and neural networks to biomarkers like fecal microbiome composition, gas chromatography data, neuroimaging features, and protease activity. Diagnostic accuracy ranged from 54% to 98% (AUC: 0.61-0.99). Models using fecal microbiome data achieved the highest performance, with one study reporting 98% sensitivity and specificity (AUC = 0.99). While most studies demonstrated high methodological quality, significant variability in datasets, biomarkers, and validation methods limited meta-analysis feasibility and generalizability.
Conclusion: AI models show potential to improve IBS diagnostic accuracy by integrating complex biomarkers which will aid the development of algorithms to direct treatment strategies. However, methodological inconsistencies and limited population diversity underscore the need for standardized protocols and external validation to ensure clinical applicability.
期刊介绍:
The primary aims of the WJG are to improve diagnostic, therapeutic and preventive modalities and the skills of clinicians and to guide clinical practice in gastroenterology and hepatology.