{"title":"Multimodal learning in gastrointestinal diseases","authors":"Luwen Zhang , Yubing Shen , Wentao Gu , Peng Wu","doi":"10.1016/j.gande.2025.10.001","DOIUrl":null,"url":null,"abstract":"<div><div>Gastrointestinal diseases represent a significant global health challenge, with current diagnostic and management approaches constrained by the integration of heterogeneous multimodal data. This review synthesizes the development prospects of multimodal applications in gastroenterology. By summarizing recent literatures, we demonstrate how multimodal integration can facilitate gastrointestinal disease screening, enable more accurate staging, support treatment decision-making, and optimize clinical workflows. Feature-level fusion serves as the dominant technique in current implementations, while hybrid approaches combining multiple fusion levels are increasingly adopted to enhance flexibility in complex clinical scenarios. Despite these advances, retrospective performance does not guarantee clinical success. Persistent challenges, including data heterogeneity, modality incompleteness, and barriers to clinical translation, remain to be addressed. Overall, this review underscores the transformative potential of multimodal learning to advance precision gastroenterology through integrated diagnostic and therapeutic.</div></div>","PeriodicalId":100571,"journal":{"name":"Gastroenterology & Endoscopy","volume":"3 4","pages":"Pages 251-258"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gastroenterology & Endoscopy","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949752325000809","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Gastrointestinal diseases represent a significant global health challenge, with current diagnostic and management approaches constrained by the integration of heterogeneous multimodal data. This review synthesizes the development prospects of multimodal applications in gastroenterology. By summarizing recent literatures, we demonstrate how multimodal integration can facilitate gastrointestinal disease screening, enable more accurate staging, support treatment decision-making, and optimize clinical workflows. Feature-level fusion serves as the dominant technique in current implementations, while hybrid approaches combining multiple fusion levels are increasingly adopted to enhance flexibility in complex clinical scenarios. Despite these advances, retrospective performance does not guarantee clinical success. Persistent challenges, including data heterogeneity, modality incompleteness, and barriers to clinical translation, remain to be addressed. Overall, this review underscores the transformative potential of multimodal learning to advance precision gastroenterology through integrated diagnostic and therapeutic.