Shaopan Wang, Xin He, Zhongquan Jian, Jie Li, Changsheng Xu, Yuguang Chen, Yuwen Liu, Han Chen, Caihong Huang, Jiaoyue Hu, Zuguo Liu
Advances and prospects of multi-modal ophthalmic artificial intelligence based on deep learning: a review.
Eye and Vision 11(1):38. Published 2024-10-01. DOI: https://doi.org/10.1186/s40662-024-00405-1
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11443922/pdf/
Citations: 0
Abstract
Background: In recent years, ophthalmology has emerged as a new frontier in medical artificial intelligence (AI), with multi-modal AI in ophthalmology garnering significant attention across interdisciplinary research. Integrating various data types and models is important because it provides detailed and precise information for diagnosing eye and vision diseases. By leveraging multi-modal ophthalmic AI techniques, clinicians can enhance the accuracy and efficiency of diagnoses, reducing the risks of misdiagnosis and oversight while also enabling more precise management of eye and vision health. However, the widespread adoption of multi-modal ophthalmic AI poses significant challenges.
Main text: In this review, we first comprehensively summarize the concept of modalities in ophthalmology, the forms of fusion between modalities, and the progress of multi-modal ophthalmic AI technology. Finally, we discuss the challenges of current multi-modal AI applications in ophthalmology and feasible future research directions.
Conclusion: In ophthalmic AI, evidence suggests that deep learning-based multi-modal AI technology, when applied to multi-modal data, shows excellent efficacy in assisting the diagnosis of various ophthalmic diseases. In particular, in the current era marked by the proliferation of large-scale models, multi-modal techniques represent the most promising and advantageous approach to diagnosing various ophthalmic diseases from a comprehensive perspective. However, numerous challenges remain before multi-modal techniques in ophthalmic AI can be effectively employed in the clinical setting.
About the journal:
Eye and Vision is an open access, peer-reviewed journal for ophthalmologists and visual science specialists. It welcomes research articles, reviews, methodologies, commentaries, case reports, perspectives and short reports encompassing all aspects of eye and vision. Topics of interest include but are not limited to: current developments of theoretical, experimental and clinical investigations in ophthalmology, optometry and vision science which focus on novel and high-impact findings on central issues pertaining to biology, pathophysiology and etiology of eye diseases as well as advances in diagnostic techniques, surgical treatment, instrument updates, the latest drug findings, results of clinical trials and research findings. It aims to provide ophthalmologists and visual science specialists with the latest developments in theoretical, experimental and clinical investigations in eye and vision.