Christopher Nielsen;Matthias Wilms;Nils D. Forkert
{"title":"A Novel Foundation Model-Based Framework for Multimodal Retinal Age Prediction","authors":"Christopher Nielsen;Matthias Wilms;Nils D. Forkert","doi":"10.1109/JTEHM.2025.3576596","DOIUrl":null,"url":null,"abstract":"The retinal age gap (RAG; the difference between the retina’s biological and chronological age) has recently gained increased attention as a potential image-based, non-invasive, and accessible biomarker for a broad spectrum of ocular and non-ocular diseases. Traditionally, machine learning predictions of biological retinal age utilize convolutional neural network (CNN) architectures and data from color fundus photography (CFP). Despite being previously unexplored, the multimodal fusion of two-dimensional CFP with three-dimensional optical coherence tomography (OCT) data has significant potential to enhance retinal age prediction accuracy and the diagnostic utility of the RAG biomarker. Therefore, this work presents a novel foundation model-based framework for multimodal retinal age prediction. Technology or Method: Feature representations from CFP and OCT images were extracted using RETFound, a powerful foundation model for retinal image analysis. These representations were then combined using an innovative fusion strategy to train a lightweight linear regression head model for predicting retinal age. Training and evaluation of the developed multimodal retinal age prediction model was achieved using retinal images from over 80,000 participants in the UK Biobank. Results: The developed multimodal model sets a new benchmark in retinal age prediction (mean absolute error of 2.75 years), outperforming traditional CNN and single-modality approaches. Additionally, multimodal RAG values demonstrated superior performance in classifying patients with diabetes mellitus type 1, multiple sclerosis, and chronic kidney disease, highlighting the clinical relevance of the proposed multimodal approach for non-ocular disease detection. Conclusions: This work demonstrates that multimodal fusion of CFP and OCT significantly improves retinal age prediction and subsequent RAG-based analyses. By leveraging foundation models and multimodal retinal imaging, the proposed approach enhances disease classification accuracy and demonstrates the potential of integrating the RAG into clinical workflows as a scalable, non-invasive screening tool. Significance: The findings underscore the potential of multimodal retinal imaging to transform RAG into a clinically relevant and highly accessible biomarker for disease detection.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"13 ","pages":"299-309"},"PeriodicalIF":3.7000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11023594","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11023594/","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The retinal age gap (RAG; the difference between the retina’s biological and chronological age) has recently gained increased attention as a potential image-based, non-invasive, and accessible biomarker for a broad spectrum of ocular and non-ocular diseases. Traditionally, machine learning predictions of biological retinal age utilize convolutional neural network (CNN) architectures and data from color fundus photography (CFP). Despite being previously unexplored, the multimodal fusion of two-dimensional CFP with three-dimensional optical coherence tomography (OCT) data has significant potential to enhance retinal age prediction accuracy and the diagnostic utility of the RAG biomarker. Therefore, this work presents a novel foundation model-based framework for multimodal retinal age prediction. Technology or Method: Feature representations from CFP and OCT images were extracted using RETFound, a powerful foundation model for retinal image analysis. These representations were then combined using an innovative fusion strategy to train a lightweight linear regression head model for predicting retinal age. Training and evaluation of the developed multimodal retinal age prediction model was achieved using retinal images from over 80,000 participants in the UK Biobank. Results: The developed multimodal model sets a new benchmark in retinal age prediction (mean absolute error of 2.75 years), outperforming traditional CNN and single-modality approaches. Additionally, multimodal RAG values demonstrated superior performance in classifying patients with diabetes mellitus type 1, multiple sclerosis, and chronic kidney disease, highlighting the clinical relevance of the proposed multimodal approach for non-ocular disease detection. Conclusions: This work demonstrates that multimodal fusion of CFP and OCT significantly improves retinal age prediction and subsequent RAG-based analyses. By leveraging foundation models and multimodal retinal imaging, the proposed approach enhances disease classification accuracy and demonstrates the potential of integrating the RAG into clinical workflows as a scalable, non-invasive screening tool. Significance: The findings underscore the potential of multimodal retinal imaging to transform RAG into a clinically relevant and highly accessible biomarker for disease detection.
期刊介绍:
The IEEE Journal of Translational Engineering in Health and Medicine is an open access product that bridges the engineering and clinical worlds, focusing on detailed descriptions of advanced technical solutions to a clinical need along with clinical results and healthcare relevance. The journal provides a platform for state-of-the-art technology directions in the interdisciplinary field of biomedical engineering, embracing engineering, life sciences and medicine. A unique aspect of the journal is its ability to foster a collaboration between physicians and engineers for presenting broad and compelling real world technological and engineering solutions that can be implemented in the interest of improving quality of patient care and treatment outcomes, thereby reducing costs and improving efficiency. The journal provides an active forum for clinical research and relevant state-of the-art technology for members of all the IEEE societies that have an interest in biomedical engineering as well as reaching out directly to physicians and the medical community through the American Medical Association (AMA) and other clinical societies. The scope of the journal includes, but is not limited, to topics on: Medical devices, healthcare delivery systems, global healthcare initiatives, and ICT based services; Technological relevance to healthcare cost reduction; Technology affecting healthcare management, decision-making, and policy; Advanced technical work that is applied to solving specific clinical needs.