Awais Ahmed , Xiaoyang Zeng , Rui Xi , Mengshu Hou , Syed Attique Shah
{"title":"Enhancing multimodal medical image analysis with Slice-Fusion: A novel fusion approach to address modality imbalance","authors":"Awais Ahmed , Xiaoyang Zeng , Rui Xi , Mengshu Hou , Syed Attique Shah","doi":"10.1016/j.cmpb.2025.108615","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objective:</h3><div>In recent times, medical imaging analysis (MIA) has seen an increasing interest due to its core application in computer-aided diagnosis systems (CADs). A modality in MIA refers to a specific technology used to produce human body images, such as MRI, CT scans, or X-rays. Each modality presents unique challenges and characteristics, often leading to imbalances within datasets. This significant challenge impedes model training and generalization due to the varying convergence rates of different modalities and the suppression of gradients in less dominant modalities.</div></div><div><h3>Methods:</h3><div>This paper proposes a novel fusion approach, and we named it Slice-Fusion. The proposed approach aims to mitigate the modality imbalance problem by implementing a “Modality-Specific-Balancing-Factor” fusion strategy. Furthermore, it incorporates an auxiliary (uni-modal) task that generates balanced modality pairs based on the image orientations of different modalities. Subsequently, a novel multimodal classification framework is presented to learn from the generated balanced modalities. The effectiveness of the proposed approach is evaluated through comparative assessments on a publicly available BraTS2021 <span><span>dataset</span><svg><path></path></svg></span>. The results demonstrate the efficiency of Slice-Fusion in resolving the modality imbalance problem. By enhancing the representation of balanced features and reducing modality bias, this approach holds promise for advancing visual health informatics and facilitating more accurate and reliable medical image analysis.</div></div><div><h3>Results:</h3><div>In the experiment section, three diverse experiments are conducted such as i) Fusion Loss Metrics Evaluation, ii) Classification, and iii) Visual Health Informatics. Notably, the proposed approach achieved an F1-Score of (100%, 81.25%) on the training and validation sets for the classification generalization task. In addition to the Slice-Fusion’s out-performance, the study also created a new modality-aligned dataset (a highly balanced and informative modality-specific image collection) that aids further research and improves MIA’s robustness. These advancements not only enhance the capability of medical diagnostic tools but also create opportunities for future innovations in the field.</div></div><div><h3>Conclusion:</h3><div>This study contributes to advancing medical image analysis, such as effective modality fusion, image reconstruction, comparison, and glioma classification, facilitating more accurate and reliable results, and holds promise for further advancements in visual health informatics.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"261 ","pages":"Article 108615"},"PeriodicalIF":4.9000,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S016926072500032X","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and objective:
In recent times, medical imaging analysis (MIA) has seen an increasing interest due to its core application in computer-aided diagnosis systems (CADs). A modality in MIA refers to a specific technology used to produce human body images, such as MRI, CT scans, or X-rays. Each modality presents unique challenges and characteristics, often leading to imbalances within datasets. This significant challenge impedes model training and generalization due to the varying convergence rates of different modalities and the suppression of gradients in less dominant modalities.
Methods:
This paper proposes a novel fusion approach, and we named it Slice-Fusion. The proposed approach aims to mitigate the modality imbalance problem by implementing a “Modality-Specific-Balancing-Factor” fusion strategy. Furthermore, it incorporates an auxiliary (uni-modal) task that generates balanced modality pairs based on the image orientations of different modalities. Subsequently, a novel multimodal classification framework is presented to learn from the generated balanced modalities. The effectiveness of the proposed approach is evaluated through comparative assessments on a publicly available BraTS2021 dataset. The results demonstrate the efficiency of Slice-Fusion in resolving the modality imbalance problem. By enhancing the representation of balanced features and reducing modality bias, this approach holds promise for advancing visual health informatics and facilitating more accurate and reliable medical image analysis.
Results:
In the experiment section, three diverse experiments are conducted such as i) Fusion Loss Metrics Evaluation, ii) Classification, and iii) Visual Health Informatics. Notably, the proposed approach achieved an F1-Score of (100%, 81.25%) on the training and validation sets for the classification generalization task. In addition to the Slice-Fusion’s out-performance, the study also created a new modality-aligned dataset (a highly balanced and informative modality-specific image collection) that aids further research and improves MIA’s robustness. These advancements not only enhance the capability of medical diagnostic tools but also create opportunities for future innovations in the field.
Conclusion:
This study contributes to advancing medical image analysis, such as effective modality fusion, image reconstruction, comparison, and glioma classification, facilitating more accurate and reliable results, and holds promise for further advancements in visual health informatics.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.