BrainDx: a dual-transformer framework using PVT and SegFormer for tumor diagnosis

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Arshleen Kaur , Vinay Kukreja , Modafar Ati , Ankit Bansal , Shanmugasundaram Hariharan
{"title":"BrainDx: a dual-transformer framework using PVT and SegFormer for tumor diagnosis","authors":"Arshleen Kaur ,&nbsp;Vinay Kukreja ,&nbsp;Modafar Ati ,&nbsp;Ankit Bansal ,&nbsp;Shanmugasundaram Hariharan","doi":"10.1016/j.bspc.2025.108917","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><div>Brain tumor diagnosis is challenging due to their complex morphology, indistinct boundaries, and subtle variations in Magnetic Resonance Imaging (MRI) scans. Manual diagnosis is time-consuming and error-prone, making the need for automated systems crucial. Recent advancements in deep learning, particularly in transformer models, have led to improved accuracy and speed in medical image analysis.</div></div><div><h3>Objective</h3><div>This research aims to develop an Artificial Intelligencee (AI) based framework that integrates the Pyramid Vision Transformer (PVT) for tumor classification and the SegFormer for tumor segmentation, thereby enhancing diagnostic accuracy, speed, and reducing human error in brain tumor detection.</div></div><div><h3>Methodology</h3><div>The proposed framework, BrainDX, utilizes PVT to classify MRI images into tumor types (Gliomas, Meningiomas, Pituitary Tumors, and Healthy Brain), and SegFormer to segment tumor regions in real-time. The dataset consists of annotated MRI images that undergo preprocessing (normalization, resizing, and augmentation). The models are trained and evaluated based on performance metrics, including accuracy, Dice score, Intersection over Union (IoU), and segmentation time.</div></div><div><h3>Results</h3><div>The framework was evaluated across three benchmark MRI datasets, achieving a classification accuracy of 94.0% and a Dice score of 0.87 for tumor segmentation. SegFormer demonstrated real-time segmentation, processing MRI images in under 50 ms. Both models maintained high efficiency while delivering robust performance, even in cases of irregular tumor boundaries.</div></div><div><h3>Future Scope</h3><div>Future work will focus on further optimizing the model for real-time clinical use, improving generalization across diverse tumor types and MRI modalities. This AI-powered system has the potential to enhance diagnostic processes and improve patient outcomes significantly.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"113 ","pages":"Article 108917"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425014284","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Context

Brain tumor diagnosis is challenging due to their complex morphology, indistinct boundaries, and subtle variations in Magnetic Resonance Imaging (MRI) scans. Manual diagnosis is time-consuming and error-prone, making the need for automated systems crucial. Recent advancements in deep learning, particularly in transformer models, have led to improved accuracy and speed in medical image analysis.

Objective

This research aims to develop an Artificial Intelligencee (AI) based framework that integrates the Pyramid Vision Transformer (PVT) for tumor classification and the SegFormer for tumor segmentation, thereby enhancing diagnostic accuracy, speed, and reducing human error in brain tumor detection.

Methodology

The proposed framework, BrainDX, utilizes PVT to classify MRI images into tumor types (Gliomas, Meningiomas, Pituitary Tumors, and Healthy Brain), and SegFormer to segment tumor regions in real-time. The dataset consists of annotated MRI images that undergo preprocessing (normalization, resizing, and augmentation). The models are trained and evaluated based on performance metrics, including accuracy, Dice score, Intersection over Union (IoU), and segmentation time.

Results

The framework was evaluated across three benchmark MRI datasets, achieving a classification accuracy of 94.0% and a Dice score of 0.87 for tumor segmentation. SegFormer demonstrated real-time segmentation, processing MRI images in under 50 ms. Both models maintained high efficiency while delivering robust performance, even in cases of irregular tumor boundaries.

Future Scope

Future work will focus on further optimizing the model for real-time clinical use, improving generalization across diverse tumor types and MRI modalities. This AI-powered system has the potential to enhance diagnostic processes and improve patient outcomes significantly.
BrainDx:使用PVT和SegFormer进行肿瘤诊断的双变压器框架
脑肿瘤的诊断是具有挑战性的,由于其复杂的形态,模糊的边界,和微妙的变化在磁共振成像(MRI)扫描。手动诊断既耗时又容易出错,因此对自动化系统的需求至关重要。深度学习的最新进展,特别是在变压器模型方面,提高了医学图像分析的准确性和速度。目的开发一种基于人工智能(AI)的框架,将用于肿瘤分类的金字塔视觉转换器(Pyramid Vision Transformer, PVT)和用于肿瘤分割的SegFormer集成在一起,从而提高脑肿瘤检测的准确性、速度,减少人为错误。提出的框架BrainDX利用PVT将MRI图像分类为肿瘤类型(胶质瘤、脑膜瘤、垂体瘤和健康脑),并利用SegFormer实时分割肿瘤区域。该数据集由经过预处理(规范化、调整大小和增强)的带注释的MRI图像组成。这些模型是基于性能指标进行训练和评估的,包括准确性、骰子分数、交集比联合(IoU)和分割时间。结果该框架在三个基准MRI数据集上进行了评估,在肿瘤分割方面实现了94.0%的分类准确率和0.87的Dice评分。SegFormer演示了实时分割,在50 ms内处理MRI图像。即使在不规则肿瘤边界的情况下,这两种模型都保持了高效率,同时提供了强大的性能。未来的工作将集中在进一步优化实时临床应用的模型,提高不同肿瘤类型和MRI模式的通用性。这种人工智能驱动的系统有可能加强诊断过程,并显著改善患者的治疗效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信