{"title":"TDGU-Net: A Hybrid CNN-Transformer Model for Intracranial Aneurysm Segmentation","authors":"Xiaoqing Lin, Chen Wang, Zhengkui Chen, Jianwei Pan, Jijun Tong","doi":"10.1002/ima.70157","DOIUrl":"https://doi.org/10.1002/ima.70157","url":null,"abstract":"<div>\u0000 \u0000 <p>Intracranial aneurysms are life-threatening cerebrovascular conditions, and their accurate identification is crucial for early diagnosis and treatment planning. Automated segmentation technology plays a key role in enhancing diagnostic accuracy and enabling timely intervention. However, the segmentation task is challenging due to the diverse morphologies of aneurysms, indistinct boundaries, and their resemblance to adjacent vascular structures. This study introduces TDGU-Net, a deep learning-based method that combines Convolutional Neural Networks (CNNs) with Transformer architecture to improve segmentation accuracy and efficiency. The model uses CNNs for efficient local feature extraction, while Transformer blocks are employed to establish global relationships within local regions, enhancing the model's ability to capture contextual dependencies. Furthermore, a multi-scale feature fusion module is incorporated to capture critical information across different resolutions, and the Attention Gate mechanism is used to improve the model's ability to accurately identify aneurysm regions. The proposed model was evaluated on the Large IA Segmentation dataset and further validated on the MICCAI 2020 ADAM dataset to demonstrate its adaptability to different datasets. It achieved a Dice coefficient of 76.92% and a sensitivity of 79.65%, demonstrating robust segmentation performance and accurate detection of aneurysms. The proposed method provides a promising tool for the automated diagnosis of intracranial aneurysms, with significant potential for clinical application and improving patient outcomes.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144582381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vivekanand Thakare, Shailendra S. Aote, Abhijeet Raipurkar
{"title":"A Novel Framework for Lung Disease Classification Using Multiscale Convolutional Neural Networks With an Integrated Dynamic Attention Mechanism","authors":"Vivekanand Thakare, Shailendra S. Aote, Abhijeet Raipurkar","doi":"10.1002/ima.70155","DOIUrl":"https://doi.org/10.1002/ima.70155","url":null,"abstract":"<div>\u0000 \u0000 <p>Lung disease diagnosis remains a significant clinical challenge due to the similarity in radiological features across various conditions such as COPD, pneumonia, tuberculosis, COVID-19, and lung cancer. Manual interpretation of chest CT scans is time-consuming and subject to inter-observer variability, particularly in resource-limited settings. To address these challenges, this study proposes a novel deep learning framework Multiscale Convolutional Neural Networks with Attention Mechanism (MCNN-AM) for automated classification of lung diseases into six categories, including normal lungs. The model leverages multiscale convolutional layers to extract both localized and global features, enabling better discrimination between diseases with overlapping characteristics. A dynamic attention mechanism, comprising both spatial and channel attention modules, is integrated to emphasize disease-relevant regions and suppress background noise, enhancing the model's diagnostic focus. Additionally, depthwise separable convolutions are utilized to reduce computational complexity while preserving feature richness. The MCNN-AM model is trained and evaluated on publicly available datasets, comprising 6000 training images and 1200 testing images equally distributed across all classes. The model achieves a classification accuracy of 96.84%, outperforming state-of-the-art models such as ResNet50, DenseNet121, and InceptionV3 in terms of precision, recall, F1-score, sensitivity, and specificity. Ablation studies further validate the critical role of the attention modules in achieving high performance. These results demonstrate the potential of MCNN-AM as a reliable, scalable tool for computer-aided diagnosis of lung diseases.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144573892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative Assessment of CNN and Transformer U-Nets in Multiple Sclerosis Lesion Segmentation","authors":"Beytullah Sarica, Yunus Serhat Bicakci, Dursun Zafer Seker","doi":"10.1002/ima.70146","DOIUrl":"https://doi.org/10.1002/ima.70146","url":null,"abstract":"<div>\u0000 \u0000 <p>Multiple sclerosis (MS) is a chronic autoimmune disease that causes lesions in the central nervous system. Accurate segmentation and quantification of these lesions are essential to monitor disease progression and evaluate treatments. Several architectures are used for such studies, the most popular being U-Net-based models. Therefore, this study compares CNN-based and Transformer-based U-Net architectures for MS lesion segmentation. Six U-Net architectures based on CNN and transformer, namely U-Net, R2U-Net, V-Net, Attention U-Net, TransUNet, and SwinUNet, were trained and evaluated on two MS datasets, ISBI2015 and MSSEG2016. T1-w, T2-w, and FLAIR sequences were jointly used to obtain more detailed features. A hybrid loss function, which involves the addition of focal Tversky and Dice losses, was exploited to improve the performance of models. This study was carried out in three steps. First, each model was trained separately and evaluated in each dataset. Second, each model was trained on the ISBI2015 dataset and evaluated on the MSSEG2016 dataset and vice versa. Finally, these two datasets were combined to increase the training samples and assessed on the ISBI2015 dataset. Accordingly, the R2U-Net and the V-Net models (CNN-based) achieved the best ISBI scores among the other models. The R2U-Net model achieved the best ISBI scores in the first and last steps with average scores of 92.82 and 92.91, while the V-Net model achieved the best ISBI score in the second step with an average score of 91.28. Our results show that CNN-based models surpass the Transformer-based U-Net models in most metrics for MS lesion segmentation.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144573525","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Advancing Computer-Assisted Diabetic Retinopathy Grading: A Super Learner Ensemble Technique for Fundus Imagery","authors":"Mili Rosline Mathews, S. M. Anzar","doi":"10.1002/ima.70152","DOIUrl":"https://doi.org/10.1002/ima.70152","url":null,"abstract":"<div>\u0000 \u0000 <p>Diabetic retinopathy (DR) is a severe complication of diabetes mellitus and is a predominant global cause of blindness. The accuracy of DR grading is of paramount importance to enable timely and appropriate clinical interventions. This study presents an innovative and comprehensive approach to DR grading that combines convolutional neural networks with an ensemble of diverse machine learning algorithms, referred to as a super learner ensemble. Our methodology includes a preprocessing pipeline designed to enhance the quality of the fundus images in the dataset. To further refine DR grading, we introduce a novel feature extraction model named “RetinaXtract” in conjunction with advanced machine learning classifiers. Statistical analysis tools, specifically the Friedman and Nemenyi tests, are employed to identify the most effective machine learning algorithms. Subsequently, a super learner ensemble is devised by integrating the predictions of the highest-performing machine learning algorithms. This ensemble approach captures a wide range of patterns, thereby enhancing the system's ability to accurately distinguish between different DR stages. Notably, accuracy rates of 99.64%, 99.51%, and 99.16% are achieved on the IDRiD, Kaggle, and Messidor datasets, respectively. This research represents a significant contribution to the field of DR grading, offering a balanced, efficient, and precise classification solution. The introduced methodology has demonstrated substantial promise and holds significant potential for practical applications in the detection and grading of DR from fundus images, ultimately leading to improved clinical outcomes in ophthalmology.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144573704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New Adaptive Sliding Window Method for fMRI Dynamic Functional Connectivity Analysis","authors":"Ningfei Jiang, Yuhu Shi","doi":"10.1002/ima.70154","DOIUrl":"https://doi.org/10.1002/ima.70154","url":null,"abstract":"<div>\u0000 \u0000 <p>The fixed-window sliding time window method is widely used in exploring dynamics functional connectivity of functional magnetic resonance imaging data analysis, but it is difficult to select a suitable window to capture the dynamic changes in brain function. Therefore, a local polynomial regression (LPR) method is proposed to fit the region of interest (ROI) time series in this paper, in which observations are locally modeled by a least-squares polynomial with a kernel of a certain bandwidth that allows for better bias-variance tradeoff. It combines a data-driven variable bandwidth selection mechanism with intersection of confidence intervals (ICI) and a bandwidth optimization algorithm of particle swarm optimization (PSO). Among them, ICI is used to adaptively determine the locally optimal bandwidth that minimizes the mean square error (MSE), and then the bandwidth values at various time points within all ROIs are computed for each subject. Subsequently, the averaged bandwidth values at these time points is regarded as the bandwidth value for that subject at each time point, followed by generating a time-varying bandwidth sequence for each subject, which is used in the PSO-based bandwidth optimization algorithm. Finally, the results of experiments conducted on simulated data showed that the LPR–ICI–PSO method exhibited lower MSE values on time-varying correlation coefficient estimation for different noise scenarios. Furthermore, we applied the proposed method to the autism spectrum disorder (ASD) study, and obtained a classification accuracy of 74.1% from typical controls (TC) through support vector machine (SVM) with the 10-fold cross-validation strategy. These results demonstrated that our proposed method can effectively capture the dynamic changes in brain function, which is valid in clinical diagnosis and helps to reveal the differences in brain functional connectivity patterns.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144558207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Enhancing Brain Tumor Classification Through Optimal Kernel Selection With \u0000 \u0000 \u0000 \u0000 GL\u0000 1\u0000 \u0000 \u0000 $$ {GL}_1 $$\u0000 -Regularization","authors":"Otmane Mallouk, Nour-Eddine Joudar, Mohamed Ettaouil","doi":"10.1002/ima.70150","DOIUrl":"https://doi.org/10.1002/ima.70150","url":null,"abstract":"<div>\u0000 \u0000 <p>Brain tumors, known for their rapid and aggressive growth, are among the most serious and life-threatening diseases worldwide. This makes the development of automated detection methods essential for saving lives. Deep transfer learning has become a highly effective approach for automating brain tumor classification and medical imaging, offering promising solutions on a global scale. However, leveraging a pretrained model typically involves special adaptations. Existing adaptation methods involve freezing or fine-tuning specific layers without considering the contribution level of individual kernels. This work aims to extend the concept of layer-level contributions to the kernel level by employing an adaptive optimization model. Indeed, this paper presents a novel optimization model that incorporates group lasso regularization to control which kernels are frozen and which are fine-tuned. The proposed model selects optimal source features that contribute to the target task. Additionally, the proposed optimization model is solved utilizing proximal gradient descent. The method was evaluated on a three-class brain tumor classification task, distinguishing between glioma, meningioma, and pituitary tumors, using a medical MRI dataset. Several experiments confirm the efficacy of our model in identifying both frozen and fine-tuned kernels, thereby improving data classification. Subsequently, the results obtained are compared with those of state-of-the-art transfer learning methods for comprehensive comparison.</p>\u0000 </div>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 4","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144550970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}