{"title":"LungNet-ViT: Efficient lung disease classification using a multistage vision transformer model from chest radiographs.","authors":"V Padmavathi, Kavitha Ganesan","doi":"10.1177/08953996251320262","DOIUrl":null,"url":null,"abstract":"<p><p>This research introduces a Multistage-Vision Transformer (Multistage-ViT) model for precisely classifying various lung diseases using chest radiographic (CXR) images. The dataset in the proposed method includes four classes: Normal, COVID-19, Viral Pneumonia and Lung Opacity. This model demonstrates its efficacy on imbalanced and balanced datasets by enhancing classifier accuracy through deep feature extraction. It integrates backbone models with the ViT architecture, creating rigorously hybrid configurations compared to their standalone counterparts. These hybrid models utilize optimized features for classification, significantly improving their performance. Notably, the multistage-ViT model achieved accuracies of 99.93% on an imbalanced dataset and 99.97% on a balanced dataset using the InceptionV3 combined with the ViT model. These findings highlight the superior accuracy and robustness of multistage-ViT models, underscoring their potential to enhance lung disease classification through advanced feature extraction and model integration techniques. The proposed model effectively demonstrates the benefits of employing ViT for deep feature extraction from CXR images.</p>","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"8953996251320262"},"PeriodicalIF":1.7000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of X-Ray Science and Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/08953996251320262","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
Abstract
This research introduces a Multistage-Vision Transformer (Multistage-ViT) model for precisely classifying various lung diseases using chest radiographic (CXR) images. The dataset in the proposed method includes four classes: Normal, COVID-19, Viral Pneumonia and Lung Opacity. This model demonstrates its efficacy on imbalanced and balanced datasets by enhancing classifier accuracy through deep feature extraction. It integrates backbone models with the ViT architecture, creating rigorously hybrid configurations compared to their standalone counterparts. These hybrid models utilize optimized features for classification, significantly improving their performance. Notably, the multistage-ViT model achieved accuracies of 99.93% on an imbalanced dataset and 99.97% on a balanced dataset using the InceptionV3 combined with the ViT model. These findings highlight the superior accuracy and robustness of multistage-ViT models, underscoring their potential to enhance lung disease classification through advanced feature extraction and model integration techniques. The proposed model effectively demonstrates the benefits of employing ViT for deep feature extraction from CXR images.
期刊介绍:
Research areas within the scope of the journal include:
Interaction of x-rays with matter: x-ray phenomena, biological effects of radiation, radiation safety and optical constants
X-ray sources: x-rays from synchrotrons, x-ray lasers, plasmas, and other sources, conventional or unconventional
Optical elements: grazing incidence optics, multilayer mirrors, zone plates, gratings, other diffraction optics
Optical instruments: interferometers, spectrometers, microscopes, telescopes, microprobes