Enhanced Vision Transformer with Custom Attention Mechanism for Automated Idiopathic Scoliosis Classification.

Nevzat Yeşilmen, Çağla Danacı, Merve Parlak Baydoğan, Seda Arslan Tuncer, Ahmet Çınar, Taner Tuncer
{"title":"Enhanced Vision Transformer with Custom Attention Mechanism for Automated Idiopathic Scoliosis Classification.","authors":"Nevzat Yeşilmen, Çağla Danacı, Merve Parlak Baydoğan, Seda Arslan Tuncer, Ahmet Çınar, Taner Tuncer","doi":"10.1007/s10278-025-01564-w","DOIUrl":null,"url":null,"abstract":"<p><p>Scoliosis is a three-dimensional spinal deformity that is the most common among spinal deformities and causes extremely serious posture disorders in advanced stages. Scoliosis can lead to various health problems, including pain, respiratory dysfunction, heart problems, mental health disorders, stress, and emotional difficulties. The current gold standard for grading scoliosis and planning treatment is based on the Cobb angle measurement on X-rays. The Cobb angle measurement is performed by physical medicine and rehabilitation specialists, orthopedists, radiologists, etc., in branches dealing with the musculoskeletal system. Manual calculation of the Cobb angle for this process is subjective and takes more time. Deep learning-based systems that can evaluate the Cobb angle objectively have been frequently used recently. In this article, we propose an enhanced ViT that allows doctors to evaluate the diagnosis of scoliosis more objectively without wasting time. The proposed model uses a custom attention mechanism instead of the standard multi-head attention mechanism for the ViT model. A dataset with 7 different classes was obtained from 1456 patients in total from Elazığ Fethi Sekin City Hospital Physical Medicine and Rehabilitation Clinic. Multiple models were used to compare the proposed architecture in the classification of scoliosis disease. The proposed improved ViT architecture exhibited the best performance with 95.21% accuracy. This result shows that a superior classification success was achieved compared to ResNet50, Swin Transformer, and standard ViT models.</p>","PeriodicalId":516858,"journal":{"name":"Journal of imaging informatics in medicine","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of imaging informatics in medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10278-025-01564-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Scoliosis is a three-dimensional spinal deformity that is the most common among spinal deformities and causes extremely serious posture disorders in advanced stages. Scoliosis can lead to various health problems, including pain, respiratory dysfunction, heart problems, mental health disorders, stress, and emotional difficulties. The current gold standard for grading scoliosis and planning treatment is based on the Cobb angle measurement on X-rays. The Cobb angle measurement is performed by physical medicine and rehabilitation specialists, orthopedists, radiologists, etc., in branches dealing with the musculoskeletal system. Manual calculation of the Cobb angle for this process is subjective and takes more time. Deep learning-based systems that can evaluate the Cobb angle objectively have been frequently used recently. In this article, we propose an enhanced ViT that allows doctors to evaluate the diagnosis of scoliosis more objectively without wasting time. The proposed model uses a custom attention mechanism instead of the standard multi-head attention mechanism for the ViT model. A dataset with 7 different classes was obtained from 1456 patients in total from Elazığ Fethi Sekin City Hospital Physical Medicine and Rehabilitation Clinic. Multiple models were used to compare the proposed architecture in the classification of scoliosis disease. The proposed improved ViT architecture exhibited the best performance with 95.21% accuracy. This result shows that a superior classification success was achieved compared to ResNet50, Swin Transformer, and standard ViT models.

带有自定义注意机制的增强视觉变压器用于特发性脊柱侧凸自动分类。
脊柱侧凸是一种三维脊柱畸形,是脊柱畸形中最常见的,在晚期会导致极其严重的姿势障碍。脊柱侧凸会导致各种健康问题,包括疼痛、呼吸功能障碍、心脏问题、精神健康障碍、压力和情绪困难。目前脊柱侧凸分级和治疗计划的金标准是基于x射线上的Cobb角测量。科布角测量是由物理医学和康复专家、骨科医生、放射科医生等在处理肌肉骨骼系统的分支机构进行的。手动计算这个过程的Cobb角是主观的,需要更多的时间。基于深度学习的系统可以客观地评估Cobb角,最近已经被频繁使用。在本文中,我们提出了一种增强的ViT,使医生能够更客观地评估脊柱侧凸的诊断,而不浪费时间。该模型使用自定义的注意机制代替了ViT模型的标准多头注意机制。从Elazığ Fethi Sekin市医院物理医学和康复诊所的1456名患者中获得了7个不同类别的数据集。多个模型被用来比较脊柱侧凸疾病分类中提出的结构。改进后的ViT结构具有最佳性能,准确率为95.21%。该结果表明,与ResNet50、Swin Transformer和标准ViT模型相比,该方法的分类成功率更高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信