A comprehensive review on transformer network for natural and medical image analysis

IF 13.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Ramkumar Thirunavukarasu , Evans Kotei
{"title":"A comprehensive review on transformer network for natural and medical image analysis","authors":"Ramkumar Thirunavukarasu ,&nbsp;Evans Kotei","doi":"10.1016/j.cosrev.2024.100648","DOIUrl":null,"url":null,"abstract":"<div><p>The Transformer network is the main application area for natural language processing. It has gained traction lately and exhibits potential in the field of computer vision. This cutting-edge method has proven to offer a significant impact on image analysis, a crucial area of computer vision. The transformer's outstanding performance in vision computing places it as an alternative to the convolutional neural network for vision tasks. Transformers have taken center stage in the field of natural language processing. Despite the outstanding performance of transformer networks in natural image processing, their implementation in medical image analysis is gradually gaining roots. This study focuses on the transformer application in natural and medical image analysis. The first part of the study provides an overview of the core concepts of the attention mechanism built into transformers for long-range feature extraction. The study again highlights the various transformer architectures proposed for natural and medical image tasks such as segmentation, classification, image registration and diagnosis. Finally, the paper presents limitations identified in proposed transformer networks for natural and medical image processing. It also highlights prospective study opportunities for further research to better the computer vision domain, especially medical image analysis. This study offers knowledge to scholars and researchers studying computer vision applications as they focus on creating innovative transformer network-based solutions.</p></div>","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":"53 ","pages":"Article 100648"},"PeriodicalIF":13.3000,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574013724000327","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The Transformer network is the main application area for natural language processing. It has gained traction lately and exhibits potential in the field of computer vision. This cutting-edge method has proven to offer a significant impact on image analysis, a crucial area of computer vision. The transformer's outstanding performance in vision computing places it as an alternative to the convolutional neural network for vision tasks. Transformers have taken center stage in the field of natural language processing. Despite the outstanding performance of transformer networks in natural image processing, their implementation in medical image analysis is gradually gaining roots. This study focuses on the transformer application in natural and medical image analysis. The first part of the study provides an overview of the core concepts of the attention mechanism built into transformers for long-range feature extraction. The study again highlights the various transformer architectures proposed for natural and medical image tasks such as segmentation, classification, image registration and diagnosis. Finally, the paper presents limitations identified in proposed transformer networks for natural and medical image processing. It also highlights prospective study opportunities for further research to better the computer vision domain, especially medical image analysis. This study offers knowledge to scholars and researchers studying computer vision applications as they focus on creating innovative transformer network-based solutions.

用于自然和医学图像分析的变压器网络综述
变形网络是自然语言处理的主要应用领域。最近,它在计算机视觉领域获得了广泛应用,并展现出巨大潜力。事实证明,这种前沿方法对图像分析这一计算机视觉的重要领域具有重大影响。变压器在视觉计算方面的出色表现使其成为视觉任务中卷积神经网络的替代品。变压器在自然语言处理领域占据了中心位置。尽管变换器网络在自然图像处理中表现出色,但其在医学图像分析中的应用正逐渐扎根。本研究重点关注变换器在自然和医学图像分析中的应用。研究的第一部分概述了转换器中用于远距离特征提取的注意力机制的核心概念。研究再次强调了针对自然和医学图像任务(如分割、分类、图像配准和诊断)提出的各种变换器架构。最后,论文介绍了针对自然和医学图像处理提出的变换器网络的局限性。它还强调了进一步研究的前瞻性机会,以改善计算机视觉领域,尤其是医学图像分析。这项研究为研究计算机视觉应用的学者和研究人员提供了知识,因为他们专注于创建基于变压器网络的创新解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computer Science Review
Computer Science Review Computer Science-General Computer Science
CiteScore
32.70
自引率
0.00%
发文量
26
审稿时长
51 days
期刊介绍: Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信