{"title":"Automatic Recognition of Standard Liver Sections Based on Vision-Transformer","authors":"Jiansong Zhang, Yongjian Chen, Peizhong Liu","doi":"10.1109/ASID56930.2022.9995936","DOIUrl":null,"url":null,"abstract":"Acquisition of ultrasound standard views is a prerequisite for performing ultrasound diagnosis. With the aim of solving the clinical imaging challenge that adult liver standard sections have long been constrained by physicians' subjective experience, this paper collects 12 common liver ultrasound standard sections from the Second Hospital of Fujian Medical University and investigates and discusses the adaptability of the Vision-Transformer(ViT)-based deep learning automatic recognition method in liver ultrasound standard sections. Using a regional pixel set segmentation operation on liver ultrasound images, we found that the ViT model achieved recognition accuracy of 92.9% in the available ultrasound dataset when the basic segmentation module was 16*16 and the depth was 12. We also compared other mainstream deep learning frameworks based on convolutional neural networks, and the ViT model outperformed all other methods, guided by the features of the visual attention mechanism. The work in this paper provides a rich research base for deep learning of liver ultrasound based on the visual attention mechanism, and to a certain extent standardises the medical examination of the liver in adults by ultrasound-based means.","PeriodicalId":183908,"journal":{"name":"2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 16th International Conference on Anti-counterfeiting, Security, and Identification (ASID)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASID56930.2022.9995936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Acquisition of ultrasound standard views is a prerequisite for performing ultrasound diagnosis. With the aim of solving the clinical imaging challenge that adult liver standard sections have long been constrained by physicians' subjective experience, this paper collects 12 common liver ultrasound standard sections from the Second Hospital of Fujian Medical University and investigates and discusses the adaptability of the Vision-Transformer(ViT)-based deep learning automatic recognition method in liver ultrasound standard sections. Using a regional pixel set segmentation operation on liver ultrasound images, we found that the ViT model achieved recognition accuracy of 92.9% in the available ultrasound dataset when the basic segmentation module was 16*16 and the depth was 12. We also compared other mainstream deep learning frameworks based on convolutional neural networks, and the ViT model outperformed all other methods, guided by the features of the visual attention mechanism. The work in this paper provides a rich research base for deep learning of liver ultrasound based on the visual attention mechanism, and to a certain extent standardises the medical examination of the liver in adults by ultrasound-based means.