{"title":"Multi-Task Medical Image-to-Images Translation using Transformer for Chest X-Ray Radiography","authors":"Jingyu Xie","doi":"10.1109/icaice54393.2021.00139","DOIUrl":null,"url":null,"abstract":"Chest X-ray is one of the main methods for screening chest diseases; it features a low radiation dose, fast imaging, and low cost. To better assist doctors in diagnosis, X-ray bone suppression and organ segmentation are usually performed. Much research progress has been made in this field, but the accuracy of these two tasks is still limited by the inherent characteristics of medical images. First, organ shapes vary greatly between individuals, so segmentation errors are inevitable if the overall shape is not perceived. Second, organ boundaries are generally fuzzy, making pixels near the boundary prone to misclassification. In addition, existing bone suppression methods still cannot completely remove bone shadows. In this paper, we propose a deep learning model whose overall architecture is based on the pix2pix network; it generates both bone-suppressed images and organ segmentation maps. To address the three issues above, we make several improvements. We use a Transformer structure to strengthen attention to the global context and improve perception of the overall organ shape during feature extraction. We also design a new loss function that, in the later stage of training, assigns larger weights to position errors near organ boundaries; this loss pays more attention to edge information and helps localize organ boundaries. We evaluate the model on an X-ray image dataset and compare it comprehensively with state-of-the-art algorithms. We also verify the effectiveness of each improvement through an ablation study.","PeriodicalId":388444,"journal":{"name":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","volume":"200 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Artificial Intelligence and Computer Engineering (ICAICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaice54393.2021.00139","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Chest X-ray is one of the main methods for screening chest diseases; it features a low radiation dose, fast imaging, and low cost. To better assist doctors in diagnosis, X-ray bone suppression and organ segmentation are usually performed. Much research progress has been made in this field, but the accuracy of these two tasks is still limited by the inherent characteristics of medical images. First, organ shapes vary greatly between individuals, so segmentation errors are inevitable if the overall shape is not perceived. Second, organ boundaries are generally fuzzy, making pixels near the boundary prone to misclassification. In addition, existing bone suppression methods still cannot completely remove bone shadows. In this paper, we propose a deep learning model whose overall architecture is based on the pix2pix network; it generates both bone-suppressed images and organ segmentation maps. To address the three issues above, we make several improvements. We use a Transformer structure to strengthen attention to the global context and improve perception of the overall organ shape during feature extraction. We also design a new loss function that, in the later stage of training, assigns larger weights to position errors near organ boundaries; this loss pays more attention to edge information and helps localize organ boundaries. We evaluate the model on an X-ray image dataset and compare it comprehensively with state-of-the-art algorithms. We also verify the effectiveness of each improvement through an ablation study.
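The abstract does not give the exact formulation of the boundary-weighted loss. As a minimal sketch of the general idea — up-weighting per-pixel errors that fall near the organ boundary — the following NumPy code builds a weight map from a binary segmentation mask and applies it to a pixel-wise binary cross-entropy. All function names and the `width`/`boundary_weight` parameters here are hypothetical illustrations, not the paper's actual formulation:

```python
import numpy as np

def boundary_weight_map(mask, width=2, boundary_weight=5.0):
    """Per-pixel weights that up-weight a band of `width` pixels
    around the organ boundary (hypothetical formulation)."""
    # A pixel lies on the boundary if any 4-neighbour has a different label.
    p = np.pad(mask, 1, mode="edge")
    boundary = (
        (p[1:-1, 1:-1] != p[:-2, 1:-1]) |
        (p[1:-1, 1:-1] != p[2:, 1:-1]) |
        (p[1:-1, 1:-1] != p[1:-1, :-2]) |
        (p[1:-1, 1:-1] != p[1:-1, 2:])
    )
    # Dilate the boundary set to the requested band width.
    band = boundary.copy()
    for _ in range(width - 1):
        b = np.pad(band, 1, mode="constant")
        band = (b[1:-1, 1:-1] | b[:-2, 1:-1] | b[2:, 1:-1]
                | b[1:-1, :-2] | b[1:-1, 2:])
    return np.where(band, boundary_weight, 1.0)

def weighted_bce(pred, target, weights, eps=1e-7):
    """Pixel-wise binary cross-entropy scaled by the weight map."""
    pred = np.clip(pred, eps, 1 - eps)
    loss = -(target * np.log(pred) + (1 - target) * np.log(1 - pred))
    return float(np.mean(weights * loss))
```

In a training loop matching the abstract's description, the `boundary_weight` could start at 1.0 and be raised in later epochs, so the network first learns coarse shapes and then concentrates on boundary localization.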