DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2024-07-05 DOI:10.1016/j.media.2024.103265

Miao Chu , Giovanni Luigi De Maria , Ruobing Dai , Stefano Benenati , Wei Yu , Jiaxin Zhong , Rafail Kotronias , Jason Walsh , Stefano Andreaggi , Vittorio Zuccarelli , Jason Chai , Oxford Acute Myocardial Infarction (OxAMI) Study investigators , Keith Channon , Adrian Banning , Shengxian Tu

{"title":"DCCAT: Dual-Coordinate Cross-Attention Transformer for thrombus segmentation on coronary OCT","authors":"Miao Chu , Giovanni Luigi De Maria , Ruobing Dai , Stefano Benenati , Wei Yu , Jiaxin Zhong , Rafail Kotronias , Jason Walsh , Stefano Andreaggi , Vittorio Zuccarelli , Jason Chai , Oxford Acute Myocardial Infarction (OxAMI) Study investigators , Keith Channon , Adrian Banning , Shengxian Tu","doi":"10.1016/j.media.2024.103265","DOIUrl":null,"url":null,"abstract":"<div><p>Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual-coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.</p></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"97 ","pages":"Article 103265"},"PeriodicalIF":11.8000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841524001907","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Acute coronary syndromes (ACS) are one of the leading causes of mortality worldwide, with atherosclerotic plaque rupture and subsequent thrombus formation as the main underlying substrate. Thrombus burden evaluation is important for tailoring treatment therapy and predicting prognosis. Coronary optical coherence tomography (OCT) enables in-vivo visualization of thrombus that cannot otherwise be achieved by other image modalities. However, automatic quantification of thrombus on OCT has not been implemented. The main challenges are due to the variation in location, size and irregularities of thrombus in addition to the small data set. In this paper, we propose a novel dual-coordinate cross-attention transformer network, termed DCCAT, to overcome the above challenges and achieve the first automatic segmentation of thrombus on OCT. Imaging features from both Cartesian and polar coordinates are encoded and fused based on long-range correspondence via multi-head cross-attention mechanism. The dual-coordinate cross-attention block is hierarchically stacked amid convolutional layers at multiple levels, allowing comprehensive feature enhancement. The model was developed based on 5,649 OCT frames from 339 patients and tested using independent external OCT data from 548 frames of 52 patients. DCCAT achieved Dice similarity score (DSC) of 0.706 in segmenting thrombus, which is significantly higher than the CNN-based (0.656) and Transformer-based (0.584) models. We prove that the additional input of polar image not only leverages discriminative features from another coordinate but also improves model robustness for geometrical transformation.Experiment results show that DCCAT achieves competitive performance with only 10% of the total data, highlighting its data efficiency. The proposed dual-coordinate cross-attention design can be easily integrated into other developed Transformer models to boost performance.

Abstract Image

查看原文本刊更多论文

DCCAT：用于冠状动脉 OCT 上血栓分割的双坐标交叉注意力变换器

急性冠状动脉综合征（ACS）是导致全球死亡的主要原因之一，其主要基础是动脉粥样硬化斑块破裂和随后的血栓形成。血栓负荷评估对于调整治疗方法和预测预后非常重要。冠状动脉光学相干断层扫描（OCT）可在体内观察血栓，这是其他图像模式无法实现的。然而，OCT 对血栓的自动量化尚未实现。除了数据集较小之外，血栓的位置、大小和不规则性的变化也是主要挑战。在本文中，我们提出了一种新颖的双坐标交叉注意变换器网络（称为 DCCAT）来克服上述挑战，并首次实现了 OCT 上血栓的自动分割。来自直角坐标和极坐标的成像特征通过多头交叉注意机制，基于长距离对应关系进行编码和融合。双坐标交叉注意块在多层卷积层中分层堆叠，可实现全面的特征增强。该模型是基于 339 名患者的 5,649 个 OCT 帧开发的，并使用 52 名患者的 548 个帧的独立外部 OCT 数据进行了测试。在分割血栓时，DCCAT 的 Dice 相似度得分 (DSC) 达到了 0.706，明显高于基于 CNN 的模型（0.656）和基于 Transformer 的模型（0.584）。实验结果表明，DCCAT 只用了总数据量的 10%，就取得了极具竞争力的性能，凸显了其数据效率。实验结果表明，DCCAT 只需总数据量的 10%，就能实现具有竞争力的性能，突出了其数据效率。所提出的双坐标交叉关注设计可以轻松集成到其他已开发的变换器模型中，以提高性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.