LSMD:基于长短记忆的检测网络,用于 B 型超声视频流中的颈动脉检测。

IF 3 2区 工程技术 Q1 ACOUSTICS
Chunjie Shan;Yidan Zhang;Chunrui Liu;Zhibin Jin;Hanlin Cheng;Yidi Chen;Jing Yao;Shouhua Luo
{"title":"LSMD:基于长短记忆的检测网络,用于 B 型超声视频流中的颈动脉检测。","authors":"Chunjie Shan;Yidan Zhang;Chunrui Liu;Zhibin Jin;Hanlin Cheng;Yidi Chen;Jing Yao;Shouhua Luo","doi":"10.1109/TUFFC.2024.3494019","DOIUrl":null,"url":null,"abstract":"Carotid atherosclerotic plaques are a major complication associated with type II diabetes, and carotid ultrasound is commonly used for diagnosing carotid vascular disease. In primary hospitals, less experienced ultrasound physicians often struggle to consistently capture standard carotid images and identify plaques. To address this issue, we propose a novel approach, the long-short memory-based detection (LSMD) network, for carotid artery detection in ultrasound video streams, facilitating the identification and localization of critical anatomical structures and plaques. This approach models short- and long-distance spatiotemporal features through short-term temporal aggregation (STA) and long-term temporal aggregation (LTA) modules, effectively expanding the temporal receptive field with minimal delay and enhancing the detection efficiency of carotid anatomy and plaques. Specifically, we introduce memory buffers with a dynamic updating strategy to ensure extensive temporal receptive field coverage while minimizing memory and computation costs. The proposed model was trained on 80 carotid ultrasound videos and evaluated on 50, with all videos annotated by physicians for carotid anatomies and plaques. The trained LSMD was evaluated for performance on the validation and test sets using the single-frame image-based single shot multibox detector (SSD) algorithm as a baseline. The results show that the precision, recall, average precision (AP) at \n<inline-formula> <tex-math>$\\text {IoU}={0.50}$ </tex-math></inline-formula>\n (\n<inline-formula> <tex-math>$\\text {AP}_{{50}}$ </tex-math></inline-formula>\n), and mean AP (mAP) are 6.83%, 12.29%, 11.23%, and 13.21% higher than the baseline (\n<inline-formula> <tex-math>${p}\\lt {0.001}$ </tex-math></inline-formula>\n), respectively, while the model’s inference latency reaches 6.97 ms on a desktop-level GPU (NVIDIA RTX 3090Ti) and 29.69 ms on an edge computing device (Jetson Orin Nano). These findings demonstrate that LSMD can accurately localize carotid anatomy and plaques with real-time inference, indicating its potential for enhancing diagnostic accuracy in clinical practice.","PeriodicalId":13322,"journal":{"name":"IEEE transactions on ultrasonics, ferroelectrics, and frequency control","volume":"71 11","pages":"1464-1477"},"PeriodicalIF":3.0000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LSMD: Long-Short Memory-Based Detection Network for Carotid Artery Detection in B-Mode Ultrasound Video Streams\",\"authors\":\"Chunjie Shan;Yidan Zhang;Chunrui Liu;Zhibin Jin;Hanlin Cheng;Yidi Chen;Jing Yao;Shouhua Luo\",\"doi\":\"10.1109/TUFFC.2024.3494019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Carotid atherosclerotic plaques are a major complication associated with type II diabetes, and carotid ultrasound is commonly used for diagnosing carotid vascular disease. In primary hospitals, less experienced ultrasound physicians often struggle to consistently capture standard carotid images and identify plaques. To address this issue, we propose a novel approach, the long-short memory-based detection (LSMD) network, for carotid artery detection in ultrasound video streams, facilitating the identification and localization of critical anatomical structures and plaques. This approach models short- and long-distance spatiotemporal features through short-term temporal aggregation (STA) and long-term temporal aggregation (LTA) modules, effectively expanding the temporal receptive field with minimal delay and enhancing the detection efficiency of carotid anatomy and plaques. Specifically, we introduce memory buffers with a dynamic updating strategy to ensure extensive temporal receptive field coverage while minimizing memory and computation costs. The proposed model was trained on 80 carotid ultrasound videos and evaluated on 50, with all videos annotated by physicians for carotid anatomies and plaques. The trained LSMD was evaluated for performance on the validation and test sets using the single-frame image-based single shot multibox detector (SSD) algorithm as a baseline. The results show that the precision, recall, average precision (AP) at \\n<inline-formula> <tex-math>$\\\\text {IoU}={0.50}$ </tex-math></inline-formula>\\n (\\n<inline-formula> <tex-math>$\\\\text {AP}_{{50}}$ </tex-math></inline-formula>\\n), and mean AP (mAP) are 6.83%, 12.29%, 11.23%, and 13.21% higher than the baseline (\\n<inline-formula> <tex-math>${p}\\\\lt {0.001}$ </tex-math></inline-formula>\\n), respectively, while the model’s inference latency reaches 6.97 ms on a desktop-level GPU (NVIDIA RTX 3090Ti) and 29.69 ms on an edge computing device (Jetson Orin Nano). These findings demonstrate that LSMD can accurately localize carotid anatomy and plaques with real-time inference, indicating its potential for enhancing diagnostic accuracy in clinical practice.\",\"PeriodicalId\":13322,\"journal\":{\"name\":\"IEEE transactions on ultrasonics, ferroelectrics, and frequency control\",\"volume\":\"71 11\",\"pages\":\"1464-1477\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on ultrasonics, ferroelectrics, and frequency control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10747830/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on ultrasonics, ferroelectrics, and frequency control","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10747830/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

颈动脉粥样硬化斑块是 II 型糖尿病的主要并发症,颈动脉超声通常用于诊断颈动脉血管疾病。在基层医院,经验较少的超声医生往往难以持续捕捉标准颈动脉图像并识别斑块。为解决这一问题,我们提出了一种新方法--基于长短记忆的检测网络(LSMD),用于超声视频流中的颈动脉检测,促进关键解剖结构和斑块的识别和定位。这种方法通过短期时空聚合(STA)和长期时空聚合(LTA)模块对短距离和长距离时空特征进行建模,以最小的延迟有效扩展时空感受野,提高颈动脉解剖结构和斑块的检测效率。具体来说,我们引入了具有动态更新策略的内存缓冲区,以确保广泛的时间感受野覆盖,同时最大限度地降低内存和计算成本。我们在 80 个颈动脉超声视频上对所提出的模型进行了训练,并在 50 个视频上进行了评估,所有视频都由医生对颈动脉解剖结构和斑块进行了注释。以基于单帧图像的单枪多箱检测器(SSD)算法为基准,对训练好的 LSMD 在验证集和测试集上的性能进行了评估。结果显示,精确度、召回率、IoU = 0.50时的平均精确度(AP50)和平均平均精确度(mAP)分别比基线高出6.83%、12.29%、11.23%和13.21%(p < 0.001),而模型的推理延迟在桌面级GPU(英伟达RTX 3090Ti)上为6.97ms,在边缘计算设备(Jetson Orin Nano)上为29.69ms。这些研究结果表明,LSMD 可以通过实时推理准确定位颈动脉解剖结构和斑块,显示了其在临床实践中提高诊断准确性的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
LSMD: Long-Short Memory-Based Detection Network for Carotid Artery Detection in B-Mode Ultrasound Video Streams
Carotid atherosclerotic plaques are a major complication associated with type II diabetes, and carotid ultrasound is commonly used for diagnosing carotid vascular disease. In primary hospitals, less experienced ultrasound physicians often struggle to consistently capture standard carotid images and identify plaques. To address this issue, we propose a novel approach, the long-short memory-based detection (LSMD) network, for carotid artery detection in ultrasound video streams, facilitating the identification and localization of critical anatomical structures and plaques. This approach models short- and long-distance spatiotemporal features through short-term temporal aggregation (STA) and long-term temporal aggregation (LTA) modules, effectively expanding the temporal receptive field with minimal delay and enhancing the detection efficiency of carotid anatomy and plaques. Specifically, we introduce memory buffers with a dynamic updating strategy to ensure extensive temporal receptive field coverage while minimizing memory and computation costs. The proposed model was trained on 80 carotid ultrasound videos and evaluated on 50, with all videos annotated by physicians for carotid anatomies and plaques. The trained LSMD was evaluated for performance on the validation and test sets using the single-frame image-based single shot multibox detector (SSD) algorithm as a baseline. The results show that the precision, recall, average precision (AP) at $\text {IoU}={0.50}$ ( $\text {AP}_{{50}}$ ), and mean AP (mAP) are 6.83%, 12.29%, 11.23%, and 13.21% higher than the baseline ( ${p}\lt {0.001}$ ), respectively, while the model’s inference latency reaches 6.97 ms on a desktop-level GPU (NVIDIA RTX 3090Ti) and 29.69 ms on an edge computing device (Jetson Orin Nano). These findings demonstrate that LSMD can accurately localize carotid anatomy and plaques with real-time inference, indicating its potential for enhancing diagnostic accuracy in clinical practice.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.70
自引率
16.70%
发文量
583
审稿时长
4.5 months
期刊介绍: IEEE Transactions on Ultrasonics, Ferroelectrics and Frequency Control includes the theory, technology, materials, and applications relating to: (1) the generation, transmission, and detection of ultrasonic waves and related phenomena; (2) medical ultrasound, including hyperthermia, bioeffects, tissue characterization and imaging; (3) ferroelectric, piezoelectric, and piezomagnetic materials, including crystals, polycrystalline solids, films, polymers, and composites; (4) frequency control, timing and time distribution, including crystal oscillators and other means of classical frequency control, and atomic, molecular and laser frequency control standards. Areas of interest range from fundamental studies to the design and/or applications of devices and systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信