Fusion channel interaction attention network for water jet detection in firefighting robots

IF 7.2 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Soft Computing Pub Date : 2025-05-26 DOI:10.1016/j.asoc.2025.113364

Juxian Zhao , Wei Li , Jinsong Zhu , Zhigang Gao , Lu Pan , Zhongguan Liu

{"title":"Fusion channel interaction attention network for water jet detection in firefighting robots","authors":"Juxian Zhao , Wei Li , Jinsong Zhu , Zhigang Gao , Lu Pan , Zhongguan Liu","doi":"10.1016/j.asoc.2025.113364","DOIUrl":null,"url":null,"abstract":"<div><div>Firefighting robots play a critical role in fire suppression. Ensuring the water stream precisely hits the target during autonomous fire extinguishing is of paramount importance. By visually detecting the landing point of the water jet, closed-loop control of the extinguishing process can be achieved. However, achieving accurate jet landing point localization in complex environments, such as changes in ambient lighting, jet end divergence, and jet breakup, presents a challenging task. To address this, we propose a novel CIA-YOLOX (Channel Interaction Attention–You Only Look Once) model for the precise identification of water jet landing points in firefighting robots using unmanned aerial vehicle (UAV) visual information. First, the model introduces the Triplet Attention (TA) mechanism to capture feature dependencies across different dimensions, enriching feature information. Second, a module named Coordinate Attention Transformer (CA-Trans) is designed to establish long-range dependencies between directional feature vectors, enabling the extraction of precise positional information critical for accurate impact point prediction. Additionally, a Dual-branch Channel Interactive Attention Fusion (DCIAF) module is proposed to enhance feature representation capabilities by facilitating feature complementation through semantic modeling of channel interactions. Experimental results indicate that the proposed model surpasses current state-of-the-art methods in performance while maintaining low computational costs, confirming its efficacy. This approach enhances the robot's ability to perceive complex environments, providing valuable insights for implementing firefighting actions in real-world scenarios.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113364"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625006751","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Firefighting robots play a critical role in fire suppression. Ensuring the water stream precisely hits the target during autonomous fire extinguishing is of paramount importance. By visually detecting the landing point of the water jet, closed-loop control of the extinguishing process can be achieved. However, achieving accurate jet landing point localization in complex environments, such as changes in ambient lighting, jet end divergence, and jet breakup, presents a challenging task. To address this, we propose a novel CIA-YOLOX (Channel Interaction Attention–You Only Look Once) model for the precise identification of water jet landing points in firefighting robots using unmanned aerial vehicle (UAV) visual information. First, the model introduces the Triplet Attention (TA) mechanism to capture feature dependencies across different dimensions, enriching feature information. Second, a module named Coordinate Attention Transformer (CA-Trans) is designed to establish long-range dependencies between directional feature vectors, enabling the extraction of precise positional information critical for accurate impact point prediction. Additionally, a Dual-branch Channel Interactive Attention Fusion (DCIAF) module is proposed to enhance feature representation capabilities by facilitating feature complementation through semantic modeling of channel interactions. Experimental results indicate that the proposed model surpasses current state-of-the-art methods in performance while maintaining low computational costs, confirming its efficacy. This approach enhances the robot's ability to perceive complex environments, providing valuable insights for implementing firefighting actions in real-world scenarios.

查看原文本刊更多论文

消防机器人水射流检测的融合通道交互关注网络

消防机器人在灭火中发挥着重要作用。在自动灭火过程中，确保水流精确地击中目标是至关重要的。通过目测水柱落点，实现灭火过程的闭环控制。然而，在复杂的环境中，如环境光照变化、射流末端发散和射流破裂等，实现精确的射流着陆点定位是一项具有挑战性的任务。为了解决这个问题，我们提出了一种新的CIA-YOLOX （Channel Interaction Attention-You Only Look Once）模型，用于利用无人机（UAV）视觉信息精确识别消防机器人的喷水着陆点。首先，该模型引入了三重关注（Triplet Attention， TA）机制来捕获不同维度的特征依赖关系，丰富了特征信息。其次，设计了一个名为坐标注意力转换器（CA-Trans）的模块，用于建立方向特征向量之间的远程依赖关系，从而能够提取精确的位置信息，这对于准确预测撞击点至关重要。此外，提出了双分支通道交互注意融合（DCIAF）模块，通过通道交互的语义建模促进特征互补，增强特征表示能力。实验结果表明，该模型在保持较低的计算成本的同时，在性能上超越了目前最先进的方法，证实了其有效性。这种方法增强了机器人感知复杂环境的能力，为在现实场景中实施消防行动提供了有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.