Juxian Zhao , Wei Li , Jinsong Zhu , Zhigang Gao , Lu Pan , Zhongguan Liu
{"title":"Fusion channel interaction attention network for water jet detection in firefighting robots","authors":"Juxian Zhao , Wei Li , Jinsong Zhu , Zhigang Gao , Lu Pan , Zhongguan Liu","doi":"10.1016/j.asoc.2025.113364","DOIUrl":null,"url":null,"abstract":"<div><div>Firefighting robots play a critical role in fire suppression. Ensuring the water stream precisely hits the target during autonomous fire extinguishing is of paramount importance. By visually detecting the landing point of the water jet, closed-loop control of the extinguishing process can be achieved. However, achieving accurate jet landing point localization in complex environments, such as changes in ambient lighting, jet end divergence, and jet breakup, presents a challenging task. To address this, we propose a novel CIA-YOLOX (Channel Interaction Attention–You Only Look Once) model for the precise identification of water jet landing points in firefighting robots using unmanned aerial vehicle (UAV) visual information. First, the model introduces the Triplet Attention (TA) mechanism to capture feature dependencies across different dimensions, enriching feature information. Second, a module named Coordinate Attention Transformer (CA-Trans) is designed to establish long-range dependencies between directional feature vectors, enabling the extraction of precise positional information critical for accurate impact point prediction. Additionally, a Dual-branch Channel Interactive Attention Fusion (DCIAF) module is proposed to enhance feature representation capabilities by facilitating feature complementation through semantic modeling of channel interactions. Experimental results indicate that the proposed model surpasses current state-of-the-art methods in performance while maintaining low computational costs, confirming its efficacy. This approach enhances the robot's ability to perceive complex environments, providing valuable insights for implementing firefighting actions in real-world scenarios.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"178 ","pages":"Article 113364"},"PeriodicalIF":7.2000,"publicationDate":"2025-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625006751","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Firefighting robots play a critical role in fire suppression. Ensuring the water stream precisely hits the target during autonomous fire extinguishing is of paramount importance. By visually detecting the landing point of the water jet, closed-loop control of the extinguishing process can be achieved. However, achieving accurate jet landing point localization in complex environments, such as changes in ambient lighting, jet end divergence, and jet breakup, presents a challenging task. To address this, we propose a novel CIA-YOLOX (Channel Interaction Attention–You Only Look Once) model for the precise identification of water jet landing points in firefighting robots using unmanned aerial vehicle (UAV) visual information. First, the model introduces the Triplet Attention (TA) mechanism to capture feature dependencies across different dimensions, enriching feature information. Second, a module named Coordinate Attention Transformer (CA-Trans) is designed to establish long-range dependencies between directional feature vectors, enabling the extraction of precise positional information critical for accurate impact point prediction. Additionally, a Dual-branch Channel Interactive Attention Fusion (DCIAF) module is proposed to enhance feature representation capabilities by facilitating feature complementation through semantic modeling of channel interactions. Experimental results indicate that the proposed model surpasses current state-of-the-art methods in performance while maintaining low computational costs, confirming its efficacy. This approach enhances the robot's ability to perceive complex environments, providing valuable insights for implementing firefighting actions in real-world scenarios.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.