{"title":"CHGAFF-YOLO:用于海洋内波实时检测的级联混合全局自适应特征融合框架","authors":"Xianwei Huo;He Gao;Baoxiang Huang;Ge Chen","doi":"10.1109/LGRS.2025.3577597","DOIUrl":null,"url":null,"abstract":"Internal waves (IWs) are widely present in the global ocean and play a crucial role in ocean dynamics, material transport, and climate change research. However, due to the large-scale variations and complex morphological features of IW objects, existing detection methods have limitations in cross-scale feature extraction and information fusion. To address the aforementioned issues, this letter proposes a cascade hybrid global adaptive feature fusion you only look once (CHGAFF-YOLO) model for ocean IW detection. First, we employ the HGNetV2 structure as the backbone network of YOLO to better capture global information and extract complex features in a lightweight manner. Second, we introduce a Cascade Hybrid Cross-head Attention Module (CHCAM), which integrates in-group cascade attention with cross-head parallel self-attention mechanisms to achieve optimized multiscale feature extraction and enhance global feature representation capability. Finally, we design a Four-Head Adaptive Feature Fusion Module (FHAFF), which dynamically fuses feature information from different scales by constructing four detection heads, further enhancing cross-scale information interaction. Extensive experimental results show that our method significantly outperforms existing approaches on both the Sentinel-1 synthetic aperture radar (SAR) and MODIS satellite IW remote sensing datasets.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CHGAFF-YOLO: A Cascade Hybrid Global Adaptive Feature Fusion Framework for Real-Time Ocean Internal Wave Detection\",\"authors\":\"Xianwei Huo;He Gao;Baoxiang Huang;Ge Chen\",\"doi\":\"10.1109/LGRS.2025.3577597\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Internal waves (IWs) are widely present in the global ocean and play a crucial role in ocean dynamics, material transport, and climate change research. However, due to the large-scale variations and complex morphological features of IW objects, existing detection methods have limitations in cross-scale feature extraction and information fusion. To address the aforementioned issues, this letter proposes a cascade hybrid global adaptive feature fusion you only look once (CHGAFF-YOLO) model for ocean IW detection. First, we employ the HGNetV2 structure as the backbone network of YOLO to better capture global information and extract complex features in a lightweight manner. Second, we introduce a Cascade Hybrid Cross-head Attention Module (CHCAM), which integrates in-group cascade attention with cross-head parallel self-attention mechanisms to achieve optimized multiscale feature extraction and enhance global feature representation capability. Finally, we design a Four-Head Adaptive Feature Fusion Module (FHAFF), which dynamically fuses feature information from different scales by constructing four detection heads, further enhancing cross-scale information interaction. Extensive experimental results show that our method significantly outperforms existing approaches on both the Sentinel-1 synthetic aperture radar (SAR) and MODIS satellite IW remote sensing datasets.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11028055/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11028055/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CHGAFF-YOLO: A Cascade Hybrid Global Adaptive Feature Fusion Framework for Real-Time Ocean Internal Wave Detection
Internal waves (IWs) are widely present in the global ocean and play a crucial role in ocean dynamics, material transport, and climate change research. However, due to the large-scale variations and complex morphological features of IW objects, existing detection methods have limitations in cross-scale feature extraction and information fusion. To address the aforementioned issues, this letter proposes a cascade hybrid global adaptive feature fusion you only look once (CHGAFF-YOLO) model for ocean IW detection. First, we employ the HGNetV2 structure as the backbone network of YOLO to better capture global information and extract complex features in a lightweight manner. Second, we introduce a Cascade Hybrid Cross-head Attention Module (CHCAM), which integrates in-group cascade attention with cross-head parallel self-attention mechanisms to achieve optimized multiscale feature extraction and enhance global feature representation capability. Finally, we design a Four-Head Adaptive Feature Fusion Module (FHAFF), which dynamically fuses feature information from different scales by constructing four detection heads, further enhancing cross-scale information interaction. Extensive experimental results show that our method significantly outperforms existing approaches on both the Sentinel-1 synthetic aperture radar (SAR) and MODIS satellite IW remote sensing datasets.