Tao Wang;Hui Wang;Yunli Zhu;Xinang Fan;Guoliang Luo
{"title":"蒸馏-发酵双重处理红外-可见光目标检测","authors":"Tao Wang;Hui Wang;Yunli Zhu;Xinang Fan;Guoliang Luo","doi":"10.1109/LSP.2025.3610025","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel dual-processing framework for infrared-visible object detection, inspired by the fermentation-distillation paradigm in traditional Chinese liquor brewing. To address the complementary characteristics of RGB and thermal modalities, we first design a Dual-stage Feature Complementary Fusion module (DFCF) that sequentially performs coarse and fine processing on cross-modal features. Subsequently, a Polymorphic Convolution module (PCM) is developed by extending the YOLOv11 architecture with variable kernels and channel separation strategies. Furthermore, an Adaptive Semantic Aggregation module (ASA) effectively integrates shallow boundary details with deep semantic features. Extensive experiments on multiple datasets demonstrate that our method achieves superior performance compared to widely adopted approaches, with particularly significant improvements in challenging scenarios like low-light conditions. The ablation studies validate the contributions of each proposed component.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"3680-3684"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Infrared-Visible Object Detection via Distillation-Fermentation Dual Processing\",\"authors\":\"Tao Wang;Hui Wang;Yunli Zhu;Xinang Fan;Guoliang Luo\",\"doi\":\"10.1109/LSP.2025.3610025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel dual-processing framework for infrared-visible object detection, inspired by the fermentation-distillation paradigm in traditional Chinese liquor brewing. To address the complementary characteristics of RGB and thermal modalities, we first design a Dual-stage Feature Complementary Fusion module (DFCF) that sequentially performs coarse and fine processing on cross-modal features. Subsequently, a Polymorphic Convolution module (PCM) is developed by extending the YOLOv11 architecture with variable kernels and channel separation strategies. Furthermore, an Adaptive Semantic Aggregation module (ASA) effectively integrates shallow boundary details with deep semantic features. Extensive experiments on multiple datasets demonstrate that our method achieves superior performance compared to widely adopted approaches, with particularly significant improvements in challenging scenarios like low-light conditions. The ablation studies validate the contributions of each proposed component.\",\"PeriodicalId\":13154,\"journal\":{\"name\":\"IEEE Signal Processing Letters\",\"volume\":\"32 \",\"pages\":\"3680-3684\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Signal Processing Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11164181/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11164181/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Infrared-Visible Object Detection via Distillation-Fermentation Dual Processing
This paper proposes a novel dual-processing framework for infrared-visible object detection, inspired by the fermentation-distillation paradigm in traditional Chinese liquor brewing. To address the complementary characteristics of RGB and thermal modalities, we first design a Dual-stage Feature Complementary Fusion module (DFCF) that sequentially performs coarse and fine processing on cross-modal features. Subsequently, a Polymorphic Convolution module (PCM) is developed by extending the YOLOv11 architecture with variable kernels and channel separation strategies. Furthermore, an Adaptive Semantic Aggregation module (ASA) effectively integrates shallow boundary details with deep semantic features. Extensive experiments on multiple datasets demonstrate that our method achieves superior performance compared to widely adopted approaches, with particularly significant improvements in challenging scenarios like low-light conditions. The ablation studies validate the contributions of each proposed component.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.