{"title":"TSFD-Net: RSOD中任务和参数差异的两阶段特征解耦网络","authors":"Xinghui Song;Chunyi Chen;Gen Li;Yanan Liu;Donglin Jing;Jun Peng","doi":"10.1109/LGRS.2025.3597597","DOIUrl":null,"url":null,"abstract":"Deep learning excels in natural image object detection, but remote sensing images face challenges like multidirectional objects and neighborhood interference. Existing methods use shared features for classification and regression, causing task interference. Classification needs translation/rotation-invariant features, while regression requires translation/rotation-equivariant features. Additionally, regression parameters (e.g., center, shape, and angle) demand distinct feature properties. To address this, we propose TSFD-Net, featuring: 1) task differential decoupling module (TDDM): decouples task-specific features via parallel CNN-Transformer branches, and 2) parameter differential decoupling module (PDDM): designs specialized regressors for distinct parameters (e.g., angle versus center/shape). Together, TDDM and PDDM form the two-stage feature decoupling (TSFD) structure. We further introduce dynamic cascade activation masks (DCAMs), leveraging bounding box feedback to enhance target focus and suppress neighborhood noise. TSFD network (TSFD-Net) achieves state-of-the-art results on DOTA-v1.0 (81.37% mAP), validating its efficacy.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"TSFD-Net: Two-Stage Feature Decoupling Network for Task and Parameter Discrepancies in RSOD\",\"authors\":\"Xinghui Song;Chunyi Chen;Gen Li;Yanan Liu;Donglin Jing;Jun Peng\",\"doi\":\"10.1109/LGRS.2025.3597597\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning excels in natural image object detection, but remote sensing images face challenges like multidirectional objects and neighborhood interference. Existing methods use shared features for classification and regression, causing task interference. Classification needs translation/rotation-invariant features, while regression requires translation/rotation-equivariant features. Additionally, regression parameters (e.g., center, shape, and angle) demand distinct feature properties. To address this, we propose TSFD-Net, featuring: 1) task differential decoupling module (TDDM): decouples task-specific features via parallel CNN-Transformer branches, and 2) parameter differential decoupling module (PDDM): designs specialized regressors for distinct parameters (e.g., angle versus center/shape). Together, TDDM and PDDM form the two-stage feature decoupling (TSFD) structure. We further introduce dynamic cascade activation masks (DCAMs), leveraging bounding box feedback to enhance target focus and suppress neighborhood noise. TSFD network (TSFD-Net) achieves state-of-the-art results on DOTA-v1.0 (81.37% mAP), validating its efficacy.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11122464/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11122464/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
TSFD-Net: Two-Stage Feature Decoupling Network for Task and Parameter Discrepancies in RSOD
Deep learning excels in natural image object detection, but remote sensing images face challenges like multidirectional objects and neighborhood interference. Existing methods use shared features for classification and regression, causing task interference. Classification needs translation/rotation-invariant features, while regression requires translation/rotation-equivariant features. Additionally, regression parameters (e.g., center, shape, and angle) demand distinct feature properties. To address this, we propose TSFD-Net, featuring: 1) task differential decoupling module (TDDM): decouples task-specific features via parallel CNN-Transformer branches, and 2) parameter differential decoupling module (PDDM): designs specialized regressors for distinct parameters (e.g., angle versus center/shape). Together, TDDM and PDDM form the two-stage feature decoupling (TSFD) structure. We further introduce dynamic cascade activation masks (DCAMs), leveraging bounding box feedback to enhance target focus and suppress neighborhood noise. TSFD network (TSFD-Net) achieves state-of-the-art results on DOTA-v1.0 (81.37% mAP), validating its efficacy.