面向遥感语义分割的加速扩散模型

IF 7.6 Q1 REMOTE SENSING
Ruben Pascual , Christian Ayala , Ruben Sesma , Aranzazu Jurio , Daniel Paternain , Mikel Galar
{"title":"面向遥感语义分割的加速扩散模型","authors":"Ruben Pascual ,&nbsp;Christian Ayala ,&nbsp;Ruben Sesma ,&nbsp;Aranzazu Jurio ,&nbsp;Daniel Paternain ,&nbsp;Mikel Galar","doi":"10.1016/j.jag.2025.104636","DOIUrl":null,"url":null,"abstract":"<div><div>Denoising Diffusion Probabilistic Models (DDPMs) have demonstrated exceptional potential across various generative modeling tasks. Despite evident promise in semantic segmentation, their adoption for remote sensing remains limited primarily due to computationally demanding inference. While initial approaches using DDPMs in remote sensing achieve competitive accuracy with state-of-the-art models, the multi-step nature of their image generation process poses a major bottleneck. To address this limitation, this paper investigates three key strategies for accelerating inference: (1) optimizing training and inference steps, (2) applying DDPM acceleration techniques adapted to segmentation task (including Denoising Diffusion Implicit Models, Improved Denoising Diffusion Models, and Progressive Distillation), and (3) thoroughly analyzing the trade-off between accuracy improvement and additional inference time when using test-time augmentation. These strategies are extensively tested with two established remote sensing semantic segmentation datasets focused on buildings and roads. Finally, we compare the optimized diffusion-based model with state-of-the-art convolutional-based models in terms of accuracy and inference times, showing the narrowing gap between both approaches and the increasing viability of diffusion-based segmentation for practical applications.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"142 ","pages":"Article 104636"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Speeding-up diffusion models for remote sensing semantic segmentation\",\"authors\":\"Ruben Pascual ,&nbsp;Christian Ayala ,&nbsp;Ruben Sesma ,&nbsp;Aranzazu Jurio ,&nbsp;Daniel Paternain ,&nbsp;Mikel Galar\",\"doi\":\"10.1016/j.jag.2025.104636\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Denoising Diffusion Probabilistic Models (DDPMs) have demonstrated exceptional potential across various generative modeling tasks. Despite evident promise in semantic segmentation, their adoption for remote sensing remains limited primarily due to computationally demanding inference. While initial approaches using DDPMs in remote sensing achieve competitive accuracy with state-of-the-art models, the multi-step nature of their image generation process poses a major bottleneck. To address this limitation, this paper investigates three key strategies for accelerating inference: (1) optimizing training and inference steps, (2) applying DDPM acceleration techniques adapted to segmentation task (including Denoising Diffusion Implicit Models, Improved Denoising Diffusion Models, and Progressive Distillation), and (3) thoroughly analyzing the trade-off between accuracy improvement and additional inference time when using test-time augmentation. These strategies are extensively tested with two established remote sensing semantic segmentation datasets focused on buildings and roads. Finally, we compare the optimized diffusion-based model with state-of-the-art convolutional-based models in terms of accuracy and inference times, showing the narrowing gap between both approaches and the increasing viability of diffusion-based segmentation for practical applications.</div></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":\"142 \",\"pages\":\"Article 104636\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843225002833\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225002833","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0

摘要

消噪扩散概率模型(ddpm)在各种生成建模任务中表现出了非凡的潜力。尽管语义分割有明显的前景,但它们在遥感中的应用仍然有限,主要是由于计算要求的推理。虽然在遥感中使用ddpm的初始方法与最先进的模型相比具有竞争力的精度,但其图像生成过程的多步骤性质构成了主要瓶颈。为了解决这一限制,本文研究了加速推理的三个关键策略:(1)优化训练和推理步骤;(2)应用适合分割任务的DDPM加速技术(包括去噪扩散隐式模型、改进的去噪扩散模型和递进蒸馏);(3)深入分析使用测试时间增强时精度提高和额外推理时间之间的权衡。这些策略在两个已建立的以建筑物和道路为重点的遥感语义分割数据集上进行了广泛的测试。最后,我们将优化的基于扩散的模型与最先进的基于卷积的模型在准确性和推理时间方面进行了比较,表明两种方法之间的差距越来越小,并且基于扩散的分割在实际应用中的可行性越来越高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Speeding-up diffusion models for remote sensing semantic segmentation
Denoising Diffusion Probabilistic Models (DDPMs) have demonstrated exceptional potential across various generative modeling tasks. Despite evident promise in semantic segmentation, their adoption for remote sensing remains limited primarily due to computationally demanding inference. While initial approaches using DDPMs in remote sensing achieve competitive accuracy with state-of-the-art models, the multi-step nature of their image generation process poses a major bottleneck. To address this limitation, this paper investigates three key strategies for accelerating inference: (1) optimizing training and inference steps, (2) applying DDPM acceleration techniques adapted to segmentation task (including Denoising Diffusion Implicit Models, Improved Denoising Diffusion Models, and Progressive Distillation), and (3) thoroughly analyzing the trade-off between accuracy improvement and additional inference time when using test-time augmentation. These strategies are extensively tested with two established remote sensing semantic segmentation datasets focused on buildings and roads. Finally, we compare the optimized diffusion-based model with state-of-the-art convolutional-based models in terms of accuracy and inference times, showing the narrowing gap between both approaches and the increasing viability of diffusion-based segmentation for practical applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International journal of applied earth observation and geoinformation : ITC journal
International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences
CiteScore
12.00
自引率
0.00%
发文量
0
审稿时长
77 days
期刊介绍: The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信