Enrique Portalés-Julià , Gonzalo Mateo-García , Luis Gómez-Chova
{"title":"Understanding flood detection models across Sentinel-1 and Sentinel-2 modalities and benchmark datasets","authors":"Enrique Portalés-Julià , Gonzalo Mateo-García , Luis Gómez-Chova","doi":"10.1016/j.rse.2025.114882","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, research in flood mapping from remote sensing satellite imagery has predominantly focused on deep learning methods. While new flood segmentation models are increasingly being proposed, most of these works focus on advancing architectures trained on single datasets. Therefore, these studies overlook the intrinsic limitations and biases of the available training and evaluation data. This often leads to poor generalization and overconfident predictions when these models are used in real-world scenarios. To address this gap, the objective of this work is twofold. First, we train and evaluate flood segmentation models on five publicly available datasets including data from Sentinel-1, Sentinel-2, and both SAR and multispectral modalities. Our findings reveal that models achieving high detection accuracy on a single dataset (intra-dataset validation) do not necessarily generalize well to unseen datasets. In contrast, models trained on more diverse samples from multiple datasets demonstrate greater robustness and generalization ability. Furthermore, we present a dual-stream multimodal architecture that can be independently trained and tested on both single-modality and dual-modality datasets. This enables the integration of all the diversity and richness of the available data into a single unified framework. The results emphasize the need for a more comprehensive validation using diverse and well-designed datasets, particularly for multimodal approaches. If not adequately addressed, the shortcomings of current datasets can significantly limit the potential of deep learning-based operational flood mapping approaches.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"328 ","pages":""},"PeriodicalIF":11.1000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003442572500286X","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, research in flood mapping from remote sensing satellite imagery has predominantly focused on deep learning methods. While new flood segmentation models are increasingly being proposed, most of these works focus on advancing architectures trained on single datasets. Therefore, these studies overlook the intrinsic limitations and biases of the available training and evaluation data. This often leads to poor generalization and overconfident predictions when these models are used in real-world scenarios. To address this gap, the objective of this work is twofold. First, we train and evaluate flood segmentation models on five publicly available datasets including data from Sentinel-1, Sentinel-2, and both SAR and multispectral modalities. Our findings reveal that models achieving high detection accuracy on a single dataset (intra-dataset validation) do not necessarily generalize well to unseen datasets. In contrast, models trained on more diverse samples from multiple datasets demonstrate greater robustness and generalization ability. Furthermore, we present a dual-stream multimodal architecture that can be independently trained and tested on both single-modality and dual-modality datasets. This enables the integration of all the diversity and richness of the available data into a single unified framework. The results emphasize the need for a more comprehensive validation using diverse and well-designed datasets, particularly for multimodal approaches. If not adequately addressed, the shortcomings of current datasets can significantly limit the potential of deep learning-based operational flood mapping approaches.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.