A large-scale remote sensing scene dataset construction for semantic segmentation

IF 1.8 Q3 REMOTE SENSING

International Journal of Image and Data Fusion Pub Date : 2023-04-10 DOI:10.1080/19479832.2023.2199005

LeiLei Xu, Shanqiu Shi, Yujun Liu, Hao Zhang, Dan Wang, Lu Zhang, Wan Liang, Hao Chen

{"title":"A large-scale remote sensing scene dataset construction for semantic segmentation","authors":"LeiLei Xu, Shanqiu Shi, Yujun Liu, Hao Zhang, Dan Wang, Lu Zhang, Wan Liang, Hao Chen","doi":"10.1080/19479832.2023.2199005","DOIUrl":null,"url":null,"abstract":"ABSTRACT As fuelled by the advancement of deep learning for computer vision tasks, its application in other fields has been boosted. This technology has been increasingly applied to the interpretation of remote sensing image, showing high potential economic and societal significance, such as automatically mapping land cover. However, the model requires a considerable number of samples for training, and it is now adversely affected by the lack of a large-scale dataset. Moreover, labelling samples is a time-consuming and laborious task, and a complete land classification system suitable for deep learning has not been established. This limitation hinders the development and application of deep learning. To meet the data needs of deep learning in the field of remote sensing, this study develops JSsampleP, a large-scale dataset for segmentation, generating 110,170 data samples that cover various categories of scenes within Jiangsu Province, China. The existing Geographical Condition Dataset (GCD) and Basic Surveying and Mapping Dataset (BSMD) in Jiangsu were fully utilised, significantly reducing the cost of labelling samples. Furthermore, the samples were subject to a rigorous cleaning process to ensure data quality. Finally, the accuracy of the dataset is verified using the U-Net model, and the future version will be optimised continuously.","PeriodicalId":46012,"journal":{"name":"International Journal of Image and Data Fusion","volume":"14 1","pages":"299 - 323"},"PeriodicalIF":1.8000,"publicationDate":"2023-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Image and Data Fusion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/19479832.2023.2199005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

Abstract

ABSTRACT As fuelled by the advancement of deep learning for computer vision tasks, its application in other fields has been boosted. This technology has been increasingly applied to the interpretation of remote sensing image, showing high potential economic and societal significance, such as automatically mapping land cover. However, the model requires a considerable number of samples for training, and it is now adversely affected by the lack of a large-scale dataset. Moreover, labelling samples is a time-consuming and laborious task, and a complete land classification system suitable for deep learning has not been established. This limitation hinders the development and application of deep learning. To meet the data needs of deep learning in the field of remote sensing, this study develops JSsampleP, a large-scale dataset for segmentation, generating 110,170 data samples that cover various categories of scenes within Jiangsu Province, China. The existing Geographical Condition Dataset (GCD) and Basic Surveying and Mapping Dataset (BSMD) in Jiangsu were fully utilised, significantly reducing the cost of labelling samples. Furthermore, the samples were subject to a rigorous cleaning process to ensure data quality. Finally, the accuracy of the dataset is verified using the U-Net model, and the future version will be optimised continuously.

查看原文本刊更多论文

用于语义分割的大规模遥感场景数据集构建

摘要随着计算机视觉任务深度学习的发展，它在其他领域的应用也得到了推动。这项技术越来越多地应用于遥感图像的解释，显示出很高的潜在经济和社会意义，例如自动绘制土地覆盖图。然而，该模型需要大量的样本进行训练，而且由于缺乏大规模数据集，它现在受到了不利影响。此外，标记样本是一项耗时费力的任务，而且还没有建立一个适合深度学习的完整土地分类系统。这种局限性阻碍了深度学习的发展和应用。为了满足遥感领域深度学习的数据需求，本研究开发了用于分割的大型数据集JSsampleP，生成了110170个数据样本，覆盖了中国江苏省的各类场景。充分利用了江苏现有的地理条件数据集（GCD）和基础测绘数据集（BSMD），显著降低了样本标签成本。此外，为了确保数据质量，对样本进行了严格的清洁处理。最后，使用U-Net模型验证了数据集的准确性，未来的版本将不断优化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Image and Data Fusion REMOTE SENSING-

CiteScore

5.00

自引率

0.00%

发文量

期刊介绍： International Journal of Image and Data Fusion provides a single source of information for all aspects of image and data fusion methodologies, developments, techniques and applications. Image and data fusion techniques are important for combining the many sources of satellite, airborne and ground based imaging systems, and integrating these with other related data sets for enhanced information extraction and decision making. Image and data fusion aims at the integration of multi-sensor, multi-temporal, multi-resolution and multi-platform image data, together with geospatial data, GIS, in-situ, and other statistical data sets for improved information extraction, as well as to increase the reliability of the information. This leads to more accurate information that provides for robust operational performance, i.e. increased confidence, reduced ambiguity and improved classification enabling evidence based management. The journal welcomes original research papers, review papers, shorter letters, technical articles, book reviews and conference reports in all areas of image and data fusion including, but not limited to, the following aspects and topics: • Automatic registration/geometric aspects of fusing images with different spatial, spectral, temporal resolutions; phase information; or acquired in different modes • Pixel, feature and decision level fusion algorithms and methodologies • Data Assimilation: fusing data with models • Multi-source classification and information extraction • Integration of satellite, airborne and terrestrial sensor systems • Fusing temporal data sets for change detection studies (e.g. for Land Cover/Land Use Change studies) • Image and data mining from multi-platform, multi-source, multi-scale, multi-temporal data sets (e.g. geometric information, topological information, statistical information, etc.).