{"title":"Unsupervised conversion method of high bit-depth remote sensing images using contrastive learning","authors":"Tengda Zhang , Jiguang Dai , Jinsong Cheng","doi":"10.1016/j.knosys.2025.113954","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, remote sensing images are frequently stored in high bit-depth formats exceeding 10 bits. However, the standard 8-bit format remains the fundamental data format for visualization and deep learning applications. Traditional methods typically rely on manually adjusting the parameter threshold of the tone mapping operator to obtain 8-bit images, resulting in low automation. Although tone mapping methods based on deep learning have gradually supplanted traditional techniques, but such methods are mainly aimed at natural scene images taken by digital cameras. There are problems such as incompatibility between data format and image semantics, and it is difficult to meet the scale dependence of remote sensing image applications. To address these challenges, we propose an unsupervised bit-depth conversion method for remote sensing images that integrates generative adversarial networks with contrastive learning. We draw an analogy between gray value mapping and the motion of thermal field particles, constructing a transformer generator based on thermodynamic principles. Leveraging the analogous characteristics of high and low bit-depth image histograms, we introduce a histogram shape context contrastive loss to regulate the color distribution of the generated images. Furthermore, in light of the large-scale application characteristics of remote sensing images, we propose a post-processing method based on hybrid histogram matching to enhance image quality while generating seamless whole-scene images. We developed relevant datasets and conducted experiments, with results demonstrating that the proposed method achieves superior bit-depth conversion effects compared to existing methods. Code and data can be found at <span><span>https://github.com/ZzzTD/Bit-depth_conversion</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"324 ","pages":"Article 113954"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125009992","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, remote sensing images are frequently stored in high bit-depth formats exceeding 10 bits. However, the standard 8-bit format remains the fundamental data format for visualization and deep learning applications. Traditional methods typically rely on manually adjusting the parameter threshold of the tone mapping operator to obtain 8-bit images, resulting in low automation. Although tone mapping methods based on deep learning have gradually supplanted traditional techniques, but such methods are mainly aimed at natural scene images taken by digital cameras. There are problems such as incompatibility between data format and image semantics, and it is difficult to meet the scale dependence of remote sensing image applications. To address these challenges, we propose an unsupervised bit-depth conversion method for remote sensing images that integrates generative adversarial networks with contrastive learning. We draw an analogy between gray value mapping and the motion of thermal field particles, constructing a transformer generator based on thermodynamic principles. Leveraging the analogous characteristics of high and low bit-depth image histograms, we introduce a histogram shape context contrastive loss to regulate the color distribution of the generated images. Furthermore, in light of the large-scale application characteristics of remote sensing images, we propose a post-processing method based on hybrid histogram matching to enhance image quality while generating seamless whole-scene images. We developed relevant datasets and conducted experiments, with results demonstrating that the proposed method achieves superior bit-depth conversion effects compared to existing methods. Code and data can be found at https://github.com/ZzzTD/Bit-depth_conversion.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.