{"title":"用于遥感语义分割的高效JPEG-AI图像编码","authors":"Junxi Zhang;Xiang Pan;Zhenzhong Chen;Shan Liu","doi":"10.1109/LGRS.2025.3596235","DOIUrl":null,"url":null,"abstract":"Efficient image compression is crucial for remote sensing (RS) satellite systems, as it determines the performance of machine vision applications analyzing the downlinked image data at ground stations. However, existing conventional or learning-based image compression approaches exhibit limitations in either high complexity or undesirable vision task performance. This letter investigates an efficient neural image compression standard, JPEG-AI-based self-supervised RS image compression approach, and SS-JPEG-AI, for semantic segmentation tasks. Our approach maintains the low-complexity advantages of JPEG-AI while incorporating: 1) a computationally efficient transformer-based attention mechanism that enhances reconstruction quality without increasing encoder complexity for onboard systems and 2) a contrastive learning strategy that improves feature discriminability and sharpens intercategory decision boundaries for segmentation tasks. Compared to the state-of-the-art image compression methods, SS-JPEG-AI achieves better Bjøntegaard delta-rate (BD-rate) performance across the mean intersection over union (mIoU) and mean F-score (mFscore) while maintaining up to <inline-formula> <tex-math>$30\\times $ </tex-math></inline-formula> smaller computational complexity.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient JPEG-AI Image Coding for Remote Sensing Semantic Segmentation\",\"authors\":\"Junxi Zhang;Xiang Pan;Zhenzhong Chen;Shan Liu\",\"doi\":\"10.1109/LGRS.2025.3596235\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Efficient image compression is crucial for remote sensing (RS) satellite systems, as it determines the performance of machine vision applications analyzing the downlinked image data at ground stations. However, existing conventional or learning-based image compression approaches exhibit limitations in either high complexity or undesirable vision task performance. This letter investigates an efficient neural image compression standard, JPEG-AI-based self-supervised RS image compression approach, and SS-JPEG-AI, for semantic segmentation tasks. Our approach maintains the low-complexity advantages of JPEG-AI while incorporating: 1) a computationally efficient transformer-based attention mechanism that enhances reconstruction quality without increasing encoder complexity for onboard systems and 2) a contrastive learning strategy that improves feature discriminability and sharpens intercategory decision boundaries for segmentation tasks. Compared to the state-of-the-art image compression methods, SS-JPEG-AI achieves better Bjøntegaard delta-rate (BD-rate) performance across the mean intersection over union (mIoU) and mean F-score (mFscore) while maintaining up to <inline-formula> <tex-math>$30\\\\times $ </tex-math></inline-formula> smaller computational complexity.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11115155/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11115155/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Efficient JPEG-AI Image Coding for Remote Sensing Semantic Segmentation
Efficient image compression is crucial for remote sensing (RS) satellite systems, as it determines the performance of machine vision applications analyzing the downlinked image data at ground stations. However, existing conventional or learning-based image compression approaches exhibit limitations in either high complexity or undesirable vision task performance. This letter investigates an efficient neural image compression standard, JPEG-AI-based self-supervised RS image compression approach, and SS-JPEG-AI, for semantic segmentation tasks. Our approach maintains the low-complexity advantages of JPEG-AI while incorporating: 1) a computationally efficient transformer-based attention mechanism that enhances reconstruction quality without increasing encoder complexity for onboard systems and 2) a contrastive learning strategy that improves feature discriminability and sharpens intercategory decision boundaries for segmentation tasks. Compared to the state-of-the-art image compression methods, SS-JPEG-AI achieves better Bjøntegaard delta-rate (BD-rate) performance across the mean intersection over union (mIoU) and mean F-score (mFscore) while maintaining up to $30\times $ smaller computational complexity.