Shinobu Kudo, Shota Orihashi, Ryuichi Tanida, A. Shimizu
{"title":"基于互信息最大化正则化的gan图像压缩","authors":"Shinobu Kudo, Shota Orihashi, Ryuichi Tanida, A. Shimizu","doi":"10.1109/PCS48520.2019.8954548","DOIUrl":null,"url":null,"abstract":"Recently, image compression systems based on convolutional neural networks that use flexible nonlinear analysis and synthesis transformations have been developed to improve the restoration accuracy of decoded images. A method using a framework called a generative adversarial network [1] has been reported as one of the methods aiming to improve the subjective image quality [2][3]. It optimizes the distribution of restored images to be close to that of natural images; thus it suppresses visual artifacts such as blurring, ringing, and blocking. However, since methods of this type are optimized to focus on whether the restored image is subjectively natural or not, components that are not correlated with the original image are mixed in the coding features obtained from the encoder. Thus, even though the appearance looks natural, it may be subjectively seen as a different object from the original image or the impression may be changed.In this paper, we describe a method we have developed to maximize mutual information between the coding features and the restored images. This method, which we call \"regularization\", makes it possible to develop image compression systems that suppress appearance differences with subjective naturalness.","PeriodicalId":237809,"journal":{"name":"2019 Picture Coding Symposium (PCS)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"GAN-based Image Compression Using Mutual Information Maximizing Regularization\",\"authors\":\"Shinobu Kudo, Shota Orihashi, Ryuichi Tanida, A. Shimizu\",\"doi\":\"10.1109/PCS48520.2019.8954548\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, image compression systems based on convolutional neural networks that use flexible nonlinear analysis and synthesis transformations have been developed to improve the restoration accuracy of decoded images. A method using a framework called a generative adversarial network [1] has been reported as one of the methods aiming to improve the subjective image quality [2][3]. It optimizes the distribution of restored images to be close to that of natural images; thus it suppresses visual artifacts such as blurring, ringing, and blocking. However, since methods of this type are optimized to focus on whether the restored image is subjectively natural or not, components that are not correlated with the original image are mixed in the coding features obtained from the encoder. Thus, even though the appearance looks natural, it may be subjectively seen as a different object from the original image or the impression may be changed.In this paper, we describe a method we have developed to maximize mutual information between the coding features and the restored images. This method, which we call \\\"regularization\\\", makes it possible to develop image compression systems that suppress appearance differences with subjective naturalness.\",\"PeriodicalId\":237809,\"journal\":{\"name\":\"2019 Picture Coding Symposium (PCS)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Picture Coding Symposium (PCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PCS48520.2019.8954548\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Picture Coding Symposium (PCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PCS48520.2019.8954548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
GAN-based Image Compression Using Mutual Information Maximizing Regularization
Recently, image compression systems based on convolutional neural networks that use flexible nonlinear analysis and synthesis transformations have been developed to improve the restoration accuracy of decoded images. A method using a framework called a generative adversarial network [1] has been reported as one of the methods aiming to improve the subjective image quality [2][3]. It optimizes the distribution of restored images to be close to that of natural images; thus it suppresses visual artifacts such as blurring, ringing, and blocking. However, since methods of this type are optimized to focus on whether the restored image is subjectively natural or not, components that are not correlated with the original image are mixed in the coding features obtained from the encoder. Thus, even though the appearance looks natural, it may be subjectively seen as a different object from the original image or the impression may be changed.In this paper, we describe a method we have developed to maximize mutual information between the coding features and the restored images. This method, which we call "regularization", makes it possible to develop image compression systems that suppress appearance differences with subjective naturalness.