Ryugo Morita, Hitoshi Nishimura, Ko Watanabe, Andreas Dengel, Jinjia Zhou
{"title":"基于边缘去噪的图像压缩","authors":"Ryugo Morita, Hitoshi Nishimura, Ko Watanabe, Andreas Dengel, Jinjia Zhou","doi":"arxiv-2409.10978","DOIUrl":null,"url":null,"abstract":"In recent years, deep learning-based image compression, particularly through\ngenerative models, has emerged as a pivotal area of research. Despite\nsignificant advancements, challenges such as diminished sharpness and quality\nin reconstructed images, learning inefficiencies due to mode collapse, and data\nloss during transmission persist. To address these issues, we propose a novel\ncompression model that incorporates a denoising step with diffusion models,\nsignificantly enhancing image reconstruction fidelity by sub-information(e.g.,\nedge and depth) from leveraging latent space. Empirical experiments demonstrate\nthat our model achieves superior or comparable results in terms of image\nquality and compression efficiency when measured against the existing models.\nNotably, our model excels in scenarios of partial image loss or excessive noise\nby introducing an edge estimation network to preserve the integrity of\nreconstructed images, offering a robust solution to the current limitations of\nimage compression.","PeriodicalId":501289,"journal":{"name":"arXiv - EE - Image and Video Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Edge-based Denoising Image Compression\",\"authors\":\"Ryugo Morita, Hitoshi Nishimura, Ko Watanabe, Andreas Dengel, Jinjia Zhou\",\"doi\":\"arxiv-2409.10978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, deep learning-based image compression, particularly through\\ngenerative models, has emerged as a pivotal area of research. Despite\\nsignificant advancements, challenges such as diminished sharpness and quality\\nin reconstructed images, learning inefficiencies due to mode collapse, and data\\nloss during transmission persist. To address these issues, we propose a novel\\ncompression model that incorporates a denoising step with diffusion models,\\nsignificantly enhancing image reconstruction fidelity by sub-information(e.g.,\\nedge and depth) from leveraging latent space. Empirical experiments demonstrate\\nthat our model achieves superior or comparable results in terms of image\\nquality and compression efficiency when measured against the existing models.\\nNotably, our model excels in scenarios of partial image loss or excessive noise\\nby introducing an edge estimation network to preserve the integrity of\\nreconstructed images, offering a robust solution to the current limitations of\\nimage compression.\",\"PeriodicalId\":501289,\"journal\":{\"name\":\"arXiv - EE - Image and Video Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Image and Video Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.10978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Image and Video Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.10978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In recent years, deep learning-based image compression, particularly through
generative models, has emerged as a pivotal area of research. Despite
significant advancements, challenges such as diminished sharpness and quality
in reconstructed images, learning inefficiencies due to mode collapse, and data
loss during transmission persist. To address these issues, we propose a novel
compression model that incorporates a denoising step with diffusion models,
significantly enhancing image reconstruction fidelity by sub-information(e.g.,
edge and depth) from leveraging latent space. Empirical experiments demonstrate
that our model achieves superior or comparable results in terms of image
quality and compression efficiency when measured against the existing models.
Notably, our model excels in scenarios of partial image loss or excessive noise
by introducing an edge estimation network to preserve the integrity of
reconstructed images, offering a robust solution to the current limitations of
image compression.