Victor Sanchez, Miguel Hernández-Cabronero, J. Serra-Sagristà
{"title":"基于在线学习的过拟合神经网络成像数据分块内预测","authors":"Victor Sanchez, Miguel Hernández-Cabronero, J. Serra-Sagristà","doi":"10.1109/mlsp52302.2021.9596526","DOIUrl":null,"url":null,"abstract":"Block-wise intra-prediction is a key technique used by modern video codecs to reduce the amount of data to be compressed. Recently, machine learning (ML) has successfully improved block-wise intra-prediction by employing neural networks. Notwithstanding, the performance of such ML-based methods depends on the amount, quality, and relevance of the training data. Furthermore, they require signalling the learned parameters into the bitstream to be able to reconstruct the original data after decompression, thus increasing bitrates. This work proposes a novel block-wise intra-prediction strategy based on fully connected neural networks (FC-NNs) that avoids the two aforementioned shortcomings within the context of lossless compression. To do so, shallow FC-NNs are used, whose parameters are refined in an on-line manner using only the data being predicted. This allows to accurately fit the FC-NNs to the data of interest and replicate the optimization process, avoiding signaling the learned parameters. Experimental results indicate that the proposed ML-based intra-prediction strategy can outperform the intra-prediction used by modern video codecs with prediction accuracy gains of up to 7.01 dB PSNR.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"222 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Block-Wise Intra-Prediction of Imaging Data Based on Overfitted Neural Networks with On-Line Learning\",\"authors\":\"Victor Sanchez, Miguel Hernández-Cabronero, J. Serra-Sagristà\",\"doi\":\"10.1109/mlsp52302.2021.9596526\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Block-wise intra-prediction is a key technique used by modern video codecs to reduce the amount of data to be compressed. Recently, machine learning (ML) has successfully improved block-wise intra-prediction by employing neural networks. Notwithstanding, the performance of such ML-based methods depends on the amount, quality, and relevance of the training data. Furthermore, they require signalling the learned parameters into the bitstream to be able to reconstruct the original data after decompression, thus increasing bitrates. This work proposes a novel block-wise intra-prediction strategy based on fully connected neural networks (FC-NNs) that avoids the two aforementioned shortcomings within the context of lossless compression. To do so, shallow FC-NNs are used, whose parameters are refined in an on-line manner using only the data being predicted. This allows to accurately fit the FC-NNs to the data of interest and replicate the optimization process, avoiding signaling the learned parameters. Experimental results indicate that the proposed ML-based intra-prediction strategy can outperform the intra-prediction used by modern video codecs with prediction accuracy gains of up to 7.01 dB PSNR.\",\"PeriodicalId\":156116,\"journal\":{\"name\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"volume\":\"222 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/mlsp52302.2021.9596526\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlsp52302.2021.9596526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
逐块内预测是现代视频编解码器用于减少压缩数据量的关键技术。最近,机器学习(ML)通过使用神经网络成功地改进了块内预测。尽管如此,这种基于ml的方法的性能取决于训练数据的数量、质量和相关性。此外,它们需要将学习到的参数发送到比特流中,以便能够在解压缩后重建原始数据,从而提高比特率。这项工作提出了一种基于全连接神经网络(FC-NNs)的新颖的分块内预测策略,该策略在无损压缩的背景下避免了上述两个缺点。为此,使用了浅层fc - nn,其参数仅使用预测数据以在线方式进行细化。这允许fc - nn准确地拟合感兴趣的数据并复制优化过程,避免向学习参数发出信号。实验结果表明,基于机器学习的内预测策略优于现代视频编解码器的内预测,预测精度增益高达7.01 dB PSNR。
Block-Wise Intra-Prediction of Imaging Data Based on Overfitted Neural Networks with On-Line Learning
Block-wise intra-prediction is a key technique used by modern video codecs to reduce the amount of data to be compressed. Recently, machine learning (ML) has successfully improved block-wise intra-prediction by employing neural networks. Notwithstanding, the performance of such ML-based methods depends on the amount, quality, and relevance of the training data. Furthermore, they require signalling the learned parameters into the bitstream to be able to reconstruct the original data after decompression, thus increasing bitrates. This work proposes a novel block-wise intra-prediction strategy based on fully connected neural networks (FC-NNs) that avoids the two aforementioned shortcomings within the context of lossless compression. To do so, shallow FC-NNs are used, whose parameters are refined in an on-line manner using only the data being predicted. This allows to accurately fit the FC-NNs to the data of interest and replicate the optimization process, avoiding signaling the learned parameters. Experimental results indicate that the proposed ML-based intra-prediction strategy can outperform the intra-prediction used by modern video codecs with prediction accuracy gains of up to 7.01 dB PSNR.