{"title":"基于神经网络的图像数据内预测图变换","authors":"Debaleena Roy, T. Guha, V. Sanchez","doi":"10.1109/mlsp52302.2021.9596317","DOIUrl":null,"url":null,"abstract":"This paper introduces a novel class of Graph-based Transform based on neural networks (GBT-NN) within the context of block-based predictive transform coding of imaging data. To reduce the signalling overhead required to reconstruct the data after transformation, the proposed GBT-NN predicts the graph information needed to compute the inverse transform via a neural network. Evaluation results on several video frames and medical images, in terms of the percentage of energy preserved by a sub-set of transform coefficients and the mean squared error of the reconstructed data, show that the GBT-NN can outperform the DCT and DST, which are widely used in modern video codecs.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Graph-Based Transform Based on Neural Networks for Intra-Prediction of Imaging Data\",\"authors\":\"Debaleena Roy, T. Guha, V. Sanchez\",\"doi\":\"10.1109/mlsp52302.2021.9596317\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper introduces a novel class of Graph-based Transform based on neural networks (GBT-NN) within the context of block-based predictive transform coding of imaging data. To reduce the signalling overhead required to reconstruct the data after transformation, the proposed GBT-NN predicts the graph information needed to compute the inverse transform via a neural network. Evaluation results on several video frames and medical images, in terms of the percentage of energy preserved by a sub-set of transform coefficients and the mean squared error of the reconstructed data, show that the GBT-NN can outperform the DCT and DST, which are widely used in modern video codecs.\",\"PeriodicalId\":156116,\"journal\":{\"name\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/mlsp52302.2021.9596317\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlsp52302.2021.9596317","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Graph-Based Transform Based on Neural Networks for Intra-Prediction of Imaging Data
This paper introduces a novel class of Graph-based Transform based on neural networks (GBT-NN) within the context of block-based predictive transform coding of imaging data. To reduce the signalling overhead required to reconstruct the data after transformation, the proposed GBT-NN predicts the graph information needed to compute the inverse transform via a neural network. Evaluation results on several video frames and medical images, in terms of the percentage of energy preserved by a sub-set of transform coefficients and the mean squared error of the reconstructed data, show that the GBT-NN can outperform the DCT and DST, which are widely used in modern video codecs.