{"title":"基于卷积神经网络的视觉伪像分类","authors":"A. Holesova, P. Sykora, P. Kamencay, M. Uhrina","doi":"10.1109/ELEKTRO53996.2022.9803377","DOIUrl":null,"url":null,"abstract":"In this paper, we present an effective convolutional classifier for recognition of visual artifacts. The proposed deep-learned model is simple in architecture and number of learnable parameters while retaining a sufficient generalization ability. The latter was achieved by compilation of extensive dataset based on ImageNet database. The model was trained sequentially on over 3 million images with 3 types of distortions with various severity, specifically 10 levels of Gaussian noise, 10 levels of Gaussian blur and 5 levels of blocking effect caused by JPEG compression. The model achieved excellent 99.88% accuracy on generated images. The performance of the model was evaluated on 4 well-known IQA datasets, where it reached 71.95% accuracy in average. Furthermore, after transferring the weights from the proposed model and short training on IQA datasets, its accuracy increased by more than 7%.","PeriodicalId":396752,"journal":{"name":"2022 ELEKTRO (ELEKTRO)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Convolutional Neural Network for Visual Artifacts Classification\",\"authors\":\"A. Holesova, P. Sykora, P. Kamencay, M. Uhrina\",\"doi\":\"10.1109/ELEKTRO53996.2022.9803377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present an effective convolutional classifier for recognition of visual artifacts. The proposed deep-learned model is simple in architecture and number of learnable parameters while retaining a sufficient generalization ability. The latter was achieved by compilation of extensive dataset based on ImageNet database. The model was trained sequentially on over 3 million images with 3 types of distortions with various severity, specifically 10 levels of Gaussian noise, 10 levels of Gaussian blur and 5 levels of blocking effect caused by JPEG compression. The model achieved excellent 99.88% accuracy on generated images. The performance of the model was evaluated on 4 well-known IQA datasets, where it reached 71.95% accuracy in average. Furthermore, after transferring the weights from the proposed model and short training on IQA datasets, its accuracy increased by more than 7%.\",\"PeriodicalId\":396752,\"journal\":{\"name\":\"2022 ELEKTRO (ELEKTRO)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 ELEKTRO (ELEKTRO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELEKTRO53996.2022.9803377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 ELEKTRO (ELEKTRO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELEKTRO53996.2022.9803377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Convolutional Neural Network for Visual Artifacts Classification
In this paper, we present an effective convolutional classifier for recognition of visual artifacts. The proposed deep-learned model is simple in architecture and number of learnable parameters while retaining a sufficient generalization ability. The latter was achieved by compilation of extensive dataset based on ImageNet database. The model was trained sequentially on over 3 million images with 3 types of distortions with various severity, specifically 10 levels of Gaussian noise, 10 levels of Gaussian blur and 5 levels of blocking effect caused by JPEG compression. The model achieved excellent 99.88% accuracy on generated images. The performance of the model was evaluated on 4 well-known IQA datasets, where it reached 71.95% accuracy in average. Furthermore, after transferring the weights from the proposed model and short training on IQA datasets, its accuracy increased by more than 7%.