Zhun Li , Yuyang Wang , Lianmin Zhang , Hongkui Wang , Haibing Yin , Yong Chen , Wei Zhang
{"title":"用于JNQP预测的多类大规模压缩视频数据集","authors":"Zhun Li , Yuyang Wang , Lianmin Zhang , Hongkui Wang , Haibing Yin , Yong Chen , Wei Zhang","doi":"10.1016/j.jvcir.2025.104513","DOIUrl":null,"url":null,"abstract":"<div><div>In order to further improve video compression, numerous compressed video datasets have been released to predict the just noticeable distortion (JND) or the just noticeable quantization parameter (JNQP) for perceptual video coding. However, existing compressed video datasets are unable to meet the precise prediction of JNQP (or JND) applicable to different codecs and coding modes. Thus, regarding the current mainstream video codecs, this paper selects 50 source videos and compresses them with H.265 and H.266 codecs with the all intra (AI), the random access (RA) and the low delay (LD) coding modes using 38 quantization parameters. Then, 50 testers are asked to evaluate the JNQP values for three perceptual quality levels for each source video. All JNQP samples have been fully processed to meet the requirement of JNQP prediction for each codec under different coding modes. Our dataset is the first one for JNQP prediction across multiple codecs and coding modes, which is named by ALR-Video and can be downloaded at https://github.com/903365130/ALR-Video.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"111 ","pages":"Article 104513"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ALR-Video: A multi-class large-scale compressed video dataset for JNQP prediction\",\"authors\":\"Zhun Li , Yuyang Wang , Lianmin Zhang , Hongkui Wang , Haibing Yin , Yong Chen , Wei Zhang\",\"doi\":\"10.1016/j.jvcir.2025.104513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In order to further improve video compression, numerous compressed video datasets have been released to predict the just noticeable distortion (JND) or the just noticeable quantization parameter (JNQP) for perceptual video coding. However, existing compressed video datasets are unable to meet the precise prediction of JNQP (or JND) applicable to different codecs and coding modes. Thus, regarding the current mainstream video codecs, this paper selects 50 source videos and compresses them with H.265 and H.266 codecs with the all intra (AI), the random access (RA) and the low delay (LD) coding modes using 38 quantization parameters. Then, 50 testers are asked to evaluate the JNQP values for three perceptual quality levels for each source video. All JNQP samples have been fully processed to meet the requirement of JNQP prediction for each codec under different coding modes. Our dataset is the first one for JNQP prediction across multiple codecs and coding modes, which is named by ALR-Video and can be downloaded at https://github.com/903365130/ALR-Video.</div></div>\",\"PeriodicalId\":54755,\"journal\":{\"name\":\"Journal of Visual Communication and Image Representation\",\"volume\":\"111 \",\"pages\":\"Article 104513\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Visual Communication and Image Representation\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1047320325001270\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001270","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
ALR-Video: A multi-class large-scale compressed video dataset for JNQP prediction
In order to further improve video compression, numerous compressed video datasets have been released to predict the just noticeable distortion (JND) or the just noticeable quantization parameter (JNQP) for perceptual video coding. However, existing compressed video datasets are unable to meet the precise prediction of JNQP (or JND) applicable to different codecs and coding modes. Thus, regarding the current mainstream video codecs, this paper selects 50 source videos and compresses them with H.265 and H.266 codecs with the all intra (AI), the random access (RA) and the low delay (LD) coding modes using 38 quantization parameters. Then, 50 testers are asked to evaluate the JNQP values for three perceptual quality levels for each source video. All JNQP samples have been fully processed to meet the requirement of JNQP prediction for each codec under different coding modes. Our dataset is the first one for JNQP prediction across multiple codecs and coding modes, which is named by ALR-Video and can be downloaded at https://github.com/903365130/ALR-Video.
期刊介绍:
The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.