Perceptual Quality Assessment of 360° Images Based on Generative Scanpath Representation

IF 13.7
Xiangjie Sui;Hanwei Zhu;Xuelin Liu;Yuming Fang;Shiqi Wang;Zhou Wang
{"title":"Perceptual Quality Assessment of 360° Images Based on Generative Scanpath Representation","authors":"Xiangjie Sui;Hanwei Zhu;Xuelin Liu;Yuming Fang;Shiqi Wang;Zhou Wang","doi":"10.1109/TIP.2025.3583181","DOIUrl":null,"url":null,"abstract":"Despite substantial efforts dedicated to the design of heuristic models for omnidirectional (i.e., 360°) image quality assessment (OIQA), a conspicuous gap remains due to the lack of consideration for the diversity of viewing behaviors that leads to the varying perceptual quality of 360° images. Two critical aspects underline this oversight: the neglect of viewing conditions that significantly sway user gaze patterns and the overreliance on a single viewport sequence from the 360° image for quality inference. To address these issues, we introduce a unique generative scanpath representation (GSR) for effective quality inference of 360° images, which aggregates varied perceptual experiences of multi-hypothesis users under a predefined viewing condition. More specifically, given a viewing condition characterized by the starting point of viewing and exploration time, a set of scanpaths consisting of dynamic visual fixations can be produced using an apt scanpath generator. Following this vein, we use the scanpaths to convert the 360° image into the unique GSR, which provides a global overview of gazed-focused contents derived from scanpaths. As such, the quality inference of the 360° image is swiftly transformed to that of GSR. We then propose an efficient OIQA computational framework by learning the quality maps of GSR. Comprehensive experimental results validate that the predictions of the proposed framework are highly consistent with human perception in the spatiotemporal domain, especially in the challenging context of locally distorted 360° images under varied viewing conditions. The code will be released at <uri>https://github.com/xiangjieSui/GSR</uri>","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"4485-4499"},"PeriodicalIF":13.7000,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11080124/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Despite substantial efforts dedicated to the design of heuristic models for omnidirectional (i.e., 360°) image quality assessment (OIQA), a conspicuous gap remains due to the lack of consideration for the diversity of viewing behaviors that leads to the varying perceptual quality of 360° images. Two critical aspects underline this oversight: the neglect of viewing conditions that significantly sway user gaze patterns and the overreliance on a single viewport sequence from the 360° image for quality inference. To address these issues, we introduce a unique generative scanpath representation (GSR) for effective quality inference of 360° images, which aggregates varied perceptual experiences of multi-hypothesis users under a predefined viewing condition. More specifically, given a viewing condition characterized by the starting point of viewing and exploration time, a set of scanpaths consisting of dynamic visual fixations can be produced using an apt scanpath generator. Following this vein, we use the scanpaths to convert the 360° image into the unique GSR, which provides a global overview of gazed-focused contents derived from scanpaths. As such, the quality inference of the 360° image is swiftly transformed to that of GSR. We then propose an efficient OIQA computational framework by learning the quality maps of GSR. Comprehensive experimental results validate that the predictions of the proposed framework are highly consistent with human perception in the spatiotemporal domain, especially in the challenging context of locally distorted 360° images under varied viewing conditions. The code will be released at https://github.com/xiangjieSui/GSR
基于生成式扫描路径表示的360°图像感知质量评估
尽管在设计全向(即360°)图像质量评估(OIQA)的启发式模型方面做出了大量努力,但由于缺乏对观看行为多样性的考虑,导致360°图像的感知质量存在差异,因此仍然存在明显的差距。两个关键方面强调了这种疏忽:忽视了显著影响用户凝视模式的观看条件,以及过度依赖来自360°图像的单个视口序列来进行质量推断。为了解决这些问题,我们引入了一种独特的生成扫描路径表示(GSR),用于360°图像的有效质量推断,该方法在预定义的观看条件下聚合了多假设用户的各种感知体验。更具体地说,给定以观看起始点和探索时间为特征的观看条件,使用apt扫描路径生成器可以生成一组由动态视觉注视点组成的扫描路径。遵循这一思路,我们使用扫描路径将360°图像转换为独特的GSR,它提供了来自扫描路径的凝视聚焦内容的全球概述。这样,360°图像的质量推断迅速转化为GSR的质量推断。然后,我们通过学习GSR的质量图,提出了一个有效的OIQA计算框架。综合实验结果验证了所提出框架的预测在时空域中与人类感知高度一致,特别是在不同观看条件下局部扭曲的360°图像具有挑战性的背景下。代码将在https://github.com/xiangjieSui/GSR上发布
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信