{"title":"Retouched Face Image Quality Assessment Based on Differential Perception and Textual Prompt","authors":"Tianwei Zhou;Songbai Tan;Gang Li;Shishun Tian;Chang Tang;Zhihua Wang;Guanghui Yue","doi":"10.1109/TBC.2024.3447454","DOIUrl":null,"url":null,"abstract":"Face retouching involves using digital techniques to alter an individual’s appearance, commonly using in social media. However, excessively retouched face (RF) images can lead to issues such as unrealistic beauty standards and psychological stress. Therefore, it is crucial to develop a reliable quality assessment method for RF images. In this paper, we propose a novel network named DIRF-IQA for RF image quality assessment (IQA). DIRF-IQA mainly includes a parameter-shared image encoder, a text encoder, and three key components, namely the Differential Feature Attention Module (DFAM), the Text-image Interaction Module (TIM), and the Multi-scale Feature Fusion Module (MFFM). Specifically, the DFAM captures both local and global differences between original and retouched images by processing multi-scale features and utilizing cross-attention and self-attention blocks for differential perception. In the TIM, textual prompts summarizing retouching operations are encoded by a text encoder and integrated with differential features extracted by the DFAM to enhance the understanding of distortions in RF images. The MFFM then fuses these text-enhanced features across different layers and combines them with the global differential feature to predict the quality of the retouched images. We conduct extensive experiments on two RF IQA databases and the results demonstrate the superiority of DIDF-IQA compared to 12 state-of-the-art full-reference IQA methods in evaluating RF images.","PeriodicalId":13159,"journal":{"name":"IEEE Transactions on Broadcasting","volume":"71 1","pages":"240-251"},"PeriodicalIF":4.8000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Broadcasting","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10663283/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Face retouching involves using digital techniques to alter an individual’s appearance, commonly using in social media. However, excessively retouched face (RF) images can lead to issues such as unrealistic beauty standards and psychological stress. Therefore, it is crucial to develop a reliable quality assessment method for RF images. In this paper, we propose a novel network named DIRF-IQA for RF image quality assessment (IQA). DIRF-IQA mainly includes a parameter-shared image encoder, a text encoder, and three key components, namely the Differential Feature Attention Module (DFAM), the Text-image Interaction Module (TIM), and the Multi-scale Feature Fusion Module (MFFM). Specifically, the DFAM captures both local and global differences between original and retouched images by processing multi-scale features and utilizing cross-attention and self-attention blocks for differential perception. In the TIM, textual prompts summarizing retouching operations are encoded by a text encoder and integrated with differential features extracted by the DFAM to enhance the understanding of distortions in RF images. The MFFM then fuses these text-enhanced features across different layers and combines them with the global differential feature to predict the quality of the retouched images. We conduct extensive experiments on two RF IQA databases and the results demonstrate the superiority of DIDF-IQA compared to 12 state-of-the-art full-reference IQA methods in evaluating RF images.
期刊介绍:
The Society’s Field of Interest is “Devices, equipment, techniques and systems related to broadcast technology, including the production, distribution, transmission, and propagation aspects.” In addition to this formal FOI statement, which is used to provide guidance to the Publications Committee in the selection of content, the AdCom has further resolved that “broadcast systems includes all aspects of transmission, propagation, and reception.”