{"title":"通过区域风格整改生成人物形象","authors":"Guiyu Xia , Yun Liu , Zhedong Jin , Yubao Sun","doi":"10.1016/j.patcog.2025.112062","DOIUrl":null,"url":null,"abstract":"<div><div>Person image generation is a challenging task and has been used in many people centered applications. Current person image generation methods mainly focus on transferring the texture features of the source image to the target pose, but ignore the constraint effect of the source image on the generated results. In this paper, we propose a regional style rectification network for person image generation, which aims to use the style information of source images to improve the textures of the generated person images. Within the proposed model, we design a regional style compensation module to rectify the shape-independent style information of the generated image to the source by producing a regional style residual map. Furthermore, we propose a regional style consistency loss to strengthen the style rectification, which realizes the direct supervision of source images over target image generation, ignoring the shape differences between the source and target images. Besides, we use a dual-attention-based texture transformation module which exploits the correlations between source and target images to better preserve the texture features of source images. The experiment results for pose transfer and attribute editing demonstrate the effectiveness of the proposed model.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"171 ","pages":"Article 112062"},"PeriodicalIF":7.6000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Person image generation via regional style rectification\",\"authors\":\"Guiyu Xia , Yun Liu , Zhedong Jin , Yubao Sun\",\"doi\":\"10.1016/j.patcog.2025.112062\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Person image generation is a challenging task and has been used in many people centered applications. Current person image generation methods mainly focus on transferring the texture features of the source image to the target pose, but ignore the constraint effect of the source image on the generated results. In this paper, we propose a regional style rectification network for person image generation, which aims to use the style information of source images to improve the textures of the generated person images. Within the proposed model, we design a regional style compensation module to rectify the shape-independent style information of the generated image to the source by producing a regional style residual map. Furthermore, we propose a regional style consistency loss to strengthen the style rectification, which realizes the direct supervision of source images over target image generation, ignoring the shape differences between the source and target images. Besides, we use a dual-attention-based texture transformation module which exploits the correlations between source and target images to better preserve the texture features of source images. The experiment results for pose transfer and attribute editing demonstrate the effectiveness of the proposed model.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"171 \",\"pages\":\"Article 112062\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325007228\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325007228","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Person image generation via regional style rectification
Person image generation is a challenging task and has been used in many people centered applications. Current person image generation methods mainly focus on transferring the texture features of the source image to the target pose, but ignore the constraint effect of the source image on the generated results. In this paper, we propose a regional style rectification network for person image generation, which aims to use the style information of source images to improve the textures of the generated person images. Within the proposed model, we design a regional style compensation module to rectify the shape-independent style information of the generated image to the source by producing a regional style residual map. Furthermore, we propose a regional style consistency loss to strengthen the style rectification, which realizes the direct supervision of source images over target image generation, ignoring the shape differences between the source and target images. Besides, we use a dual-attention-based texture transformation module which exploits the correlations between source and target images to better preserve the texture features of source images. The experiment results for pose transfer and attribute editing demonstrate the effectiveness of the proposed model.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.