Yu Liu , Yang Ding , Fatimah Binti Khalid , Cunrui Wang , Lei Wang
{"title":"通过去噪扩散和组件级细粒度样式生成少量字体","authors":"Yu Liu , Yang Ding , Fatimah Binti Khalid , Cunrui Wang , Lei Wang","doi":"10.1016/j.eswa.2025.128987","DOIUrl":null,"url":null,"abstract":"<div><div>Few-shot font generation aims to create new fonts using a small number of style examples. It is increasingly gaining attention due to its significant reduction in labor costs. Existing methods rely on GAN-based image-to-image style-transfer frameworks, which are prone to training collapse and struggle to maintain consistency between character content and style. Moreover, they capture only the global style while overlooking fine-grained features of radicals, components, and strokes. To address these challenges, we propose a diffusion model-based image-to-image font generation method.We fully consider the component styles between content glyphs and reference glyphs, assigning appropriate fine-grained styles to content glyphs through a multi-character style aggregation module. Additionally, in order to better preserve the integrity of character structures during the denoising iteration process, we propose leveraging an offset-enhanced multi-head attention mechanism. This mechanism adaptively samples and embeds multi-scale glyph content features into the diffusion model. Comprehensive experiments demonstrate that our method outperforms existing font generation methods both qualitatively and quantitatively.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"296 ","pages":"Article 128987"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Few-shot font generation via denoising diffusion and component-level fine-grained style\",\"authors\":\"Yu Liu , Yang Ding , Fatimah Binti Khalid , Cunrui Wang , Lei Wang\",\"doi\":\"10.1016/j.eswa.2025.128987\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Few-shot font generation aims to create new fonts using a small number of style examples. It is increasingly gaining attention due to its significant reduction in labor costs. Existing methods rely on GAN-based image-to-image style-transfer frameworks, which are prone to training collapse and struggle to maintain consistency between character content and style. Moreover, they capture only the global style while overlooking fine-grained features of radicals, components, and strokes. To address these challenges, we propose a diffusion model-based image-to-image font generation method.We fully consider the component styles between content glyphs and reference glyphs, assigning appropriate fine-grained styles to content glyphs through a multi-character style aggregation module. Additionally, in order to better preserve the integrity of character structures during the denoising iteration process, we propose leveraging an offset-enhanced multi-head attention mechanism. This mechanism adaptively samples and embeds multi-scale glyph content features into the diffusion model. Comprehensive experiments demonstrate that our method outperforms existing font generation methods both qualitatively and quantitatively.</div></div>\",\"PeriodicalId\":50461,\"journal\":{\"name\":\"Expert Systems with Applications\",\"volume\":\"296 \",\"pages\":\"Article 128987\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2025-07-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Expert Systems with Applications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0957417425026041\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425026041","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Few-shot font generation via denoising diffusion and component-level fine-grained style
Few-shot font generation aims to create new fonts using a small number of style examples. It is increasingly gaining attention due to its significant reduction in labor costs. Existing methods rely on GAN-based image-to-image style-transfer frameworks, which are prone to training collapse and struggle to maintain consistency between character content and style. Moreover, they capture only the global style while overlooking fine-grained features of radicals, components, and strokes. To address these challenges, we propose a diffusion model-based image-to-image font generation method.We fully consider the component styles between content glyphs and reference glyphs, assigning appropriate fine-grained styles to content glyphs through a multi-character style aggregation module. Additionally, in order to better preserve the integrity of character structures during the denoising iteration process, we propose leveraging an offset-enhanced multi-head attention mechanism. This mechanism adaptively samples and embeds multi-scale glyph content features into the diffusion model. Comprehensive experiments demonstrate that our method outperforms existing font generation methods both qualitatively and quantitatively.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.