{"title":"SGD-font: Style and glyph decoupling for one-shot font generation","authors":"Zhenhua Li , Siyi Chen , Dong Liang","doi":"10.1016/j.knosys.2025.114600","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic font generation aims to generate a complete font library by learning the font style from reference samples. Font generation is challenging because it needs to generate a set of sheer quantity of characters with consistent style and complicated structures of glyphs with limited reference font images, especially when the character or font style is unseen during training. In this paper, we propose a glyph control-based diffusion model for one-shot font generation. Specifically, we employ a style encoder to extract multi-scale style features and incorporate them into the reverse denoising steps of the diffusion model via cross-attention-based style fusion blocks. Decoupling style and glyph enables the combination of arbitrary styles and glyphs in font creation and allows users to generate fonts with unseen styles and unseen glyphs. In the inference stage, we introduce a multi-condition sampling strategy to effectively align the desired style and target glyph. Comprehensive experiments and a user study show that our framework surpasses existing approaches for both seen and unseen fonts. We further demonstrate its capability for style interpolation and cross-lingual font generation. The code is available at <span><span>https://github.com/ChenSiyi1/SGD-Font</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114600"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125016399","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic font generation aims to generate a complete font library by learning the font style from reference samples. Font generation is challenging because it needs to generate a set of sheer quantity of characters with consistent style and complicated structures of glyphs with limited reference font images, especially when the character or font style is unseen during training. In this paper, we propose a glyph control-based diffusion model for one-shot font generation. Specifically, we employ a style encoder to extract multi-scale style features and incorporate them into the reverse denoising steps of the diffusion model via cross-attention-based style fusion blocks. Decoupling style and glyph enables the combination of arbitrary styles and glyphs in font creation and allows users to generate fonts with unseen styles and unseen glyphs. In the inference stage, we introduce a multi-condition sampling strategy to effectively align the desired style and target glyph. Comprehensive experiments and a user study show that our framework surpasses existing approaches for both seen and unseen fonts. We further demonstrate its capability for style interpolation and cross-lingual font generation. The code is available at https://github.com/ChenSiyi1/SGD-Font.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.