{"title":"InkSpirit:一种专业知识驱动的方法,用于增强传统中国画文本到图像生成的视觉逻辑","authors":"Di Zhang , Chen Yi , Xinyu Gao , Xiangsheng Zeng , Runqiao Xia , Yongbo Jiang , Yingchaojie Feng , Wei Zhang , Wei Chen","doi":"10.1016/j.cag.2025.104330","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional Chinese painting (TCP) presents unique challenges for text-to-image models, including composition logic deficiency, lack of inscription semantics, and style deviations. This study proposes the “InkSpirit” framework, employing an expert knowledge-driven approach to address these issues by: (1) constructing a TCP dataset with composition-based Blank Space Principles, (2) building an Artistic conception-Inscription corpus, and (3) designing a generation framework based on ComfyUI workflow for precise control over TCP elements. Experiments demonstrate superior performance in image quality metrics, with validation through expert and user evaluations, advancing the integration of traditional art with AI technology.</div></div>","PeriodicalId":50628,"journal":{"name":"Computers & Graphics-Uk","volume":"132 ","pages":"Article 104330"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"InkSpirit: An expert knowledge-driven approach for enhancing the visual logic of traditional Chinese painting text-to-image generation\",\"authors\":\"Di Zhang , Chen Yi , Xinyu Gao , Xiangsheng Zeng , Runqiao Xia , Yongbo Jiang , Yingchaojie Feng , Wei Zhang , Wei Chen\",\"doi\":\"10.1016/j.cag.2025.104330\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traditional Chinese painting (TCP) presents unique challenges for text-to-image models, including composition logic deficiency, lack of inscription semantics, and style deviations. This study proposes the “InkSpirit” framework, employing an expert knowledge-driven approach to address these issues by: (1) constructing a TCP dataset with composition-based Blank Space Principles, (2) building an Artistic conception-Inscription corpus, and (3) designing a generation framework based on ComfyUI workflow for precise control over TCP elements. Experiments demonstrate superior performance in image quality metrics, with validation through expert and user evaluations, advancing the integration of traditional art with AI technology.</div></div>\",\"PeriodicalId\":50628,\"journal\":{\"name\":\"Computers & Graphics-Uk\",\"volume\":\"132 \",\"pages\":\"Article 104330\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Graphics-Uk\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0097849325001724\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Graphics-Uk","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0097849325001724","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
InkSpirit: An expert knowledge-driven approach for enhancing the visual logic of traditional Chinese painting text-to-image generation
Traditional Chinese painting (TCP) presents unique challenges for text-to-image models, including composition logic deficiency, lack of inscription semantics, and style deviations. This study proposes the “InkSpirit” framework, employing an expert knowledge-driven approach to address these issues by: (1) constructing a TCP dataset with composition-based Blank Space Principles, (2) building an Artistic conception-Inscription corpus, and (3) designing a generation framework based on ComfyUI workflow for precise control over TCP elements. Experiments demonstrate superior performance in image quality metrics, with validation through expert and user evaluations, advancing the integration of traditional art with AI technology.
期刊介绍:
Computers & Graphics is dedicated to disseminate information on research and applications of computer graphics (CG) techniques. The journal encourages articles on:
1. Research and applications of interactive computer graphics. We are particularly interested in novel interaction techniques and applications of CG to problem domains.
2. State-of-the-art papers on late-breaking, cutting-edge research on CG.
3. Information on innovative uses of graphics principles and technologies.
4. Tutorial papers on both teaching CG principles and innovative uses of CG in education.