{"title":"Art creator: Steering styles in diffusion model","authors":"Shan Tang , Wenhua Qian , Peng Liu , Jinde Cao","doi":"10.1016/j.neucom.2025.129511","DOIUrl":null,"url":null,"abstract":"<div><div>Large-scale text-to-image (T2I) generative models are extensively used in the art and creative industries because of their remarkable capability in generating high-quality images. The generation of ideal images in a single attempt is nearly impossible, necessitating complex and precise post-image editing. However, stylization pose significant challenges in post-editing. In this context, we introduce the Art Creator, which facilitates style controls based on a simple description or a single image. Art Creator enables nuanced image style edits, alterations in painting materials, colors, and brushstrokes, and understanding of high-level attributes such as object shapes. Furthermore, we manually annotated and released a dataset named ChinArt, comprising over 20,000 eastern artworks, aiming to address the gap in the global art creation domain. We showcase the quality and efficiency of our method across various art style creations.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"626 ","pages":"Article 129511"},"PeriodicalIF":5.5000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225001833","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Large-scale text-to-image (T2I) generative models are extensively used in the art and creative industries because of their remarkable capability in generating high-quality images. The generation of ideal images in a single attempt is nearly impossible, necessitating complex and precise post-image editing. However, stylization pose significant challenges in post-editing. In this context, we introduce the Art Creator, which facilitates style controls based on a simple description or a single image. Art Creator enables nuanced image style edits, alterations in painting materials, colors, and brushstrokes, and understanding of high-level attributes such as object shapes. Furthermore, we manually annotated and released a dataset named ChinArt, comprising over 20,000 eastern artworks, aiming to address the gap in the global art creation domain. We showcase the quality and efficiency of our method across various art style creations.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.