Diffusedesigner：基于草图的可控服装图像生成

IF 3.5 4区管理学 Q1 MATERIALS SCIENCE, TEXTILES

Fashion and Textiles Pub Date : 2025-09-08 DOI:10.1186/s40691-025-00426-x

Jangho Lee, Jinpyung Kim, Hyeonbeen Lee

{"title":"Diffusedesigner：基于草图的可控服装图像生成","authors":"Jangho Lee, Jinpyung Kim, Hyeonbeen Lee","doi":"10.1186/s40691-025-00426-x","DOIUrl":null,"url":null,"abstract":"<div><p>A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose <i>DiffuseDesigner</i>, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.</p></div>","PeriodicalId":555,"journal":{"name":"Fashion and Textiles","volume":"12 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://fashionandtextiles.springeropen.com/counter/pdf/10.1186/s40691-025-00426-x","citationCount":"0","resultStr":"{\"title\":\"Diffusedesigner: sketch-based controllable clothing image generation\",\"authors\":\"Jangho Lee, Jinpyung Kim, Hyeonbeen Lee\",\"doi\":\"10.1186/s40691-025-00426-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose <i>DiffuseDesigner</i>, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.</p></div>\",\"PeriodicalId\":555,\"journal\":{\"name\":\"Fashion and Textiles\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://fashionandtextiles.springeropen.com/counter/pdf/10.1186/s40691-025-00426-x\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fashion and Textiles\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s40691-025-00426-x\",\"RegionNum\":4,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, TEXTILES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fashion and Textiles","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1186/s40691-025-00426-x","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, TEXTILES","Score":null,"Total":0}

引用次数: 0

摘要

技术包是一套指导时尚产品生产的文件，包括按照设计制作服装的所有必要细节。它通过规定实际服装生产的详细要求，有助于设计师和制造商之间的沟通。这项研究的动机源于技术包中包含的平面草图与最终服装设计之间的相似性。我们提出了一种基于草图的可控服装图像生成方法DiffuseDesigner，该方法使用伪平面草图和描述所需输出设计的提示进行训练。鉴于此，我们从三个购物网站上收集了一个由多类服装组成的大规模时尚图像数据集。利用边缘检测算法对每张图像生成伪平面草图，并利用CLIP提取文本提示信息。然后，我们构建了一个由伪平面草图、文本提示和原始图像组成的三元组，并对ControlNet进行了微调，在伪平面草图的基础上合成服装图像，同时利用提示信息指导生成过程。我们根据用户希望生成的提示进行了大量的性能评估实验，并使用五个评估指标定量验证了服装生成的有效性。最后，通过提示符和伪平面草图的变化，我们从视觉上证实了训练好的模型在设计师想要的方向上得到了很好的控制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Diffusedesigner: sketch-based controllable clothing image generation

A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose DiffuseDesigner, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Fashion and Textiles Business, Management and Accounting-Marketing

CiteScore

4.40

自引率

4.20%

发文量

审稿时长

13 weeks

期刊介绍： Fashion and Textiles aims to advance knowledge and to seek new perspectives in the fashion and textiles industry worldwide. We welcome original research articles, reviews, case studies, book reviews and letters to the editor. The scope of the journal includes the following four technical research divisions: Textile Science and Technology: Textile Material Science and Technology; Dyeing and Finishing; Smart and Intelligent Textiles Clothing Science and Technology: Physiology of Clothing/Textile Products; Protective clothing ; Smart and Intelligent clothing; Sportswear; Mass customization ; Apparel manufacturing Economics of Clothing and Textiles/Fashion Business: Management of the Clothing and Textiles Industry; Merchandising; Retailing; Fashion Marketing; Consumer Behavior; Socio-psychology of Fashion Fashion Design and Cultural Study on Fashion: Aesthetic Aspects of Fashion Product or Design Process; Textiles/Clothing/Fashion Design; Fashion Trend; History of Fashion; Costume or Dress; Fashion Theory; Fashion journalism; Fashion exhibition.