{"title":"Diffusedesigner:基于草图的可控服装图像生成","authors":"Jangho Lee, Jinpyung Kim, Hyeonbeen Lee","doi":"10.1186/s40691-025-00426-x","DOIUrl":null,"url":null,"abstract":"<div><p>A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose <i>DiffuseDesigner</i>, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.</p></div>","PeriodicalId":555,"journal":{"name":"Fashion and Textiles","volume":"12 1","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://fashionandtextiles.springeropen.com/counter/pdf/10.1186/s40691-025-00426-x","citationCount":"0","resultStr":"{\"title\":\"Diffusedesigner: sketch-based controllable clothing image generation\",\"authors\":\"Jangho Lee, Jinpyung Kim, Hyeonbeen Lee\",\"doi\":\"10.1186/s40691-025-00426-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose <i>DiffuseDesigner</i>, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.</p></div>\",\"PeriodicalId\":555,\"journal\":{\"name\":\"Fashion and Textiles\",\"volume\":\"12 1\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://fashionandtextiles.springeropen.com/counter/pdf/10.1186/s40691-025-00426-x\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Fashion and Textiles\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://link.springer.com/article/10.1186/s40691-025-00426-x\",\"RegionNum\":4,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATERIALS SCIENCE, TEXTILES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fashion and Textiles","FirstCategoryId":"88","ListUrlMain":"https://link.springer.com/article/10.1186/s40691-025-00426-x","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATERIALS SCIENCE, TEXTILES","Score":null,"Total":0}
A tech pack is a set of documents that guides the production of a fashion product and includes all the necessary details to create the garment as designed. It helps to communicate between designers and manufacturers by specifying the detailed requirements for actual garment production. The motivation for this study originates from the resemblance between the flat sketch included in the tech pack and the final garment design. We propose DiffuseDesigner, a sketch-based controllable clothing image generation method, which is trained using pseudo flat sketches and prompts describing the desired output design. In light of this, we collected a large-scale fashion image dataset composed of multiple categories of clothing from three shopping websites. An edge detection algorithm was applied for each image to generate a pseudo flat sketch, and textual prompt information was extracted using a CLIP. Then, we constructed a triplet consisting of the pseudo flat sketch, textual prompt, and original image, and fine-tuned ControlNet to synthesize clothing images based on a pseudo flat sketch while leveraging the prompt information to guide the generation process. We conducted extensive experiments for performance evaluation based on the prompt the user wishes to generate, and quantitatively verified the effectiveness of clothing generation using five evaluation metrics. Finally, through variations in the prompt and pseudo flat sketch, we visually confirmed that the trained model was well-controlled in the direction desired by the designer.
期刊介绍:
Fashion and Textiles aims to advance knowledge and to seek new perspectives in the fashion and textiles industry worldwide. We welcome original research articles, reviews, case studies, book reviews and letters to the editor.
The scope of the journal includes the following four technical research divisions:
Textile Science and Technology: Textile Material Science and Technology; Dyeing and Finishing; Smart and Intelligent Textiles
Clothing Science and Technology: Physiology of Clothing/Textile Products; Protective clothing ; Smart and Intelligent clothing; Sportswear; Mass customization ; Apparel manufacturing
Economics of Clothing and Textiles/Fashion Business: Management of the Clothing and Textiles Industry; Merchandising; Retailing; Fashion Marketing; Consumer Behavior; Socio-psychology of Fashion
Fashion Design and Cultural Study on Fashion: Aesthetic Aspects of Fashion Product or Design Process; Textiles/Clothing/Fashion Design; Fashion Trend; History of Fashion; Costume or Dress; Fashion Theory; Fashion journalism; Fashion exhibition.