超越魔法：在视觉生成媒体中将风格作为负担能力的现实化进行提示

IF 4.5 1区文学 Q1 COMMUNICATION

New Media & Society Pub Date : 2024-10-29 DOI:10.1177/14614448241286144

Nataliia Laba

{"title":"超越魔法：在视觉生成媒体中将风格作为负担能力的现实化进行提示","authors":"Nataliia Laba","doi":"10.1177/14614448241286144","DOIUrl":null,"url":null,"abstract":"As a sociotechnical practice at the nexus of humans, machines, and visual culture, text-to-image generation relies on verbal prompts as the primary technique to guide generative models. To align desired aesthetic outcomes with computer vision, human prompters engage in extensive experimentation, leveraging the model’s affordances through prompting for style. Focusing on the interplay between machine originality and repetition, this study addresses the dynamics of human-model interaction on Midjourney, a popular generative model (version 6) hosted on Discord. It examines style modifiers that users of visual generative media add to their prompts and addresses the aesthetic quality of AI images as a multilayered construct resulting from affordance actualization. I argue that while visual generative media holds promise for expanding the boundaries of creative expression, prompting for style is implicated in the practice of generating a visual aesthetic that mimics paradigms of existing cultural phenomena, which are never fully reduced to the optimized target output.","PeriodicalId":19149,"journal":{"name":"New Media & Society","volume":"15 1","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Beyond magic: Prompting for style as affordance actualization in visual generative media\",\"authors\":\"Nataliia Laba\",\"doi\":\"10.1177/14614448241286144\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As a sociotechnical practice at the nexus of humans, machines, and visual culture, text-to-image generation relies on verbal prompts as the primary technique to guide generative models. To align desired aesthetic outcomes with computer vision, human prompters engage in extensive experimentation, leveraging the model’s affordances through prompting for style. Focusing on the interplay between machine originality and repetition, this study addresses the dynamics of human-model interaction on Midjourney, a popular generative model (version 6) hosted on Discord. It examines style modifiers that users of visual generative media add to their prompts and addresses the aesthetic quality of AI images as a multilayered construct resulting from affordance actualization. I argue that while visual generative media holds promise for expanding the boundaries of creative expression, prompting for style is implicated in the practice of generating a visual aesthetic that mimics paradigms of existing cultural phenomena, which are never fully reduced to the optimized target output.\",\"PeriodicalId\":19149,\"journal\":{\"name\":\"New Media & Society\",\"volume\":\"15 1\",\"pages\":\"\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2024-10-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Media & Society\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1177/14614448241286144\",\"RegionNum\":1,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMMUNICATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Media & Society","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1177/14614448241286144","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMMUNICATION","Score":null,"Total":0}

引用次数: 0

摘要

作为一种处于人类、机器和视觉文化之间的社会技术实践，文本到图像生成依赖于语言提示作为引导生成模型的主要技术。为了使所需的美学效果与计算机视觉相一致，人类提示者进行了大量实验，通过提示风格来利用模型的能力。本研究侧重于机器原创性和重复性之间的相互作用，探讨了在 Discord 上托管的流行生成模型 Midjourney（第 6 版）上人机交互的动态。本研究探讨了视觉生成媒体用户在其提示中添加的风格修改器，并将人工智能图像的审美质量作为一种多层次的结构来处理，这种多层次的结构产生于承受能力的实现。我认为，虽然视觉生成媒体有望拓展创意表达的边界，但提示风格与生成模仿现有文化现象范式的视觉美学的实践有关，而这些范式永远不会完全简化为优化的目标输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Beyond magic: Prompting for style as affordance actualization in visual generative media

As a sociotechnical practice at the nexus of humans, machines, and visual culture, text-to-image generation relies on verbal prompts as the primary technique to guide generative models. To align desired aesthetic outcomes with computer vision, human prompters engage in extensive experimentation, leveraging the model’s affordances through prompting for style. Focusing on the interplay between machine originality and repetition, this study addresses the dynamics of human-model interaction on Midjourney, a popular generative model (version 6) hosted on Discord. It examines style modifiers that users of visual generative media add to their prompts and addresses the aesthetic quality of AI images as a multilayered construct resulting from affordance actualization. I argue that while visual generative media holds promise for expanding the boundaries of creative expression, prompting for style is implicated in the practice of generating a visual aesthetic that mimics paradigms of existing cultural phenomena, which are never fully reduced to the optimized target output.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

New Media & Society COMMUNICATION-

CiteScore

12.70

自引率

8.00%

发文量

274

期刊介绍： New Media & Society engages in critical discussions of the key issues arising from the scale and speed of new media development, drawing on a wide range of disciplinary perspectives and on both theoretical and empirical research. The journal includes contributions on: -the individual and the social, the cultural and the political dimensions of new media -the global and local dimensions of the relationship between media and social change -contemporary as well as historical developments -the implications and impacts of, as well as the determinants and obstacles to, media change the relationship between theory, policy and practice.