如果一张图片胜过1000个单词,那么在设计度量评估中,一个单词是否值得1000个特征?

K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed
{"title":"如果一张图片胜过1000个单词,那么在设计度量评估中,一个单词是否值得1000个特征?","authors":"K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed","doi":"10.1115/detc2021-70158","DOIUrl":null,"url":null,"abstract":"\n A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.","PeriodicalId":261968,"journal":{"name":"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"If a Picture is Worth 1000 Words, Is a Word Worth 1000 Features for Design Metric Estimation?\",\"authors\":\"K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed\",\"doi\":\"10.1115/detc2021-70158\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.\",\"PeriodicalId\":261968,\"journal\":{\"name\":\"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2021-70158\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-70158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

一张图片胜过千言万语,在设计度量评估中,一个词可能胜过一千个特征。图片之所以被授予这种价值,是因为它们能够编码大量的信息。在评估设计时,我们的目标是捕获一系列信息,包括设计的有用性,独特性和新颖性。这些概念的主观性质使它们难以评价。尽管如此,人们还是做出了许多尝试,并开发了度量标准,因为设计评估是创新和创造新解决方案的组成部分。最常用的指标是共识评估技术(CAT)和Shah, Vargas-Hernandez, and Smith (SVS)方法。虽然CAT是准确的,并且经常被视为“黄金标准”,但它严重依赖于使用专家评级作为判断的基础,这使得CAT昂贵且耗时。相比之下,SVS对资源的需求较少,但经常被批评缺乏灵敏度和准确性。我们的目标是通过机器学习来利用这两种方法的独特优势。更具体地说,本研究旨在探讨使用机器学习促进自动化创造力评估的可能性。SVS方法产生一个关于设计的文本丰富的数据集。在本文中,我们利用这些文本设计表示和单词和句子编码的深层语义关系来预测更理想的设计度量,包括CAT度量。我们展示了机器学习模型从设计本身和SVS调查信息中预测设计指标的能力。我们证明,结合自然语言处理(NLP)可以提高我们所有设计指标的预测结果,并且某些指标的可预测性存在明显的区别。我们的代码和关于我们工作的其他信息可以在http://decode.mit.edu/projects/nlp-design-eval/上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
If a Picture is Worth 1000 Words, Is a Word Worth 1000 Features for Design Metric Estimation?
A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信