K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed
{"title":"如果一张图片胜过1000个单词,那么在设计度量评估中,一个单词是否值得1000个特征?","authors":"K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed","doi":"10.1115/detc2021-70158","DOIUrl":null,"url":null,"abstract":"\n A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.","PeriodicalId":261968,"journal":{"name":"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"If a Picture is Worth 1000 Words, Is a Word Worth 1000 Features for Design Metric Estimation?\",\"authors\":\"K. M. Edwards, Aoran Peng, Scarlett R. Miller, Faez Ahmed\",\"doi\":\"10.1115/detc2021-70158\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.\",\"PeriodicalId\":261968,\"journal\":{\"name\":\"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2021-70158\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 6: 33rd International Conference on Design Theory and Methodology (DTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2021-70158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
一张图片胜过千言万语,在设计度量评估中,一个词可能胜过一千个特征。图片之所以被授予这种价值,是因为它们能够编码大量的信息。在评估设计时,我们的目标是捕获一系列信息,包括设计的有用性,独特性和新颖性。这些概念的主观性质使它们难以评价。尽管如此,人们还是做出了许多尝试,并开发了度量标准,因为设计评估是创新和创造新解决方案的组成部分。最常用的指标是共识评估技术(CAT)和Shah, Vargas-Hernandez, and Smith (SVS)方法。虽然CAT是准确的,并且经常被视为“黄金标准”,但它严重依赖于使用专家评级作为判断的基础,这使得CAT昂贵且耗时。相比之下,SVS对资源的需求较少,但经常被批评缺乏灵敏度和准确性。我们的目标是通过机器学习来利用这两种方法的独特优势。更具体地说,本研究旨在探讨使用机器学习促进自动化创造力评估的可能性。SVS方法产生一个关于设计的文本丰富的数据集。在本文中,我们利用这些文本设计表示和单词和句子编码的深层语义关系来预测更理想的设计度量,包括CAT度量。我们展示了机器学习模型从设计本身和SVS调查信息中预测设计指标的能力。我们证明,结合自然语言处理(NLP)可以提高我们所有设计指标的预测结果,并且某些指标的可预测性存在明显的区别。我们的代码和关于我们工作的其他信息可以在http://decode.mit.edu/projects/nlp-design-eval/上获得。
If a Picture is Worth 1000 Words, Is a Word Worth 1000 Features for Design Metric Estimation?
A picture is worth a thousand words, and in design metric estimation, a word may be worth a thousand features. Pictures are awarded this worth because of their ability to encode a plethora of information. When evaluating designs, we aim to capture a range of information as well, information including usefulness, uniqueness, and novelty of a design. The subjective nature of these concepts makes their evaluation difficult. Despite this, many attempts have been made and metrics developed to do so, because design evaluation is integral to innovation and the creation of novel solutions. The most common metrics used are the consensual assessment technique (CAT) and the Shah, Vargas-Hernandez, and Smith (SVS) method. While CAT is accurate and often regarded as the “gold standard,” it heavily relies on using expert ratings as a basis for judgement, making CAT expensive and time consuming. Comparatively, SVS is less resource-demanding, but it is often criticized as lacking sensitivity and accuracy. We aim to take advantage of the distinct strengths of both methods through machine learning. More specifically, this study seeks to investigate the possibility of using machine learning to facilitate automated creativity assessment. The SVS method results in a text-rich dataset about a design. In this paper we utilize these textual design representations and the deep semantic relationships that words and sentences encode, to predict more desirable design metrics, including CAT metrics. We demonstrate the ability of machine learning models to predict design metrics from the design itself and SVS Survey information. We demonstrate that incorporating natural language processing (NLP) improves prediction results across all of our design metrics, and that clear distinctions in the predictability of certain metrics exist. Our code and additional information about our work are available at http://decode.mit.edu/projects/nlp-design-eval/.