{"title":"A Question of Fidelity: Comparing Different User Testing Methods for Evaluating In-Car Prompts","authors":"Anna-Maria Meck, C. Draxler, Thurid Vogt","doi":"10.1145/3543829.3544519","DOIUrl":null,"url":null,"abstract":"User studies are a major component in any user-centered design process. Testing methods thereby vary tremendously regarding the organizational, financial, and timely effort needed to conduct them. Driving simulator studies generally are the method of choice when dialogs need to be validated for in-car settings. These studies are highly time- and cost-consuming though. Online crowdsourcing studies can be an alternative as they allow for quick results and large sample sizes while at the same time being time- and cost-efficient. Still, voice user interface designers argue for a lack of applicability to concrete use cases. This is especially true for speech dialog systems in an in-car context where users experience voice as a secondary task with the primary task being driving. To compare the validity of different user testing methods, study participants in a between-subjects study design evaluated proactive in-car prompts presented a) in an online crowdsourcing study in text form, b) in an online crowdsourcing study via audio, and c) in a driving simulator. Prompt evaluations did not differ significantly between conditions a) and c) but diverged for condition b). Findings are explained by drawing from the Elaboration Likelihood Model and used to answer the question of how to efficiently validate in-car prompts.","PeriodicalId":138046,"journal":{"name":"Proceedings of the 4th Conference on Conversational User Interfaces","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th Conference on Conversational User Interfaces","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3543829.3544519","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
User studies are a major component of any user-centered design process, yet testing methods vary tremendously in the organizational, financial, and time effort needed to conduct them. Driving simulator studies are generally the method of choice when dialogs need to be validated for in-car settings, but they are highly time- and cost-intensive. Online crowdsourcing studies can be an alternative, as they allow for quick results and large sample sizes while remaining time- and cost-efficient. Still, voice user interface designers argue that they lack applicability to concrete use cases. This is especially true for speech dialog systems in an in-car context, where users experience voice interaction as a secondary task, the primary task being driving. To compare the validity of different user testing methods, participants in a between-subjects study design evaluated proactive in-car prompts presented a) in an online crowdsourcing study in text form, b) in an online crowdsourcing study via audio, and c) in a driving simulator. Prompt evaluations did not differ significantly between conditions a) and c) but diverged for condition b). The findings are explained by drawing on the Elaboration Likelihood Model and are used to answer the question of how to efficiently validate in-car prompts.
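To make the reported three-condition comparison concrete, the sketch below shows one way prompt ratings from a between-subjects design could be compared across the text, audio, and simulator conditions. This is a minimal illustration, not the authors' analysis: the abstract does not specify the statistical procedure, so the choice of a one-way ANOVA with Bonferroni-corrected pairwise follow-ups, the rating scale, the group sizes, and all variable names are assumptions made here for demonstration only.

```python
# Minimal sketch of a three-group between-subjects comparison (not the paper's code).
# All data below are simulated placeholders; the rating scale, group sizes, and
# test choices are illustrative assumptions.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Hypothetical per-participant prompt ratings for the three conditions
ratings = {
    "text_crowdsourcing":  rng.normal(5.2, 1.0, 40),   # condition a)
    "audio_crowdsourcing": rng.normal(4.5, 1.0, 40),   # condition b)
    "driving_simulator":   rng.normal(5.1, 1.0, 40),   # condition c)
}

# Omnibus test across the three independent groups
f_stat, p_omnibus = stats.f_oneway(*ratings.values())
print(f"one-way ANOVA: F = {f_stat:.2f}, p = {p_omnibus:.4f}")

# Pairwise follow-ups (Bonferroni-corrected) to see which conditions diverge,
# mirroring the a)-vs-c) and b)-vs-rest contrasts reported in the abstract
pairs = [("text_crowdsourcing", "driving_simulator"),
         ("text_crowdsourcing", "audio_crowdsourcing"),
         ("audio_crowdsourcing", "driving_simulator")]
for g1, g2 in pairs:
    t, p = stats.ttest_ind(ratings[g1], ratings[g2])
    p_corr = min(p * len(pairs), 1.0)
    print(f"{g1} vs {g2}: t = {t:.2f}, corrected p = {p_corr:.4f}")
```

Under such an analysis, a non-significant a)-vs-c) contrast alongside significant contrasts involving b) would correspond to the pattern of results the abstract describes.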