{"title":"Utility of Crowdsourced User Experiments for Measuring the Central Tendency of User Performance to Evaluate Error-Rate Models on GUIs","authors":"Shota Yamanaka","doi":"10.1609/hcomp.v9i1.18948","DOIUrl":null,"url":null,"abstract":"The usage of crowdsourcing to recruit numerous participants has been recognized as beneficial in the human-computer interaction (HCI) field, such as for designing user interfaces and validating user performance models.\nIn this work, we investigate its effectiveness for evaluating an error-rate prediction model in target pointing tasks.\nIn contrast to models for operational times, a clicking error (i.e., missing a target) occurs by chance at a certain probability, e.g., 5%.\nTherefore, in traditional laboratory-based experiments, a lot of repetitions are needed to measure the central tendency of error rates.\nWe hypothesize that recruiting many workers would enable us to keep the number of repetitions per worker much smaller.\nWe collected data from 384 workers and found that existing models on operational time and error rate showed good fits (both R^2 > 0.95).\nA simulation where we changed the number of participants N_P and the number of repetitions N_repeat showed that the time prediction model was robust against small N_P and N_repeat, although the error-rate model fitness was considerably degraded.\nThese findings empirically demonstrate a new utility of crowdsourced user experiments for collecting numerous participants, which should be of great use to HCI researchers for their evaluation studies.","PeriodicalId":87339,"journal":{"name":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","volume":"19 1","pages":"155-165"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... AAAI Conference on Human Computation and Crowdsourcing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/hcomp.v9i1.18948","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
The usage of crowdsourcing to recruit numerous participants has been recognized as beneficial in the human-computer interaction (HCI) field, such as for designing user interfaces and validating user performance models.
In this work, we investigate its effectiveness for evaluating an error-rate prediction model in target pointing tasks.
In contrast to models for operational times, a clicking error (i.e., missing a target) occurs by chance at a certain probability, e.g., 5%.
Therefore, in traditional laboratory-based experiments, a lot of repetitions are needed to measure the central tendency of error rates.
We hypothesize that recruiting many workers would enable us to keep the number of repetitions per worker much smaller.
We collected data from 384 workers and found that existing models on operational time and error rate showed good fits (both R^2 > 0.95).
A simulation where we changed the number of participants N_P and the number of repetitions N_repeat showed that the time prediction model was robust against small N_P and N_repeat, although the error-rate model fitness was considerably degraded.
These findings empirically demonstrate a new utility of crowdsourced user experiments for collecting numerous participants, which should be of great use to HCI researchers for their evaluation studies.