On Inclusion: Video Analysis of Older Adult Interactions with a Multi-Modal Voice Assistant in a Public Setting

Andrea Cuadra, Hyein Baek, D. Estrin, Malte F. Jung, Nicola Dell
Proceedings of the 2022 International Conference on Information and Communication Technologies and Development
Published: 2022-06-27
DOI: 10.1145/3572334.3572371 (https://doi.org/10.1145/3572334.3572371)
Citation count: 3

Abstract

Older adults around the world lack access to a wide range of potentially life-changing digital applications, services, and information that could be provided by voice assistants (such as Amazon’s Alexa, Google’s Assistant, or Apple’s Siri). However, older adults’ needs are underrepresented in the design of voice assistants. Because of this, we are missing opportunities for digital inclusion and increasing the risk of excluding older adults as these devices permeate public settings. In this work, we video-record older adults (n=26) interacting with a multi-modal voice assistant while waiting in line at food pantries, and use Interaction Analysis to draw insights from these recordings. We find that by being agnostic to body language, audio-prosodic features, and other contextual factors, voice assistants fail to capture and react to some important aspects of interactions. We discuss design implications (e.g., interpreting users’ posture as a cue to wake the device when they are leaning towards it) and research implications (e.g., surveillance trade-offs), and argue for the use of multi-modal inputs with attention to privacy. Designing and training voice assistants to take in and appropriately respond to non-verbal cues may increase their inclusivity, helping them fulfill important needs of our aging population.