Voice and Touch Based Error-tolerant Multimodal Text Editing and Correction for Smartphones.

Maozheng Zhao, Wenzhe Cui, I V Ramakrishnan, Shumin Zhai, Xiaojun Bi
DOI: 10.1145/3472749.3474742 (https://doi.org/10.1145/3472749.3474742)
Published in: Proceedings of the ACM Symposium on User Interface Software and Technology, vol. 2021, pp. 162-178
Publication date: 2021-10-01 (Epub 2021-10-12)
Full-text PDF: https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/02/ef/nihms-1777404.PMC8845054.pdf
Citations: 5

Abstract

Editing operations such as cut, copy, paste, and correcting errors in typed text are often tedious and challenging to perform on smartphones. In this paper, we present VT, a voice and touch-based multimodal text editing and correction method for smartphones. To edit text with VT, the user glides over a text fragment with a finger and dictates a command, such as "bold" to change the format of the fragment, or taps inside a text area and speaks a command such as "highlight this paragraph". To correct text, the user taps approximately on the erroneous text fragment and dictates the new content for substitution or insertion. VT combines touch and voice inputs with language context, such as a language model and phrase similarity, to infer the user's editing intention, so it can handle ambiguous and noisy input signals. This is a significant advantage over existing error correction methods (e.g., iOS's Voice Control), which require precise cursor control or text selection. Our evaluation shows that VT significantly improves the efficiency of text editing and text correcting on smartphones over the touch-only method and iOS's Voice Control. In our user studies, VT reduced text editing time by 30.80% and text correcting time by 29.97% relative to the touch-only method, and reduced text editing time by 30.81% and text correcting time by 47.96% relative to iOS's Voice Control.
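
The idea of combining an approximate tap with dictated content can be illustrated with a minimal sketch. This is not the authors' algorithm: the function names, the Gaussian touch model, the `sigma` and `touch_weight` parameters, and the use of character-level string similarity are all assumptions made for illustration, and the language-model component mentioned in the abstract is omitted entirely.

```python
import math
from difflib import SequenceMatcher


def candidate_spans(words, max_len):
    """Yield (start, end) word-index pairs for every span of up to max_len words."""
    for start in range(len(words)):
        for end in range(start + 1, min(start + max_len, len(words)) + 1):
            yield start, end


def infer_substitution(text, tap_index, dictated, sigma=8.0, touch_weight=0.4):
    """Replace the span that best matches both the tap and the dictated phrase.

    Hypothetical scoring (an assumption, not taken from the paper):
    score = touch_weight * touch_proximity + (1 - touch_weight) * similarity
    """
    words = text.split(" ")

    # Character offset at which each word starts.
    offsets = []
    pos = 0
    for word in words:
        offsets.append(pos)
        pos += len(word) + 1  # +1 for the separating space

    # Consider spans up to one word longer than the dictated phrase.
    max_len = len(dictated.split()) + 1

    best = None
    for start, end in candidate_spans(words, max_len):
        span_text = " ".join(words[start:end])
        span_begin = offsets[start]
        span_end = offsets[end - 1] + len(words[end - 1])
        center = (span_begin + span_end) / 2

        # Touch evidence: Gaussian falloff with distance from the tap.
        touch = math.exp(-((tap_index - center) ** 2) / (2 * sigma ** 2))
        # Voice evidence: how closely the dictated text resembles the span.
        similarity = SequenceMatcher(None, span_text.lower(), dictated.lower()).ratio()

        score = touch_weight * touch + (1 - touch_weight) * similarity
        if best is None or score > best[0]:
            best = (score, start, end)

    _, start, end = best
    return " ".join(words[:start] + [dictated] + words[end:])


if __name__ == "__main__":
    # The tap lands on "you" (character 12), one word to the right of the error.
    print(infer_substitution("I will meat you at the station", 12, "meet"))
```

In this sketch the tap only needs to land near the error: the similarity term pulls the choice toward "meat" even though the tap falls on the neighbouring word, which is the kind of error tolerance the abstract describes. A fuller treatment would also need to decide between substitution and insertion, which this sketch does not attempt.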
