TieLent

Proceedings of the International Conference on Advanced Visual Interfaces Pub Date : 2020-09-28 DOI:10.1145/3399715.3399852

N. Kimura, Kentaro Hayashi, J. Rekimoto

引用次数: 13

Abstract

With the increased use of smart speakers, silent speech interaction (SSI) is attracting attention. Unfortunately, traditional silent speech interaction methods require the addition of obtrusive sensors and devices around the user's face, making wearability and portability a challenge. Considering that most uses for smart speakers do not require many words, we suggest a more casual approach, TieLent, which can easily be worn between the neck and the chest. TieLent's RGB camera is set away from the user's face, presenting less interference with the user. Although TieLent's camera is not able to capture the whole mouth, when combined with our image-to-speech neural network model, it is able to generate the recognizable speech of 15 commands with an average accuracy of 94%.

查看原文本刊更多论文

TieLent

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the International Conference on Advanced Visual Interfaces

自引率

0.00%

发文量