Mitigating the impact of speech recognition errors on chatbot using sequence-to-sequence model

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) Pub Date : 2017-09-22 DOI:10.1109/ASRU.2017.8268977

Pin-Jung Chen, I-Hung Hsu, Yi Yao Huang, Hung-yi Lee

引用次数: 15

Abstract

We apply sequence-to-sequence model to mitigate the impact of speech recognition errors on open domain end-to-end dialog generation. We cast the task as a domain adaptation problem where ASR transcriptions and original texts are in two different domains. In this paper, our proposed model includes two individual encoders for each domain data and make their hidden states similar to ensure the decoder predict the same dialog text. The method demonstrates that the sequence-to-sequence model can learn the ASR transcriptions and original text pair having the same meaning and eliminate the speech recognition errors. Experimental results on Cornell movie dialog dataset demonstrate that the domain adaption system help the spoken dialog system generate more similar responses with the original text answers.

查看原文本刊更多论文

利用序列对序列模型减轻语音识别错误对聊天机器人的影响

我们采用序列到序列模型来减轻语音识别错误对开放域端到端对话生成的影响。我们将该任务视为一个领域适应问题，其中ASR转录和原始文本位于两个不同的领域。在本文中，我们提出的模型为每个领域数据包括两个独立的编码器，并使它们的隐藏状态相似，以确保解码器预测相同的对话文本。该方法表明，序列到序列模型可以学习到具有相同含义的ASR转录词和原始文本对，消除了语音识别错误。在Cornell电影对话数据集上的实验结果表明，领域适应系统可以帮助口语对话系统生成与原始文本答案更相似的响应。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

自引率

0.00%

发文量