Unsupervised Word Segmentation from Speech with Attention

ArXiv Pub Date : 2018-06-18 DOI:10.21437/Interspeech.2018-1308

Pierre Godard, Marcely Zanon Boito, Lucas Ondel, Alexandre Berard, François Yvon, Aline Villavicencio, L. Besacier

引用次数: 28

Abstract

We present a first attempt to perform attentional word segmen-tation directly from the speech signal, with the final goal to automatically identify lexical units in a low-resource, unwritten language (UL). Our methodology assumes a pairing between recordings in the UL with translations in a well-resourced language. It uses Acoustic Unit Discovery (AUD) to convert speech into a sequence of pseudo-phones that is segmented using neural soft-alignments produced by a neural machine translation model. Evaluation uses an actual Bantu UL, Mboshi; comparisons to monolingual and bilingual baselines illustrate the potential of attentional word segmentation for language documentation.

查看原文本刊更多论文

注意语音的无监督分词

我们提出了直接从语音信号中执行注意分词的第一次尝试，其最终目标是自动识别低资源的非书面语言(UL)中的词汇单位。我们的方法假设UL中的录音与资源丰富的语言的翻译之间是配对的。它使用声学单元发现(AUD)将语音转换成一系列伪电话，这些伪电话使用神经机器翻译模型产生的神经软对齐进行分割。评估使用实际班图UL, Mboshi;与单语和双语基线的比较说明了语言文档中注意分词的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ArXiv

自引率

0.00%

发文量