OOV Handling Using Partial Lemma-Based Language Model in LF-MMI Based ASR for Bahasa Indonesia

2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM) | Pub Date: 2022-11-22 | DOI: 10.1109/CENIM56801.2022.10037479

Agung Santosa, Asril Jarin, E. M. Yuniarno, Hammam Riza, M. Purnomo


Citations: 1

Abstract


A common problem in automatic speech recognition (ASR) is the presence of out-of-vocabulary (OOV) words in an utterance, which can degrade system performance. Bahasa Indonesia, as an agglutinative language, forms words by attaching affixes to root words. We propose a partial lemma-based language model (LM) and lexicon that can handle words created by affixation. Both are derived from the original LM and lexicon, using the output of a morphology analyzer as a reference. The experiment shows that using this LM in an ASR system trained with the lattice-free maximum mutual information (LF-MMI) cost function gives a better word error rate (WER) when the heuristic for inserting inter-word short pauses is modified to also consider the affixes.
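The abstract does not spell out how words are split, so the following is only a minimal sketch of the general idea of rewriting LM text into affix and lemma tokens. The analysis table, the `+` marker convention, and the function names are all assumptions made for illustration and are not taken from the paper; a real system would take its analyses from a morphology analyzer.

```python
from typing import Dict, List, Tuple

# Hypothetical analyzer output: word -> (prefixes, lemma, suffixes).
# These entries are hand-written for illustration, not from the paper.
Analysis = Tuple[List[str], str, List[str]]

ANALYSES: Dict[str, Analysis] = {
    "membaca": (["mem"], "baca", []),
    "dibaca": (["di"], "baca", []),
    "bacaan": ([], "baca", ["an"]),
    "makanan": ([], "makan", ["an"]),
}


def segment_word(word: str, analyses: Dict[str, Analysis]) -> List[str]:
    """Split a word into affix and lemma tokens; unanalyzed words stay whole.

    Prefixes are written as "xx+" and suffixes as "+xx" (an assumed
    convention) so that the original word can be reassembled later.
    """
    if word not in analyses:
        return [word]
    prefixes, lemma, suffixes = analyses[word]
    return [p + "+" for p in prefixes] + [lemma] + ["+" + s for s in suffixes]


def segment_sentence(sentence: str, analyses: Dict[str, Analysis]) -> List[str]:
    """Lowercase a whitespace-tokenized sentence and segment each word."""
    tokens: List[str] = []
    for word in sentence.split():
        tokens.extend(segment_word(word.lower(), analyses))
    return tokens


def join_tokens(tokens: List[str]) -> str:
    """Reassemble surface words from affix-marked tokens."""
    words: List[str] = []
    glue_next = False  # True when the previous token was a prefix ("xx+")
    for tok in tokens:
        attach_left = tok.startswith("+")
        core = tok.strip("+")
        if (attach_left or glue_next) and words:
            words[-1] += core
        else:
            words.append(core)
        glue_next = tok.endswith("+")
    return " ".join(words)


if __name__ == "__main__":
    tokens = segment_sentence("Dia membaca bacaan", ANALYSES)
    print(tokens)
    print(join_tokens(tokens))
```

Under this assumed scheme, an affixed form that never appears as a whole word in the training text can still be represented as long as its affix tokens and lemma are in the vocabulary, which is the kind of coverage the abstract attributes to the partial lemma-based LM and lexicon.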
