基于HMM的Isarn数字语音识别

2017 2nd International Conference on Information Technology (INCIT) Pub Date : 2017-11-01 DOI:10.1109/INCIT.2017.8257882

Sasithron Sangjamraschaikun, Pusadee Seresangtakul

{"title":"基于HMM的Isarn数字语音识别","authors":"Sasithron Sangjamraschaikun, Pusadee Seresangtakul","doi":"10.1109/INCIT.2017.8257882","DOIUrl":null,"url":null,"abstract":"Herein we present an automatic digit-speech recognition system for the Isarn language, which is a dialect spoken in the northeast of Thailand. In this work, an Isarn digit corpus was collected from natives speakers. The system utilizes the Mel Frequency Cepstral Coefficients (MFCC) technique to extract speech features, and the Hidden Markov Model (HMM) classifier for speech recognition. The paper focuses on isolated and continuous speech recognition for speakers (dependent and independent) uttering Isarn numerals (from 0 through 999). The system was evaluated by correctness. The results obtained from isolated recognition in speaker dependence and speaker independence were 90.00% and 79.80%, respectively; whereas continuous recognition provided results of 89.16% in speaker dependence and 82.47% in speaker dependence.","PeriodicalId":405827,"journal":{"name":"2017 2nd International Conference on Information Technology (INCIT)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Isarn digit speech recognition using HMM\",\"authors\":\"Sasithron Sangjamraschaikun, Pusadee Seresangtakul\",\"doi\":\"10.1109/INCIT.2017.8257882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Herein we present an automatic digit-speech recognition system for the Isarn language, which is a dialect spoken in the northeast of Thailand. In this work, an Isarn digit corpus was collected from natives speakers. The system utilizes the Mel Frequency Cepstral Coefficients (MFCC) technique to extract speech features, and the Hidden Markov Model (HMM) classifier for speech recognition. The paper focuses on isolated and continuous speech recognition for speakers (dependent and independent) uttering Isarn numerals (from 0 through 999). The system was evaluated by correctness. The results obtained from isolated recognition in speaker dependence and speaker independence were 90.00% and 79.80%, respectively; whereas continuous recognition provided results of 89.16% in speaker dependence and 82.47% in speaker dependence.\",\"PeriodicalId\":405827,\"journal\":{\"name\":\"2017 2nd International Conference on Information Technology (INCIT)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 2nd International Conference on Information Technology (INCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INCIT.2017.8257882\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd International Conference on Information Technology (INCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INCIT.2017.8257882","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

在这里，我们提出了一个自动数字语音识别系统的Isarn语言，这是在泰国东北部的一种方言。在这项工作中，收集了来自母语使用者的Isarn数字语料库。该系统利用Mel频率倒谱系数(MFCC)技术提取语音特征，利用隐马尔可夫模型(HMM)分类器进行语音识别。本文的重点是孤立的和连续的语音识别发言者(依赖和独立)说出Isarn数字(从0到999)。对系统进行了正确性评价。独立识别在说话人依赖性和说话人独立性方面的结果分别为90.00%和79.80%;而连续识别在说话人依赖和说话人依赖方面的得分分别为89.16%和82.47%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Isarn digit speech recognition using HMM

Herein we present an automatic digit-speech recognition system for the Isarn language, which is a dialect spoken in the northeast of Thailand. In this work, an Isarn digit corpus was collected from natives speakers. The system utilizes the Mel Frequency Cepstral Coefficients (MFCC) technique to extract speech features, and the Hidden Markov Model (HMM) classifier for speech recognition. The paper focuses on isolated and continuous speech recognition for speakers (dependent and independent) uttering Isarn numerals (from 0 through 999). The system was evaluated by correctness. The results obtained from isolated recognition in speaker dependence and speaker independence were 90.00% and 79.80%, respectively; whereas continuous recognition provided results of 89.16% in speaker dependence and 82.47% in speaker dependence.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 2nd International Conference on Information Technology (INCIT)

自引率

0.00%

发文量