Tagalog regional accent classification in the Philippines

2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) Pub Date : 2017-12-01 DOI:10.1109/HNICEM.2017.8269545

Glorianne Danao, J. Torres, Jamila Vi Tubio, L. Vea

{"title":"Tagalog regional accent classification in the Philippines","authors":"Glorianne Danao, J. Torres, Jamila Vi Tubio, L. Vea","doi":"10.1109/HNICEM.2017.8269545","DOIUrl":null,"url":null,"abstract":"Accent classification has been a focus on recent computational researches since it directly influence the performance of automatic speech recognition technologies. In this paper, we aimed to automatically classify Tagalog accented speech of speakers from Region IV-A, Philippines. Speech and voice data were collected from 150 local residents with strong accent from the 15 towns of five (5) provinces of the region, namely: Batangas, Cavite, Laguna, Quezon and Rizal. The data gathered was cleaned and denoised using Audacity sound editor software. We then extracted some voice features from the cleaned data using PRAAT application software. These include: harmony, pitch, intensity, power, LFCC and MFCC. We tried several data mining tools to address our objectives. Results showed that MultiLayerPerceptron (MLP) classifier gave the most significant result. Among the towns that have distinct variety of accent are: Talisay, Batangas; Maragondon, Cavite; Paete, Laguna; Lucban, Quezon; and Taytay, Rizal. The significant features that classifies tagalog accent among these towns are: standardDeviationPitch, maximumHarmony, minimumIntensity, standardDeviationIntensity, minimumLPC, meanLPC, LFCC and standardDeviationMelFilter.","PeriodicalId":104407,"journal":{"name":"2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM.2017.8269545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Accent classification has been a focus on recent computational researches since it directly influence the performance of automatic speech recognition technologies. In this paper, we aimed to automatically classify Tagalog accented speech of speakers from Region IV-A, Philippines. Speech and voice data were collected from 150 local residents with strong accent from the 15 towns of five (5) provinces of the region, namely: Batangas, Cavite, Laguna, Quezon and Rizal. The data gathered was cleaned and denoised using Audacity sound editor software. We then extracted some voice features from the cleaned data using PRAAT application software. These include: harmony, pitch, intensity, power, LFCC and MFCC. We tried several data mining tools to address our objectives. Results showed that MultiLayerPerceptron (MLP) classifier gave the most significant result. Among the towns that have distinct variety of accent are: Talisay, Batangas; Maragondon, Cavite; Paete, Laguna; Lucban, Quezon; and Taytay, Rizal. The significant features that classifies tagalog accent among these towns are: standardDeviationPitch, maximumHarmony, minimumIntensity, standardDeviationIntensity, minimumLPC, meanLPC, LFCC and standardDeviationMelFilter.

查看原文本刊更多论文

菲律宾的他加禄语地区口音分类

口音分类直接影响语音自动识别技术的性能，是近年来计算研究的热点。在本文中，我们旨在自动分类菲律宾IV-A地区说话者的他加禄语重音语音。语音和语音数据来自该地区五省(5)15个城镇的150名重口音当地居民，分别是:巴丹加斯、卡菲特、拉古纳、奎松和黎萨尔。收集的数据使用Audacity声音编辑软件进行清理和去噪。然后，我们使用PRAAT应用软件从清洗后的数据中提取一些语音特征。这些包括:和声、音高、强度、力量、LFCC和MFCC。我们尝试了几种数据挖掘工具来实现我们的目标。结果表明，多层感知器(MultiLayerPerceptron, MLP)分类器的分类效果最为显著。具有不同口音的城镇有:塔利赛，八打雁;Maragondon,甲米地;Paete,拉古纳;Lucban奎松城;Taytay, Rizal。在这些城镇中分类标签音的重要特征是:standardDeviationPitch, maximumHarmony, minimumIntensity, standardDeviationIntensity, minimumLPC, meanLPC, LFCC和standardDeviationMelFilter。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

自引率

0.00%

发文量