Glorianne Danao, J. Torres, Jamila Vi Tubio, L. Vea
{"title":"Tagalog regional accent classification in the Philippines","authors":"Glorianne Danao, J. Torres, Jamila Vi Tubio, L. Vea","doi":"10.1109/HNICEM.2017.8269545","DOIUrl":null,"url":null,"abstract":"Accent classification has been a focus on recent computational researches since it directly influence the performance of automatic speech recognition technologies. In this paper, we aimed to automatically classify Tagalog accented speech of speakers from Region IV-A, Philippines. Speech and voice data were collected from 150 local residents with strong accent from the 15 towns of five (5) provinces of the region, namely: Batangas, Cavite, Laguna, Quezon and Rizal. The data gathered was cleaned and denoised using Audacity sound editor software. We then extracted some voice features from the cleaned data using PRAAT application software. These include: harmony, pitch, intensity, power, LFCC and MFCC. We tried several data mining tools to address our objectives. Results showed that MultiLayerPerceptron (MLP) classifier gave the most significant result. Among the towns that have distinct variety of accent are: Talisay, Batangas; Maragondon, Cavite; Paete, Laguna; Lucban, Quezon; and Taytay, Rizal. The significant features that classifies tagalog accent among these towns are: standardDeviationPitch, maximumHarmony, minimumIntensity, standardDeviationIntensity, minimumLPC, meanLPC, LFCC and standardDeviationMelFilter.","PeriodicalId":104407,"journal":{"name":"2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017IEEE 9th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HNICEM.2017.8269545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Accent classification has been a focus on recent computational researches since it directly influence the performance of automatic speech recognition technologies. In this paper, we aimed to automatically classify Tagalog accented speech of speakers from Region IV-A, Philippines. Speech and voice data were collected from 150 local residents with strong accent from the 15 towns of five (5) provinces of the region, namely: Batangas, Cavite, Laguna, Quezon and Rizal. The data gathered was cleaned and denoised using Audacity sound editor software. We then extracted some voice features from the cleaned data using PRAAT application software. These include: harmony, pitch, intensity, power, LFCC and MFCC. We tried several data mining tools to address our objectives. Results showed that MultiLayerPerceptron (MLP) classifier gave the most significant result. Among the towns that have distinct variety of accent are: Talisay, Batangas; Maragondon, Cavite; Paete, Laguna; Lucban, Quezon; and Taytay, Rizal. The significant features that classifies tagalog accent among these towns are: standardDeviationPitch, maximumHarmony, minimumIntensity, standardDeviationIntensity, minimumLPC, meanLPC, LFCC and standardDeviationMelFilter.