{"title":"Application of Data Mining Techniques in the Analysis of Acoustic Sound Characteristics","authors":"Mojtaba Talafi Daryani, Hossein Khabiri, Zahra Yamini","doi":"10.4172/2165-7866.1000238","DOIUrl":null,"url":null,"abstract":"In recent years, the analysis of acoustic characteristics of speech and sound has been one of the areas that data mining has found its way through. The present research study is also related to this topic which aims to detect the gender of the speaker by using the acoustic feature of his voice. In this research, the data set includes 3,168 recorded voice samples gathered from female and male speakers. Through acoustic analysis, 20 characteristics along with the desired labels were extracted and prepared for the data mining process. Finally, using Python programming language tools, 6 different techniques were used to construct an appropriate problem-solving model. These techniques were: support vector machines, logistic regression, random forest, regression and classification trees, adaptive boosting, and K-nearest neighbor. The accuracy of the models was also compared with each other. The obtained results revealed that the accuracy of all these techniques was sufficiently high (above 90%) for solving the problem and the model made by them had the necessary efficiency for classification. Moreover, the obtained model was specifically evaluated through decision tree and some principles and rules existing in the model were extracted. As a result, it was revealed that average fundamental frequency measured across the audio signal is the key characteristic of the sound for the evaluation of the voice gender to the extent that it plays a key role in data classification.","PeriodicalId":91908,"journal":{"name":"Journal of information technology & software engineering","volume":"08 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of information technology & software engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4172/2165-7866.1000238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
In recent years, the analysis of acoustic characteristics of speech and sound has been one of the areas that data mining has found its way through. The present research study is also related to this topic which aims to detect the gender of the speaker by using the acoustic feature of his voice. In this research, the data set includes 3,168 recorded voice samples gathered from female and male speakers. Through acoustic analysis, 20 characteristics along with the desired labels were extracted and prepared for the data mining process. Finally, using Python programming language tools, 6 different techniques were used to construct an appropriate problem-solving model. These techniques were: support vector machines, logistic regression, random forest, regression and classification trees, adaptive boosting, and K-nearest neighbor. The accuracy of the models was also compared with each other. The obtained results revealed that the accuracy of all these techniques was sufficiently high (above 90%) for solving the problem and the model made by them had the necessary efficiency for classification. Moreover, the obtained model was specifically evaluated through decision tree and some principles and rules existing in the model were extracted. As a result, it was revealed that average fundamental frequency measured across the audio signal is the key characteristic of the sound for the evaluation of the voice gender to the extent that it plays a key role in data classification.