Application of Data Mining Techniques in the Analysis of Acoustic Sound Characteristics

Journal of information technology & software engineering Pub Date : 2018-01-01 DOI:10.4172/2165-7866.1000238

Mojtaba Talafi Daryani, Hossein Khabiri, Zahra Yamini

{"title":"Application of Data Mining Techniques in the Analysis of Acoustic Sound Characteristics","authors":"Mojtaba Talafi Daryani, Hossein Khabiri, Zahra Yamini","doi":"10.4172/2165-7866.1000238","DOIUrl":null,"url":null,"abstract":"In recent years, the analysis of acoustic characteristics of speech and sound has been one of the areas that data mining has found its way through. The present research study is also related to this topic which aims to detect the gender of the speaker by using the acoustic feature of his voice. In this research, the data set includes 3,168 recorded voice samples gathered from female and male speakers. Through acoustic analysis, 20 characteristics along with the desired labels were extracted and prepared for the data mining process. Finally, using Python programming language tools, 6 different techniques were used to construct an appropriate problem-solving model. These techniques were: support vector machines, logistic regression, random forest, regression and classification trees, adaptive boosting, and K-nearest neighbor. The accuracy of the models was also compared with each other. The obtained results revealed that the accuracy of all these techniques was sufficiently high (above 90%) for solving the problem and the model made by them had the necessary efficiency for classification. Moreover, the obtained model was specifically evaluated through decision tree and some principles and rules existing in the model were extracted. As a result, it was revealed that average fundamental frequency measured across the audio signal is the key characteristic of the sound for the evaluation of the voice gender to the extent that it plays a key role in data classification.","PeriodicalId":91908,"journal":{"name":"Journal of information technology & software engineering","volume":"08 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2018-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of information technology & software engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4172/2165-7866.1000238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

In recent years, the analysis of acoustic characteristics of speech and sound has been one of the areas that data mining has found its way through. The present research study is also related to this topic which aims to detect the gender of the speaker by using the acoustic feature of his voice. In this research, the data set includes 3,168 recorded voice samples gathered from female and male speakers. Through acoustic analysis, 20 characteristics along with the desired labels were extracted and prepared for the data mining process. Finally, using Python programming language tools, 6 different techniques were used to construct an appropriate problem-solving model. These techniques were: support vector machines, logistic regression, random forest, regression and classification trees, adaptive boosting, and K-nearest neighbor. The accuracy of the models was also compared with each other. The obtained results revealed that the accuracy of all these techniques was sufficiently high (above 90%) for solving the problem and the model made by them had the necessary efficiency for classification. Moreover, the obtained model was specifically evaluated through decision tree and some principles and rules existing in the model were extracted. As a result, it was revealed that average fundamental frequency measured across the audio signal is the key characteristic of the sound for the evaluation of the voice gender to the extent that it plays a key role in data classification.

查看原文本刊更多论文

数据挖掘技术在声学声学特征分析中的应用

近年来，语音和声音的声学特性分析已成为数据挖掘的一个领域。本研究也与这一主题相关，即利用说话人声音的声学特征来判断说话人的性别。在本研究中，数据集包括从女性和男性演讲者中收集的3,168个录音语音样本。通过声学分析，提取20个特征以及所需的标签，并为数据挖掘过程做好准备。最后，使用Python编程语言工具，使用6种不同的技术来构建合适的问题解决模型。这些技术包括:支持向量机、逻辑回归、随机森林、回归与分类树、自适应增强和k近邻。并对模型的精度进行了比较。得到的结果表明，所有这些技术的精度都足够高(90%以上)，可以解决问题，并且由它们制作的模型具有必要的分类效率。并通过决策树对得到的模型进行具体评价，提取模型中存在的一些原则和规则。结果表明，在整个音频信号中测量的平均基频是声音性别评估的关键特征，它在数据分类中起着关键作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of information technology & software engineering

自引率

0.00%

发文量