Contrast of Gaussian Mixture Model and Clustering Algorithm for Singer Identification

D. Dharini, A. Revathy, M. Kalaivani
{"title":"Contrast of Gaussian Mixture Model and Clustering Algorithm for Singer Identification","authors":"D. Dharini, A. Revathy, M. Kalaivani","doi":"10.1109/ICCCI.2018.8441491","DOIUrl":null,"url":null,"abstract":"The intension is to provide the contrast between Clustering Algorithm and Gaussian Mixture Model using Perceptual Linear Prediction features to assess the singer identification structure using two phases, phase 1 as training and phase 2 as testing over the film tracks(vocal with background music). The intent of assessing of singer is to categorize different singers impartial of data that is trained in phase 1. The aspects for two phases are executed for downright tracks from films for 20 different singers. In phase 1 aspects, for individual singer 15 tracks are loaded as input data. Now loaded datas are shaped to go through a deck of pre-handling steps. The pre-handling steps includes three more internal stages with stage1 as Pre-emphasis, stage2 as Frame Blocking and stage3 as Windowing. From individual context of pre-handled signal PLP features are evolved. Using the K-Means Clustering Algorithm and GMM the phase1 output is developed for individual singers. In Clustering algorithm the singer is categorized deployed with choice of the model that gives mean value as minimum. In GMM, by using Maximum Likelihood (ML) algorithm singers are categorized deployed with choice of the model that gives maximum likelihood. Depending on identity of accuracy the singer identification structure is performed. (Abstract)","PeriodicalId":141663,"journal":{"name":"2018 International Conference on Computer Communication and Informatics (ICCCI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Computer Communication and Informatics (ICCCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCI.2018.8441491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The intension is to provide the contrast between Clustering Algorithm and Gaussian Mixture Model using Perceptual Linear Prediction features to assess the singer identification structure using two phases, phase 1 as training and phase 2 as testing over the film tracks(vocal with background music). The intent of assessing of singer is to categorize different singers impartial of data that is trained in phase 1. The aspects for two phases are executed for downright tracks from films for 20 different singers. In phase 1 aspects, for individual singer 15 tracks are loaded as input data. Now loaded datas are shaped to go through a deck of pre-handling steps. The pre-handling steps includes three more internal stages with stage1 as Pre-emphasis, stage2 as Frame Blocking and stage3 as Windowing. From individual context of pre-handled signal PLP features are evolved. Using the K-Means Clustering Algorithm and GMM the phase1 output is developed for individual singers. In Clustering algorithm the singer is categorized deployed with choice of the model that gives mean value as minimum. In GMM, by using Maximum Likelihood (ML) algorithm singers are categorized deployed with choice of the model that gives maximum likelihood. Depending on identity of accuracy the singer identification structure is performed. (Abstract)
高斯混合模型与聚类算法在歌手识别中的对比
目的是提供聚类算法和高斯混合模型之间的对比,使用感知线性预测特征来评估歌手识别结构,使用两个阶段,阶段1作为训练,阶段2作为对电影轨道(背景音乐的声乐)的测试。评估歌手的目的是对不同的歌手进行分类,不依赖于第一阶段训练的数据。两个阶段的方面是为20个不同歌手的电影中的直接曲目执行的。在阶段1方面,为单个歌手加载15首曲目作为输入数据。现在加载的数据将经过一系列预处理步骤。预处理步骤包括三个内部阶段,其中阶段1为预强调,阶段2为帧阻塞,阶段3为窗口。从单独的背景下预处理信号的PLP特征演变。利用k均值聚类算法和GMM对单个歌手的相位1输出进行了开发。在聚类算法中,歌手被分类部署,选择均值最小的模型。在GMM中,通过使用最大似然(ML)算法对歌手进行分类,并选择提供最大似然的模型。根据准确度的同一性,执行歌手识别结构。(抽象)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信