Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages

Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das
{"title":"Feature Selection using Pre-clustering via Affinity Propagation for Speech Classification in Low-resource Languages","authors":"Parabattina Bhagath, Komal Bharti, Abhishek Kotiya, P. Das","doi":"10.1109/IICAIET51634.2021.9573696","DOIUrl":null,"url":null,"abstract":"Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.","PeriodicalId":234229,"journal":{"name":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET51634.2021.9573696","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Speech analysis is an active research field where different feature extraction techniques are studied for solving various issues. Such studies help to improve the time complexity of solutions by understanding necessary clues to select the features. Choosing essential features by removing irrelevant information is a significant step in feature engineering. Perceptual Linear Predictive (PLP) modeling concentrates on understanding the speech signals by focusing on the features perceived at the listener end. They have been used successfully in many speech processing applications. The selection of the order of PLP coefficients for efficient classification of spoken units plays a crucial role in the recognition task. A conventional speech processing system requires a huge training process to develop an Automatic Speech Recognition system. Such systems are efficient for the languages that have enough resources i.e. data. But, low-resource languages especially Asian languages haven't been developed to provide the data sufficient for such tasks. In this context, alternative methods and techniques are encouraged to enhance or optimize the development process with less amount of data. This paper proposes a pre-clustering technique to improve the classification rate with low resources.
基于亲和力传播的预聚类特征选择在低资源语言语音分类中的应用
语音分析是一个活跃的研究领域,人们研究了不同的特征提取技术来解决各种问题。这样的研究通过理解必要的线索来选择特征,有助于提高解决方案的时间复杂度。通过去除不相关信息来选择基本特征是特征工程中的一个重要步骤。感知线性预测(PLP)建模的重点是通过关注听者端感知到的特征来理解语音信号。它们已成功地应用于许多语音处理应用中。在语音识别任务中,有效分类语音单元的PLP系数顺序的选择是至关重要的。传统的语音处理系统需要大量的训练才能开发出自动语音识别系统。这样的系统对于拥有足够资源(即数据)的语言是有效的。但是,资源匮乏的语言,尤其是亚洲语言,还没有开发出能够为这些任务提供足够数据的语言。在这方面,鼓励采用其他方法和技术,以较少的数据量加强或优化开发过程。为了在资源较少的情况下提高分类率,提出了一种预聚类技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信