支持向量机声学建模中轨迹矩阵的外积

R. Anitha, D. S. Satish, C. Sekhar
{"title":"支持向量机声学建模中轨迹矩阵的外积","authors":"R. Anitha, D. S. Satish, C. Sekhar","doi":"10.1109/MLSP.2004.1422993","DOIUrl":null,"url":null,"abstract":"In this paper, we address the issues in classification of varying duration segments of speech using support vector machines. Commonly used methods for mapping the varying duration segments into fixed dimension patterns may lead to loss of crucial information necessary for classification. We propose a method in which the representation of a segment of speech is considered as a trajectory in a multidimensional space. A fixed dimension pattern vector derived from the outerproduct operation on the matrix representation of a multidimensional trajectory is given as input to the support vector machines. For acoustic modeling of speech segments consisting of multiple phonemes, the outerproduct operation is carried out for the trajectory matrix of each phoneme. The effectiveness of the proposed methods is demonstrated in recognition of isolated utterances of the E-set of English alphabet","PeriodicalId":70952,"journal":{"name":"信号处理","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2004-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Outerproduct of trajectory matrix for acoustic modeling using support vector machines\",\"authors\":\"R. Anitha, D. S. Satish, C. Sekhar\",\"doi\":\"10.1109/MLSP.2004.1422993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we address the issues in classification of varying duration segments of speech using support vector machines. Commonly used methods for mapping the varying duration segments into fixed dimension patterns may lead to loss of crucial information necessary for classification. We propose a method in which the representation of a segment of speech is considered as a trajectory in a multidimensional space. A fixed dimension pattern vector derived from the outerproduct operation on the matrix representation of a multidimensional trajectory is given as input to the support vector machines. For acoustic modeling of speech segments consisting of multiple phonemes, the outerproduct operation is carried out for the trajectory matrix of each phoneme. The effectiveness of the proposed methods is demonstrated in recognition of isolated utterances of the E-set of English alphabet\",\"PeriodicalId\":70952,\"journal\":{\"name\":\"信号处理\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"信号处理\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.1109/MLSP.2004.1422993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"信号处理","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/MLSP.2004.1422993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

摘要

在本文中,我们使用支持向量机解决了不同时长语音片段的分类问题。将不同持续时间段映射到固定维度模式的常用方法可能导致丢失分类所需的关键信息。我们提出了一种方法,该方法将语音片段的表示视为多维空间中的轨迹。通过对多维轨迹的矩阵表示进行外积运算得到的固定维模式向量作为支持向量机的输入。对于由多个音素组成的语音片段的声学建模,对每个音素的轨迹矩阵进行外积运算。通过对英语字母e集孤立语音的识别,验证了所提方法的有效性
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Outerproduct of trajectory matrix for acoustic modeling using support vector machines
In this paper, we address the issues in classification of varying duration segments of speech using support vector machines. Commonly used methods for mapping the varying duration segments into fixed dimension patterns may lead to loss of crucial information necessary for classification. We propose a method in which the representation of a segment of speech is considered as a trajectory in a multidimensional space. A fixed dimension pattern vector derived from the outerproduct operation on the matrix representation of a multidimensional trajectory is given as input to the support vector machines. For acoustic modeling of speech segments consisting of multiple phonemes, the outerproduct operation is carried out for the trajectory matrix of each phoneme. The effectiveness of the proposed methods is demonstrated in recognition of isolated utterances of the E-set of English alphabet
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
5812
期刊介绍: Journal of Signal Processing is an academic journal supervised by China Association for Science and Technology and sponsored by China Institute of Electronics. The journal is an academic journal that reflects the latest research results and technological progress in the field of signal processing and related disciplines. It covers academic papers and review articles on new theories, new ideas, and new technologies in the field of signal processing. The journal aims to provide a platform for academic exchanges for scientific researchers and engineering and technical personnel engaged in basic research and applied research in signal processing, thereby promoting the development of information science and technology. At present, the journal has been included in the three major domestic core journal databases "China Science Citation Database (CSCD), China Science and Technology Core Journals (CSTPCD), Chinese Core Journals Overview" and Coaj. It is also included in many foreign databases such as Scopus, CSA, EBSCO host, INSPEC, JST, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信