基于张量分解的多因素数据挖掘的多通道音频信号检索

Yi Zhao, Jing Wang, Lidong Yang, Ali Imtiaz, Jingming Kuang
{"title":"基于张量分解的多因素数据挖掘的多通道音频信号检索","authors":"Yi Zhao, Jing Wang, Lidong Yang, Ali Imtiaz, Jingming Kuang","doi":"10.1109/ICDSP.2014.6900766","DOIUrl":null,"url":null,"abstract":"In this paper, an efficient method of multi-channel audio signal retrieval with finite number of channels is proposed based on multi-factor data mining with tensor decomposition. We briefly discuss how to convert the limited channels to an increased number of channels (multi-channel) by capturing the latent higher-order tensor structure of multi-channel audio data. The multi-channel audio data space is established mainly due to three factors including location, channel and time-frequency. Moreover, CANDECOMP/PARAFAC (CP) decomposition is introduced in the process of multi-factor data mining to predict the data in the missing channels. Besides, considering human auditory effects at low frequency, we compute a set of data in advance for the retrieval of Low Frequency Effects (LFE) channel. The performance of the proposed method is assessed by MUlti-Stimulus test with Hidden References and Anchor listening test (MUSHRA). We further demonstrate the retrieval of 5.1 multi-channel audio from stereo audio. Experiments show that an acceptable converting quality has been achieved and the novel tensor-based method is easy to implement as compared to the traditional method.","PeriodicalId":301856,"journal":{"name":"2014 19th International Conference on Digital Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-channel audio signal retrieval based on multi-factor data mining with tensor decomposition\",\"authors\":\"Yi Zhao, Jing Wang, Lidong Yang, Ali Imtiaz, Jingming Kuang\",\"doi\":\"10.1109/ICDSP.2014.6900766\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, an efficient method of multi-channel audio signal retrieval with finite number of channels is proposed based on multi-factor data mining with tensor decomposition. We briefly discuss how to convert the limited channels to an increased number of channels (multi-channel) by capturing the latent higher-order tensor structure of multi-channel audio data. The multi-channel audio data space is established mainly due to three factors including location, channel and time-frequency. Moreover, CANDECOMP/PARAFAC (CP) decomposition is introduced in the process of multi-factor data mining to predict the data in the missing channels. Besides, considering human auditory effects at low frequency, we compute a set of data in advance for the retrieval of Low Frequency Effects (LFE) channel. The performance of the proposed method is assessed by MUlti-Stimulus test with Hidden References and Anchor listening test (MUSHRA). We further demonstrate the retrieval of 5.1 multi-channel audio from stereo audio. Experiments show that an acceptable converting quality has been achieved and the novel tensor-based method is easy to implement as compared to the traditional method.\",\"PeriodicalId\":301856,\"journal\":{\"name\":\"2014 19th International Conference on Digital Signal Processing\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 19th International Conference on Digital Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2014.6900766\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 19th International Conference on Digital Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2014.6900766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种基于张量分解的多因素数据挖掘的有限信道多通道音频信号检索方法。我们简要讨论了如何通过捕获多通道音频数据的潜在高阶张量结构将有限的通道转换为增加的通道(多通道)。多通道音频数据空间的建立主要取决于位置、通道和时频三个因素。此外,在多因素数据挖掘过程中引入了CANDECOMP/PARAFAC (CP)分解来预测缺失通道中的数据。此外,考虑到人的低频听觉效应,我们提前计算了一组数据,用于检索低频效应(LFE)通道。通过隐含参考的多刺激测试和锚点听力测试(MUSHRA)对该方法的性能进行了评估。我们进一步演示了从立体声音频中检索5.1多声道音频。实验表明,该方法具有较好的转换质量,且与传统方法相比,该方法易于实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Multi-channel audio signal retrieval based on multi-factor data mining with tensor decomposition
In this paper, an efficient method of multi-channel audio signal retrieval with finite number of channels is proposed based on multi-factor data mining with tensor decomposition. We briefly discuss how to convert the limited channels to an increased number of channels (multi-channel) by capturing the latent higher-order tensor structure of multi-channel audio data. The multi-channel audio data space is established mainly due to three factors including location, channel and time-frequency. Moreover, CANDECOMP/PARAFAC (CP) decomposition is introduced in the process of multi-factor data mining to predict the data in the missing channels. Besides, considering human auditory effects at low frequency, we compute a set of data in advance for the retrieval of Low Frequency Effects (LFE) channel. The performance of the proposed method is assessed by MUlti-Stimulus test with Hidden References and Anchor listening test (MUSHRA). We further demonstrate the retrieval of 5.1 multi-channel audio from stereo audio. Experiments show that an acceptable converting quality has been achieved and the novel tensor-based method is easy to implement as compared to the traditional method.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信