Journal on Audio Speech and Music Processing: Latest Articles

A large TV dataset for speech and music activity detection
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-09-03 DOI: 10.1186/s13636-022-00253-8
Yun-Ning Hung, Chih-Wei Wu, Iroro Orife, A. Hipple, W. Wolcott, Alexander Lerch
Citations: 3
DOA-guided source separation with direction-based initialization and time annotations using complex angular central Gaussian mixture models
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-06-18 DOI: 10.1186/s13636-022-00246-7
Alexander Bohlender, Lucas Van Severen, Jonathan Sterckx, N. Madhu
Citations: 0
Data-based spatial audio processing
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-06-08 DOI: 10.1186/s13636-022-00248-5
M. Cobos, J. Ahrens, K. Kowalczyk, A. Politis
Citations: 2
Improving sign-algorithm convergence rate using natural gradient for lossless audio compression
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-05-21 DOI: 10.1186/s13636-022-00243-w
Taiyo Mineo, Hayaru Shouno
Citations: 2
An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-05-16 DOI: 10.1186/s13636-022-00242-x
M. Cobos, J. Ahrens, K. Kowalczyk, A. Politis
Citations: 14
Automated audio captioning: an overview of recent progress and new challenges
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-05-12 DOI: 10.1186/s13636-022-00259-2
Xinhao Mei, Xubo Liu, M. Plumbley, Wenwu Wang
Citations: 20
Interaural time difference individualization in HRTF by scaling through anthropometric parameters
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-05-12 DOI: 10.1186/s13636-022-00241-y
P. Gutierrez-Parera, José J. López, J. M. Mora-Merchan, D. Larios
Citations: 2
Heterogeneous separation consistency training for adaptation of unsupervised speech separation
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-04-23 DOI: 10.1186/s13636-023-00273-y
Jiangyu Han, Yanhua Long
Citations: 3
Sound event triage: detecting sound events considering priority of classes
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-04-13 DOI: 10.1186/s13636-022-00270-7
Noriyuki Tonami, Keisuke Imoto
Citations: 0
A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices
IF 2.4, Zone 3, Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2022-04-06 DOI: 10.1186/s13636-023-00285-8
Jean-Marie Lemercier, J. Thiemann, Raphael Koning, Timo Gerkmann
Citations: 0