Journal on Audio Speech and Music Processing最新文献

筛选
英文 中文
Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 更正:集成MVDR波束形成器,用于使用本地麦克风阵列和外部麦克风进行语音增强
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-04-06 DOI: 10.1186/s13636-021-00202-x
Randall Ali, T. van Waterschoot, M. Moonen
{"title":"Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones","authors":"Randall Ali, T. van Waterschoot, M. Moonen","doi":"10.1186/s13636-021-00202-x","DOIUrl":"https://doi.org/10.1186/s13636-021-00202-x","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00202-x","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition 具有自注意机制的对抗性联合训练用于鲁棒的端到端语音识别
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-04-03 DOI: 10.1186/s13636-021-00215-6
Lujun Li, Yikai Kang, Yucheng Shi, Ludwig Kürzinger, Tobias Watzel, G. Rigoll
{"title":"Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition","authors":"Lujun Li, Yikai Kang, Yucheng Shi, Ludwig Kürzinger, Tobias Watzel, G. Rigoll","doi":"10.1186/s13636-021-00215-6","DOIUrl":"https://doi.org/10.1186/s13636-021-00215-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":" ","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00215-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48650245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain 多说话人到达方向估计的nmf加权SRP:对空间混叠的鲁棒性同时利用原子时域的稀疏性
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-03-03 DOI: 10.1186/s13636-021-00201-y
S. Thakallapalli, S. Gangashetty, N. Madhu
{"title":"NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain","authors":"S. Thakallapalli, S. Gangashetty, N. Madhu","doi":"10.1186/s13636-021-00201-y","DOIUrl":"https://doi.org/10.1186/s13636-021-00201-y","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00201-y","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of transition cost and model parameters in speaker diarization for meetings 会议发言者配置的转换成本及模型参数分析
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-02-24 DOI: 10.1186/s13636-021-00196-6
Beatriz Martínez-González, J. Pardo, J. A. Vallejo-Pinto, R. San-Segundo, J. Ferreiros
{"title":"Analysis of transition cost and model parameters in speaker diarization for meetings","authors":"Beatriz Martínez-González, J. Pardo, J. A. Vallejo-Pinto, R. San-Segundo, J. Ferreiros","doi":"10.1186/s13636-021-00196-6","DOIUrl":"https://doi.org/10.1186/s13636-021-00196-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00196-6","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Comparison of semi-supervised deep learning algorithms for audio classification 音频分类的半监督深度学习算法比较
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-02-16 DOI: 10.1186/s13636-022-00255-6
Léo Cances, E. Labbé, Thomas Pellegrini
{"title":"Comparison of semi-supervised deep learning algorithms for audio classification","authors":"Léo Cances, E. Labbé, Thomas Pellegrini","doi":"10.1186/s13636-022-00255-6","DOIUrl":"https://doi.org/10.1186/s13636-022-00255-6","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2022 1","pages":"1-16"},"PeriodicalIF":2.4,"publicationDate":"2021-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43872465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 一种集成MVDR波束形成器,用于使用本地麦克风阵列和外部麦克风进行语音增强
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-02-10 DOI: 10.1186/s13636-020-00192-2
Randall Ali, T. van Waterschoot, M. Moonen
{"title":"An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones","authors":"Randall Ali, T. van Waterschoot, M. Moonen","doi":"10.1186/s13636-020-00192-2","DOIUrl":"https://doi.org/10.1186/s13636-020-00192-2","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00192-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A CNN-based approach to identification of degradations in speech signals 基于cnn的语音信号退化识别方法
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-02-05 DOI: 10.1186/s13636-021-00198-4
Yuki Saishu, A. H. Poorjam, M. G. Christensen
{"title":"A CNN-based approach to identification of degradations in speech signals","authors":"Yuki Saishu, A. H. Poorjam, M. G. Christensen","doi":"10.1186/s13636-021-00198-4","DOIUrl":"https://doi.org/10.1186/s13636-021-00198-4","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-021-00198-4","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Dynamic out-of-vocabulary word registration to language model for speech recognition 面向语音识别的动态词汇外词配准语言模型
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2021-01-25 DOI: 10.1186/s13636-020-00193-1
N. Kitaoka, Bohan Chen, Yuya Obashi
{"title":"Dynamic out-of-vocabulary word registration to language model for speech recognition","authors":"N. Kitaoka, Bohan Chen, Yuya Obashi","doi":"10.1186/s13636-020-00193-1","DOIUrl":"https://doi.org/10.1186/s13636-020-00193-1","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2021 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2021-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00193-1","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A simulation study on optimal scores for speaker recognition 说话人识别最优分数的仿真研究
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2020-11-25 DOI: 10.1186/s13636-020-00183-3
Dong Wang
{"title":"A simulation study on optimal scores for speaker recognition","authors":"Dong Wang","doi":"10.1186/s13636-020-00183-3","DOIUrl":"https://doi.org/10.1186/s13636-020-00183-3","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2020 1","pages":""},"PeriodicalIF":2.4,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00183-3","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"65687779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization DOANet:一种用于无人机嵌入声源定位搜救的深度扩张卷积神经网络方法
IF 2.4 3区 计算机科学
Journal on Audio Speech and Music Processing Pub Date : 2020-11-05 DOI: 10.1186/s13636-020-00184-2
Alif Bin Abdul Qayyum, K. M. N. Hassan, Adrita Anika, Md. Farhan Shadiq, M. Rahman, Md. Tariqul Islam, Sheikh Asif Imran, Shahruk Hossain, M. A. Haque
{"title":"DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization","authors":"Alif Bin Abdul Qayyum, K. M. N. Hassan, Adrita Anika, Md. Farhan Shadiq, M. Rahman, Md. Tariqul Islam, Sheikh Asif Imran, Shahruk Hossain, M. A. Haque","doi":"10.1186/s13636-020-00184-2","DOIUrl":"https://doi.org/10.1186/s13636-020-00184-2","url":null,"abstract":"","PeriodicalId":49309,"journal":{"name":"Journal on Audio Speech and Music Processing","volume":"2020 1","pages":"1-18"},"PeriodicalIF":2.4,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s13636-020-00184-2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49585126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信