Journal on Audio Speech and Music Processing最新文献_第9页

Correction to: An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 更正:集成MVDR波束形成器，用于使用本地麦克风阵列和外部麦克风进行语音增强

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-04-06 DOI: 10.1186/s13636-021-00202-x

Randall Ali, T. van Waterschoot, M. Moonen

引用次数: 1

Adversarial joint training with self-attention mechanism for robust end-to-end speech recognition 具有自注意机制的对抗性联合训练用于鲁棒的端到端语音识别

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-04-03 DOI: 10.1186/s13636-021-00215-6

Lujun Li, Yikai Kang, Yucheng Shi, Ludwig Kürzinger, Tobias Watzel, G. Rigoll

引用次数: 12

NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain 多说话人到达方向估计的nmf加权SRP:对空间混叠的鲁棒性同时利用原子时域的稀疏性

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-03-03 DOI: 10.1186/s13636-021-00201-y

S. Thakallapalli, S. Gangashetty, N. Madhu

引用次数: 0

Analysis of transition cost and model parameters in speaker diarization for meetings 会议发言者配置的转换成本及模型参数分析

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-02-24 DOI: 10.1186/s13636-021-00196-6

Beatriz Martínez-González, J. Pardo, J. A. Vallejo-Pinto, R. San-Segundo, J. Ferreiros

引用次数: 2

Comparison of semi-supervised deep learning algorithms for audio classification 音频分类的半监督深度学习算法比较

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-02-16 DOI: 10.1186/s13636-022-00255-6

Léo Cances, E. Labbé, Thomas Pellegrini

引用次数: 8

An integrated MVDR beamformer for speech enhancement using a local microphone array and external microphones 一种集成MVDR波束形成器，用于使用本地麦克风阵列和外部麦克风进行语音增强

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-02-10 DOI: 10.1186/s13636-020-00192-2

Randall Ali, T. van Waterschoot, M. Moonen

引用次数: 6

A CNN-based approach to identification of degradations in speech signals 基于cnn的语音信号退化识别方法

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-02-05 DOI: 10.1186/s13636-021-00198-4

Yuki Saishu, A. H. Poorjam, M. G. Christensen

引用次数: 1

Dynamic out-of-vocabulary word registration to language model for speech recognition 面向语音识别的动态词汇外词配准语言模型

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2021-01-25 DOI: 10.1186/s13636-020-00193-1

N. Kitaoka, Bohan Chen, Yuya Obashi

引用次数: 4

A simulation study on optimal scores for speaker recognition 说话人识别最优分数的仿真研究

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2020-11-25 DOI: 10.1186/s13636-020-00183-3

Dong Wang

引用次数: 10

DOANet: a deep dilated convolutional neural network approach for search and rescue with drone-embedded sound source localization DOANet:一种用于无人机嵌入声源定位搜救的深度扩张卷积神经网络方法

IF 2.4 3区计算机科学

Journal on Audio Speech and Music Processing Pub Date : 2020-11-05 DOI: 10.1186/s13636-020-00184-2

Alif Bin Abdul Qayyum, K. M. N. Hassan, Adrita Anika, Md. Farhan Shadiq, M. Rahman, Md. Tariqul Islam, Sheikh Asif Imran, Shahruk Hossain, M. A. Haque

引用次数: 5