Journal on Audio Speech and Music Processing: Latest Articles

Three-stage training and orthogonality regularization for spoken language recognition
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-04-06 DOI: 10.1186/s13636-023-00281-y
Zimu Li, Yanyan Xu, Dengfeng Ke, Kaile Su
Citations: 0
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-03-23 DOI: 10.1186/s13636-023-00278-7
Fabian Ostermann, Igor Vatolkin, Martin Ebeling
Citations: 2
Deep learning-based wave digital modeling of rate-dependent hysteretic nonlinearities for virtual analog applications
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-03-08 DOI: 10.1186/s13636-023-00277-8
Oliviero Massi, Alessandro Ilic Mezza, Riccardo Giampiccolo, A. Bernardini
Citations: 2
A latent rhythm complexity model for attribute-controlled drum pattern generation
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-02-17 DOI: 10.1186/s13636-022-00267-2
Alessandro Ilic Mezza, M. Zanoni, A. Sarti
Citations: 2
Research on monaural speech segregation based on feature selection
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-02-16 DOI: 10.1186/s13636-023-00276-9
Xiaoping Xie, Yong-Nan Chen, Rufeng Shen, Dan Tian
Citations: 0
Correction: Trainable windows for SincNet architecture
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-02-09 DOI: 10.1186/s13636-023-00275-w
Prashanth H C, Madhav Rao, Dhanya Eledath, V. Ramasubramanian
Citations: 1
Review of methods for coding of speech signals
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-02-07 DOI: 10.1186/s13636-023-00274-x
D. O'Shaughnessy
Citations: 1
An MMSE graph spectral magnitude estimator for speech signals residing on an undirected multiple graph
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-02-03 DOI: 10.1186/s13636-023-00272-z
Tingting Wang, Haiyan Guo, Zirui Ge, Qiquan Zhang, Zhen Yang
Citations: 0
Trainable windows for SincNet architecture
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-01-19 DOI: 10.1186/s13636-023-00271-0
Prashanth H C, Madhav Rao, Dhanya Eledath, R. V.
Citations: 1
Beyond the Big Five personality traits for music recommendation systems
IF 2.4 · Zone 3 · Computer Science
Journal on Audio Speech and Music Processing Pub Date : 2023-01-19 DOI: 10.1186/s13636-022-00269-0
Mariusz Kleć, Alicja Wieczorkowska, K. Szklanny, Włodzimierz Strus
Citations: 4