International Journal of Speech Technology最新文献

筛选
英文 中文
Psychoacoustic model-driven spectral subtraction for monaural speech enhancement 心理声学模型驱动的单声道语音增强频谱减法
International Journal of Speech Technology Pub Date : 2023-11-18 DOI: 10.1007/s10772-023-10062-9
Navneet Upadhyay
{"title":"Psychoacoustic model-driven spectral subtraction for monaural speech enhancement","authors":"Navneet Upadhyay","doi":"10.1007/s10772-023-10062-9","DOIUrl":"https://doi.org/10.1007/s10772-023-10062-9","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"5 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139261559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimized cross-corpus speech emotion recognition framework based on normalized 1D convolutional neural network with data augmentation and feature selection 基于归一化一维卷积神经网络与数据增强和特征选择的优化跨语料库语音情感识别框架
International Journal of Speech Technology Pub Date : 2023-11-15 DOI: 10.1007/s10772-023-10063-8
Nishant Barsainyan, Dileep Kumar Singh
{"title":"Optimized cross-corpus speech emotion recognition framework based on normalized 1D convolutional neural network with data augmentation and feature selection","authors":"Nishant Barsainyan, Dileep Kumar Singh","doi":"10.1007/s10772-023-10063-8","DOIUrl":"https://doi.org/10.1007/s10772-023-10063-8","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"8 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139271436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Usefulness of glottal excitation source information for audio-visual speech recognition system 声门激励源信息在视听语音识别系统中的应用
International Journal of Speech Technology Pub Date : 2023-11-14 DOI: 10.1007/s10772-023-10060-x
Salam Nandakishor, Debadatta Pati
{"title":"Usefulness of glottal excitation source information for audio-visual speech recognition system","authors":"Salam Nandakishor, Debadatta Pati","doi":"10.1007/s10772-023-10060-x","DOIUrl":"https://doi.org/10.1007/s10772-023-10060-x","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"26 18","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134954312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust and efficient keyword spotting using a bidirectional attention LSTM 基于双向注意力LSTM的鲁棒高效关键字识别
International Journal of Speech Technology Pub Date : 2023-11-11 DOI: 10.1007/s10772-023-10067-4
Om Prakash Swain, H. Hemanth, Puneet Saran, Mohanaprasad Kothandaraman, Logesh Ravi, Hardik Sailor, K. S. Rajesh
{"title":"Robust and efficient keyword spotting using a bidirectional attention LSTM","authors":"Om Prakash Swain, H. Hemanth, Puneet Saran, Mohanaprasad Kothandaraman, Logesh Ravi, Hardik Sailor, K. S. Rajesh","doi":"10.1007/s10772-023-10067-4","DOIUrl":"https://doi.org/10.1007/s10772-023-10067-4","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"18 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135043070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
End-to-end ASR framework for Indian-English accent: using speech CNN-based segmentation 端到端印度英语语音的ASR框架:使用基于cnn的语音分割
International Journal of Speech Technology Pub Date : 2023-11-11 DOI: 10.1007/s10772-023-10053-w
Ghayas Ahmed, Aadil Ahmad Lawaye
{"title":"End-to-end ASR framework for Indian-English accent: using speech CNN-based segmentation","authors":"Ghayas Ahmed, Aadil Ahmad Lawaye","doi":"10.1007/s10772-023-10053-w","DOIUrl":"https://doi.org/10.1007/s10772-023-10053-w","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"19 19","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135043271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting Character-based Mandarin ASR via Chinese Pinyin Representation 基于汉字拼音表示的汉语语音识别研究
International Journal of Speech Technology Pub Date : 2023-11-08 DOI: 10.1007/s10772-023-10050-z
Li Li, Yanhua Long, Dongxing Xu, Yijie Li
{"title":"Boosting Character-based Mandarin ASR via Chinese Pinyin Representation","authors":"Li Li, Yanhua Long, Dongxing Xu, Yijie Li","doi":"10.1007/s10772-023-10050-z","DOIUrl":"https://doi.org/10.1007/s10772-023-10050-z","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135390364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-based factorized TDNN for a noise-robust and spoof-aware speaker verification system 基于注意力的分解TDNN噪声鲁棒和欺骗感知说话人验证系统
International Journal of Speech Technology Pub Date : 2023-11-05 DOI: 10.1007/s10772-023-10059-4
Zhor Benhafid, Sid Ahmed Selouani, Abderrahmane Amrouche, Mohammed Sidi Yakoub
{"title":"Attention-based factorized TDNN for a noise-robust and spoof-aware speaker verification system","authors":"Zhor Benhafid, Sid Ahmed Selouani, Abderrahmane Amrouche, Mohammed Sidi Yakoub","doi":"10.1007/s10772-023-10059-4","DOIUrl":"https://doi.org/10.1007/s10772-023-10059-4","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"22 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135724739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel data augmentation for named entity recognition 命名实体识别的新型数据增强
International Journal of Speech Technology Pub Date : 2023-11-03 DOI: 10.1007/s10772-023-10055-8
Aluru V. N. M. Hemateja, Gopikrishnan Kondakath, Susruta Das, Mohanaprasad Kothandaraman, S. Shobha, Abhishek Pandey, Rajin Babu, Abhinav Jain
{"title":"Novel data augmentation for named entity recognition","authors":"Aluru V. N. M. Hemateja, Gopikrishnan Kondakath, Susruta Das, Mohanaprasad Kothandaraman, S. Shobha, Abhishek Pandey, Rajin Babu, Abhinav Jain","doi":"10.1007/s10772-023-10055-8","DOIUrl":"https://doi.org/10.1007/s10772-023-10055-8","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"40 22","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135819607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A speech based diagnostic method for Alzheimer disease using machine learning 使用机器学习的基于语音的阿尔茨海默病诊断方法
International Journal of Speech Technology Pub Date : 2023-11-03 DOI: 10.1007/s10772-023-10056-7
R. Benazir Begam, M. Palanivelan
{"title":"A speech based diagnostic method for Alzheimer disease using machine learning","authors":"R. Benazir Begam, M. Palanivelan","doi":"10.1007/s10772-023-10056-7","DOIUrl":"https://doi.org/10.1007/s10772-023-10056-7","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"43 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135820188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CI-Mix: cut instance mix for robust speaker verification CI-Mix:切断实例混合稳健的说话者验证
International Journal of Speech Technology Pub Date : 2023-11-01 DOI: 10.1007/s10772-023-10051-y
Yibo Duan, Yanhua Long, Yijie Li
{"title":"CI-Mix: cut instance mix for robust speaker verification","authors":"Yibo Duan, Yanhua Long, Yijie Li","doi":"10.1007/s10772-023-10051-y","DOIUrl":"https://doi.org/10.1007/s10772-023-10051-y","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"4 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135325766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信