International Journal of Speech Technology: Latest Articles

A transformer-based network for speech recognition
International Journal of Speech Technology Pub Date : 2023-06-26 DOI: 10.1007/s10772-023-10034-z
Lina Tang
Pages: 531-539
Citations: 0
Noise robust automatic speech recognition: review and analysis
International Journal of Speech Technology Pub Date : 2023-06-24 DOI: 10.1007/s10772-023-10033-0
M. Dua, Akanksha, Shelza Dua
Pages: 475-519
Citations: 0
Stuttering detection using speaker representations and self-supervised contextual embeddings
International Journal of Speech Technology Pub Date : 2023-06-01 DOI: 10.1007/s10772-023-10032-1
S. A. Sheikh, Md. Sahidullah, F. Hirsch, Slim Ouni
Pages: 521-530
Citations: 1
Mouth2Audio: intelligible audio synthesis from videos with distinctive vowel articulation
International Journal of Speech Technology Pub Date : 2023-05-25 DOI: 10.1007/s10772-023-10030-3
Saurabh Garg, Haoyao Ruan, G. Hamarneh, D. Behne, A. Jongman, J. Sereno, Yue Wang
Pages: 459-474
Citations: 0
Perception of impoliteness in disagreement speech acts among Iranian upper-intermediate EFL students: a gender perspective
International Journal of Speech Technology Pub Date : 2023-04-26 DOI: 10.1007/s10772-023-10029-w
M. Shahrokhi, Behnaz Khodadadi
Pages: 1-15
Citations: 0
Ensemble machine learning regression model based predictive framework for Parkinson’s UPDRS motor score prediction from speech data
International Journal of Speech Technology Pub Date : 2023-03-28 DOI: 10.1007/s10772-023-10026-z
K. Shastry
Pages: 1-25
Citations: 0
Linguistic analysis for emotion recognition: a case of Chinese speakers
International Journal of Speech Technology Pub Date : 2023-03-18 DOI: 10.1007/s10772-023-10028-x
C. Schirru, Shahla Simin, Paolo Mengoni, A. Milani
Pages: 417-432
Citations: 1
The perception of artificial-intelligence (AI) based synthesized speech in younger and older adults
International Journal of Speech Technology Pub Date : 2023-03-13 DOI: 10.1007/s10772-023-10027-y
Björn Herrmann
Pages: 395-415
Citations: 2
Different attacks presence considerations: analyzing the simple and efficient self-marked algorithm performance for highly-sensitive audio signals contents verification
International Journal of Speech Technology Pub Date : 2023-03-12 DOI: 10.1007/s10772-023-10025-0
M. El-Bendary, Sabry S. Nassar
Pages: 379-394
Citations: 0
Performance analysis of the speech enhancement application with wavelet transform domain adaptive filters
International Journal of Speech Technology Pub Date : 2023-03-01 DOI: 10.1007/s10772-023-10022-3
Elif Özen Acarbay, Nalan Özkurt
Pages: 245-258
Citations: 1