International Journal of Speech Technology最新文献

筛选
英文 中文
A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning 采用机器学习分类器和集合学习的计算高效语音情感识别系统
International Journal of Speech Technology Pub Date : 2024-03-30 DOI: 10.1007/s10772-024-10095-8
N. Aishwarya, Kanwaljeet Kaur, Karthik Seemakurthy
{"title":"A computationally efficient speech emotion recognition system employing machine learning classifiers and ensemble learning","authors":"N. Aishwarya, Kanwaljeet Kaur, Karthik Seemakurthy","doi":"10.1007/s10772-024-10095-8","DOIUrl":"https://doi.org/10.1007/s10772-024-10095-8","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"23 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140364321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech recognition based on the transformer's multi-head attention in Arabic 基于变压器多头注意力的阿拉伯语语音识别
International Journal of Speech Technology Pub Date : 2024-03-29 DOI: 10.1007/s10772-024-10092-x
Omayma Mahmoudi, Mouncef Filali-Bouami, Mohamed Benchat
{"title":"Speech recognition based on the transformer's multi-head attention in Arabic","authors":"Omayma Mahmoudi, Mouncef Filali-Bouami, Mohamed Benchat","doi":"10.1007/s10772-024-10092-x","DOIUrl":"https://doi.org/10.1007/s10772-024-10092-x","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"42 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140368252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection 利用 GTCC 频谱和基于 ResNet50 的分类进行特征提取,用于音频欺骗检测
International Journal of Speech Technology Pub Date : 2024-03-29 DOI: 10.1007/s10772-024-10093-w
N. Chakravarty, Mohit Dua
{"title":"Feature extraction using GTCC spectrogram and ResNet50 based classification for audio spoof detection","authors":"N. Chakravarty, Mohit Dua","doi":"10.1007/s10772-024-10093-w","DOIUrl":"https://doi.org/10.1007/s10772-024-10093-w","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"62 20","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140367823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conditional Denoising Diffusion Implicit Model for Speech Enhancement 用于语音增强的条件去噪扩散隐含模型
International Journal of Speech Technology Pub Date : 2024-03-26 DOI: 10.1007/s10772-024-10091-y
Chengyong Yang, Xiukang Yu, Sheng Huang
{"title":"Conditional Denoising Diffusion Implicit Model for Speech Enhancement","authors":"Chengyong Yang, Xiukang Yu, Sheng Huang","doi":"10.1007/s10772-024-10091-y","DOIUrl":"https://doi.org/10.1007/s10772-024-10091-y","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140378747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stockwell-Transform based feature representation for detection and assessment of voice disorders 基于斯托克韦尔变换的特征表示法检测和评估嗓音疾病
International Journal of Speech Technology Pub Date : 2024-02-29 DOI: 10.1007/s10772-024-10085-w
Purva Barche, K. Gurugubelli, A. Vuppala
{"title":"Stockwell-Transform based feature representation for detection and assessment of voice disorders","authors":"Purva Barche, K. Gurugubelli, A. Vuppala","doi":"10.1007/s10772-024-10085-w","DOIUrl":"https://doi.org/10.1007/s10772-024-10085-w","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"12 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-02-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140412538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Automated detection system for texture feature based classification on different image datasets using S-transform 更正为使用 S 变换对不同图像数据集进行基于纹理特征分类的自动检测系统
International Journal of Speech Technology Pub Date : 2024-01-29 DOI: 10.1007/s10772-024-10083-y
O. Kesav, G. K. Rajini
{"title":"Correction to: Automated detection system for texture feature based classification on different image datasets using S-transform","authors":"O. Kesav, G. K. Rajini","doi":"10.1007/s10772-024-10083-y","DOIUrl":"https://doi.org/10.1007/s10772-024-10083-y","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"58 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140487019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review on speech emotion recognition for late deafened educators in online education 晚聋教育工作者在线教育语音情感识别综述
International Journal of Speech Technology Pub Date : 2024-01-24 DOI: 10.1007/s10772-023-10064-7
Aparna Vyakaranam, Tomas Maul, Bavani Ramayah
{"title":"A review on speech emotion recognition for late deafened educators in online education","authors":"Aparna Vyakaranam, Tomas Maul, Bavani Ramayah","doi":"10.1007/s10772-023-10064-7","DOIUrl":"https://doi.org/10.1007/s10772-023-10064-7","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"60 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139601433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in encoded speech data by background noise suppression under uncontrolled environment 在不受控环境下抑制背景噪声,推动编码语音数据的发展
International Journal of Speech Technology Pub Date : 2024-01-06 DOI: 10.1007/s10772-023-10078-1
B. G. Nagaraja, G. T. Yadava, Mohamed Anees
{"title":"Advancements in encoded speech data by background noise suppression under uncontrolled environment","authors":"B. G. Nagaraja, G. T. Yadava, Mohamed Anees","doi":"10.1007/s10772-023-10078-1","DOIUrl":"https://doi.org/10.1007/s10772-023-10078-1","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139380910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scene text visual question answering by using YOLO and STN 利用 YOLO 和 STN 进行场景文本可视化问题解答
International Journal of Speech Technology Pub Date : 2024-01-03 DOI: 10.1007/s10772-023-10081-6
Kimiya Nourali, Elham Dolkhani
{"title":"Scene text visual question answering by using YOLO and STN","authors":"Kimiya Nourali, Elham Dolkhani","doi":"10.1007/s10772-023-10081-6","DOIUrl":"https://doi.org/10.1007/s10772-023-10081-6","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":"8 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139389461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimized convolutional neural network for speech enhancement 用于语音增强的优化卷积神经网络
International Journal of Speech Technology Pub Date : 2023-12-29 DOI: 10.1007/s10772-023-10073-6
A. Karthik, J. L. Mazher Iqbal
{"title":"An optimized convolutional neural network for speech enhancement","authors":"A. Karthik, J. L. Mazher Iqbal","doi":"10.1007/s10772-023-10073-6","DOIUrl":"https://doi.org/10.1007/s10772-023-10073-6","url":null,"abstract":"","PeriodicalId":14305,"journal":{"name":"International Journal of Speech Technology","volume":" 46","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139144647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信