ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)最新文献

筛选
英文 中文
Improving Acoustic Echo Cancellation for Voice Assistants Using Neural Echo Suppression and Multi-Microphone Noise Reduction 利用神经回声抑制和多麦克风降噪技术改进语音助手的回声消除功能
Jens Heitkaemper, Arun Narayanan, T. Shabestary, S. Panchapagesan, James Walker, Bhalchandra Gajare, Shlomi Regev, Ajay Dudani, Alexander Gruenstein
{"title":"Improving Acoustic Echo Cancellation for Voice Assistants Using Neural Echo Suppression and Multi-Microphone Noise Reduction","authors":"Jens Heitkaemper, Arun Narayanan, T. Shabestary, S. Panchapagesan, James Walker, Bhalchandra Gajare, Shlomi Regev, Ajay Dudani, Alexander Gruenstein","doi":"10.1109/icassp48485.2024.10447477","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10447477","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"9 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140706260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Think as People: Context-Driven Multi-Image News Captioning with Adaptive Dual Attention 像人一样思考利用自适应双重注意力进行上下文驱动的多图像新闻字幕制作
Qiang Yang, Xiaodong Wu, Xiuying Chen, Xin Gao, Xiangliang Zhang
{"title":"Think as People: Context-Driven Multi-Image News Captioning with Adaptive Dual Attention","authors":"Qiang Yang, Xiaodong Wu, Xiuying Chen, Xin Gao, Xiangliang Zhang","doi":"10.1109/icassp48485.2024.10446024","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10446024","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140706482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Stochastic Proximal WMMSE for Ergodic Sum Rate Maximization 一种随机近端 WMMSE 算法,可实现遍历和率最大化
Xiaotong Zhao, Xi Wang, Juncheng Wang, Qingjiang Shi
{"title":"A Stochastic Proximal WMMSE for Ergodic Sum Rate Maximization","authors":"Xiaotong Zhao, Xi Wang, Juncheng Wang, Qingjiang Shi","doi":"10.1109/icassp48485.2024.10446220","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10446220","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"2 12","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140706502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrmix: Progressive Mixed Contrastive Learning for Semi-Supervised Medical Image Segmentation Contrmix:用于半监督医学图像分割的渐进式混合对比学习
Meisheng Zhang, Chenye Wang, Wenxuan Zou, Xingqun Qi, Muyi Sun, Wanting Zhou
{"title":"Contrmix: Progressive Mixed Contrastive Learning for Semi-Supervised Medical Image Segmentation","authors":"Meisheng Zhang, Chenye Wang, Wenxuan Zou, Xingqun Qi, Muyi Sun, Wanting Zhou","doi":"10.1109/icassp48485.2024.10447013","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10447013","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"174 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140706598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Oral Reading Fluency Assessment Through Sub-Sequence Matching of Acoustic Word Embeddings 通过语音词嵌入的子序列匹配改进口语阅读流利度评估
Yihao Wang, Zhongdi Wu, Joseph F. T. Nese, Akihito Kamata, Vedant Nilabh, Eric Larson
{"title":"Improving Oral Reading Fluency Assessment Through Sub-Sequence Matching of Acoustic Word Embeddings","authors":"Yihao Wang, Zhongdi Wu, Joseph F. T. Nese, Akihito Kamata, Vedant Nilabh, Eric Larson","doi":"10.1109/icassp48485.2024.10447029","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10447029","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"133 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140706687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pre-Trained Acoustic-and-Textual Modeling for End-To-End Speech-To-Text Translation 用于端到端语音到文本翻译的预训练声学和文本建模
Weitai Zhang, Hanyi Zhang, Chenxuan Liu, Zhongyi Ye, Xinyuan Zhou, Chao Lin, Lirong Dai
{"title":"Pre-Trained Acoustic-and-Textual Modeling for End-To-End Speech-To-Text Translation","authors":"Weitai Zhang, Hanyi Zhang, Chenxuan Liu, Zhongyi Ye, Xinyuan Zhou, Chao Lin, Lirong Dai","doi":"10.1109/icassp48485.2024.10446635","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10446635","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"54 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140705005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SIMMKD: Simple Mask-Flow Keypoint Detection for Both Typhoon Detection and Typhoon Eye Location SIMMKD:用于台风探测和台风眼定位的简单掩膜-流关键点探测
Yunling Feng, Yang Lei, Xinjie Yang, Jian Xu, Xingxian Liu, Bo Xiao, Yajing Xu
{"title":"SIMMKD: Simple Mask-Flow Keypoint Detection for Both Typhoon Detection and Typhoon Eye Location","authors":"Yunling Feng, Yang Lei, Xinjie Yang, Jian Xu, Xingxian Liu, Bo Xiao, Yajing Xu","doi":"10.1109/icassp48485.2024.10448466","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10448466","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"46 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140705189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Learning on Compressed Posterior Space for Non-Iterative Score-based End-to-End Text-to-Speech 在压缩后验空间上进行对抗学习,实现基于分数的非迭代端到端文本到语音技术
Won-Gook Choi, Donghyun Seong, Joon-Hyuk Chang
{"title":"Adversarial Learning on Compressed Posterior Space for Non-Iterative Score-based End-to-End Text-to-Speech","authors":"Won-Gook Choi, Donghyun Seong, Joon-Hyuk Chang","doi":"10.1109/icassp48485.2024.10446958","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10446958","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"42 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140705231","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gradient Reactivation Enhanced Causal Attention for Out-Of-Distribution Generalizable Graph Classification 梯度再激活增强因果注意,实现非分布式通用图分类
Xu Wang, Pengfei Gu, Yudong Zhang, Binwu Wang, Pengkun Wang, Yang Wang
{"title":"Gradient Reactivation Enhanced Causal Attention for Out-Of-Distribution Generalizable Graph Classification","authors":"Xu Wang, Pengfei Gu, Yudong Zhang, Binwu Wang, Pengkun Wang, Yang Wang","doi":"10.1109/icassp48485.2024.10446036","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10446036","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"156 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140704434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local Optimization Networks for Multi-View Multi-Person Human Posture Estimation 用于多视角多人人体姿态估计的局部优化网络
Jucheng Song, Chi-Man Pun, Haolun Li, Rushi Lan, Jiucheng Xie, Hao Gao
{"title":"Local Optimization Networks for Multi-View Multi-Person Human Posture Estimation","authors":"Jucheng Song, Chi-Man Pun, Haolun Li, Rushi Lan, Jiucheng Xie, Hao Gao","doi":"10.1109/icassp48485.2024.10445922","DOIUrl":"https://doi.org/10.1109/icassp48485.2024.10445922","url":null,"abstract":"","PeriodicalId":517764,"journal":{"name":"ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"86 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140704759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信