压缩感知框架下潜艇对水面站水下语音清晰度的提高

Alisha Gupta, A. Koul, K. Nathwani
{"title":"压缩感知框架下潜艇对水面站水下语音清晰度的提高","authors":"Alisha Gupta, A. Koul, K. Nathwani","doi":"10.1109/ICONAT53423.2022.9726099","DOIUrl":null,"url":null,"abstract":"Inter speech communication between submarine to surface in an underwater vessel is always unintelligible. One of the major reasons is the underwater vessel-noise which distorts the speech signal profoundly. The Compressed Sensing (CS) techniques have been widely used to enhance the quality of the noisy speech signal. However, improving the speech intelligibility (SI) of the received speech signal with the on-board equipment is a challenging task and has never been attempted before. Hence in this work the improvement in the intelligibility of the noisy speech signal is achieved by modifying the CS technique by pre-processing the signal based on different features. The pre-processing scheme is based on projecting the received speech signal onto the null-space of the noise formants. The formants herein are extracted from the features such as Linear Prediction (LP) coefficients, Mel-Frequency Cepstral Coefficients (MFCC), and chirp group-delay (GD). Experimental results show that the proposed CS scheme using different features pre-processing (which maximizes the improvement factor), achieves signifi-cant intelligibility improvement over traditional CS and other methods. The improvement factor is obtained using short time objective intelligibility (STOI) metrics.","PeriodicalId":377501,"journal":{"name":"2022 International Conference for Advancement in Technology (ICONAT)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Underwater Speech Intelligibility Improvement Between Submarine to Surface Station in Compress Sensing Framework\",\"authors\":\"Alisha Gupta, A. Koul, K. Nathwani\",\"doi\":\"10.1109/ICONAT53423.2022.9726099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Inter speech communication between submarine to surface in an underwater vessel is always unintelligible. One of the major reasons is the underwater vessel-noise which distorts the speech signal profoundly. The Compressed Sensing (CS) techniques have been widely used to enhance the quality of the noisy speech signal. However, improving the speech intelligibility (SI) of the received speech signal with the on-board equipment is a challenging task and has never been attempted before. Hence in this work the improvement in the intelligibility of the noisy speech signal is achieved by modifying the CS technique by pre-processing the signal based on different features. The pre-processing scheme is based on projecting the received speech signal onto the null-space of the noise formants. The formants herein are extracted from the features such as Linear Prediction (LP) coefficients, Mel-Frequency Cepstral Coefficients (MFCC), and chirp group-delay (GD). Experimental results show that the proposed CS scheme using different features pre-processing (which maximizes the improvement factor), achieves signifi-cant intelligibility improvement over traditional CS and other methods. The improvement factor is obtained using short time objective intelligibility (STOI) metrics.\",\"PeriodicalId\":377501,\"journal\":{\"name\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference for Advancement in Technology (ICONAT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICONAT53423.2022.9726099\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference for Advancement in Technology (ICONAT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONAT53423.2022.9726099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在水下航行器中,潜艇与水面之间的语音通信总是不可理解的。其中一个主要原因是水下船舶噪声对语音信号造成了严重的失真。压缩感知(CS)技术被广泛应用于提高含噪语音信号的质量。然而,利用车载设备提高接收语音信号的语音清晰度(SI)是一项具有挑战性的任务,以前从未尝试过。因此,在本工作中,通过修改CS技术,根据不同的特征对信号进行预处理,从而提高了噪声语音信号的可理解性。该预处理方案基于将接收到的语音信号投影到噪声共振峰的零空间上。本文从线性预测(LP)系数、Mel-Frequency倒谱系数(MFCC)和啁啾群延迟(GD)等特征中提取共振峰。实验结果表明,采用不同的特征预处理方法(使改进因子最大化)的图像识别方案比传统的图像识别方法和其他方法获得了显著的可理解性提高。利用短时客观可解度(STOI)指标获得改进因子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Underwater Speech Intelligibility Improvement Between Submarine to Surface Station in Compress Sensing Framework
Inter speech communication between submarine to surface in an underwater vessel is always unintelligible. One of the major reasons is the underwater vessel-noise which distorts the speech signal profoundly. The Compressed Sensing (CS) techniques have been widely used to enhance the quality of the noisy speech signal. However, improving the speech intelligibility (SI) of the received speech signal with the on-board equipment is a challenging task and has never been attempted before. Hence in this work the improvement in the intelligibility of the noisy speech signal is achieved by modifying the CS technique by pre-processing the signal based on different features. The pre-processing scheme is based on projecting the received speech signal onto the null-space of the noise formants. The formants herein are extracted from the features such as Linear Prediction (LP) coefficients, Mel-Frequency Cepstral Coefficients (MFCC), and chirp group-delay (GD). Experimental results show that the proposed CS scheme using different features pre-processing (which maximizes the improvement factor), achieves signifi-cant intelligibility improvement over traditional CS and other methods. The improvement factor is obtained using short time objective intelligibility (STOI) metrics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信