{"title":"基于熵的噪声语音信号谱图有用内容提取","authors":"Ana Vrankovic, I. Ipšić, J. Lerga","doi":"10.1109/ELMAR52657.2021.9550891","DOIUrl":null,"url":null,"abstract":"In the paper, we extend and apply the 2D local entropy method (2DLEM) used for signal time-frequency repre-sentation to useful content extraction from noisy speech signals. In our previous work, we presented the 2D local entropy method (2DLEM) and tested it on synthetic signals and a small number of real-world signals. We now extend the application of our method by applying it to recorded speech signals combined with noise from different sources. The database we used is commonly used in speech recognition, where tested methods usually have the best result achieved on clean signals without added noise or on denoised signals. The 2DLEM method is used for the extraction of useful content, and in this paper, we test it in real-world scenarios. Our results show promising results for all tested signals regardless of the noise source or signal-to-noise ratios (SNRs). Combining the 2DLEM method with speech recognition methods could improve the performance of speech recognition and understanding systems.","PeriodicalId":410503,"journal":{"name":"2021 International Symposium ELMAR","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Entropy-Based Extraction of Useful Content from Spectrograms of Noisy Speech Signals\",\"authors\":\"Ana Vrankovic, I. Ipšić, J. Lerga\",\"doi\":\"10.1109/ELMAR52657.2021.9550891\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the paper, we extend and apply the 2D local entropy method (2DLEM) used for signal time-frequency repre-sentation to useful content extraction from noisy speech signals. In our previous work, we presented the 2D local entropy method (2DLEM) and tested it on synthetic signals and a small number of real-world signals. We now extend the application of our method by applying it to recorded speech signals combined with noise from different sources. The database we used is commonly used in speech recognition, where tested methods usually have the best result achieved on clean signals without added noise or on denoised signals. The 2DLEM method is used for the extraction of useful content, and in this paper, we test it in real-world scenarios. Our results show promising results for all tested signals regardless of the noise source or signal-to-noise ratios (SNRs). Combining the 2DLEM method with speech recognition methods could improve the performance of speech recognition and understanding systems.\",\"PeriodicalId\":410503,\"journal\":{\"name\":\"2021 International Symposium ELMAR\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Symposium ELMAR\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ELMAR52657.2021.9550891\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Symposium ELMAR","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ELMAR52657.2021.9550891","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Entropy-Based Extraction of Useful Content from Spectrograms of Noisy Speech Signals
In the paper, we extend and apply the 2D local entropy method (2DLEM) used for signal time-frequency repre-sentation to useful content extraction from noisy speech signals. In our previous work, we presented the 2D local entropy method (2DLEM) and tested it on synthetic signals and a small number of real-world signals. We now extend the application of our method by applying it to recorded speech signals combined with noise from different sources. The database we used is commonly used in speech recognition, where tested methods usually have the best result achieved on clean signals without added noise or on denoised signals. The 2DLEM method is used for the extraction of useful content, and in this paper, we test it in real-world scenarios. Our results show promising results for all tested signals regardless of the noise source or signal-to-noise ratios (SNRs). Combining the 2DLEM method with speech recognition methods could improve the performance of speech recognition and understanding systems.