{"title":"相位检索:在音频信号重建中的应用","authors":"Raja Abdelmalek Bedoui, Z. Mnasri, F. Benzarti","doi":"10.1109/SSD54932.2022.9955795","DOIUrl":null,"url":null,"abstract":"Theoretically, phase retrieval is an efficient method for signal reconstruction given only the magnitude spectrum of the short-term Fourier transform (STFT). This topic has recently regained popularity due to its utility in a variety of applications such as compressive sensing, speech synthesis, speech enhancement, source separation, and so on. As a result, based on an explicit relationship between STFT magnitude and phase, this paper presents an efficient algorithm for audio signal reconstruction using phase retrieval from the STFT magnitude spectrum. The standard metrics in signal reconstruction, such as time domain segmental signal-to-noise ratio (segSNR), time-frequency domain signal-to-error ratio (SER), and cepstrum-related distance measures, such as log-likelihood ratio (LLR), Itakura-Saito distorsion (IS), and cepstrum distance, are used to perform an objective evaluation. The results support the proposed approach's improvement.","PeriodicalId":253898,"journal":{"name":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Phase Retrieval: Application to Audio Signal Reconstruction\",\"authors\":\"Raja Abdelmalek Bedoui, Z. Mnasri, F. Benzarti\",\"doi\":\"10.1109/SSD54932.2022.9955795\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Theoretically, phase retrieval is an efficient method for signal reconstruction given only the magnitude spectrum of the short-term Fourier transform (STFT). This topic has recently regained popularity due to its utility in a variety of applications such as compressive sensing, speech synthesis, speech enhancement, source separation, and so on. As a result, based on an explicit relationship between STFT magnitude and phase, this paper presents an efficient algorithm for audio signal reconstruction using phase retrieval from the STFT magnitude spectrum. The standard metrics in signal reconstruction, such as time domain segmental signal-to-noise ratio (segSNR), time-frequency domain signal-to-error ratio (SER), and cepstrum-related distance measures, such as log-likelihood ratio (LLR), Itakura-Saito distorsion (IS), and cepstrum distance, are used to perform an objective evaluation. The results support the proposed approach's improvement.\",\"PeriodicalId\":253898,\"journal\":{\"name\":\"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SSD54932.2022.9955795\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD54932.2022.9955795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Phase Retrieval: Application to Audio Signal Reconstruction
Theoretically, phase retrieval is an efficient method for signal reconstruction given only the magnitude spectrum of the short-term Fourier transform (STFT). This topic has recently regained popularity due to its utility in a variety of applications such as compressive sensing, speech synthesis, speech enhancement, source separation, and so on. As a result, based on an explicit relationship between STFT magnitude and phase, this paper presents an efficient algorithm for audio signal reconstruction using phase retrieval from the STFT magnitude spectrum. The standard metrics in signal reconstruction, such as time domain segmental signal-to-noise ratio (segSNR), time-frequency domain signal-to-error ratio (SER), and cepstrum-related distance measures, such as log-likelihood ratio (LLR), Itakura-Saito distorsion (IS), and cepstrum distance, are used to perform an objective evaluation. The results support the proposed approach's improvement.