{"title":"Phase Retrieval: Application to Audio Signal Reconstruction","authors":"Raja Abdelmalek Bedoui, Z. Mnasri, F. Benzarti","doi":"10.1109/SSD54932.2022.9955795","DOIUrl":null,"url":null,"abstract":"Theoretically, phase retrieval is an efficient method for signal reconstruction given only the magnitude spectrum of the short-term Fourier transform (STFT). This topic has recently regained popularity due to its utility in a variety of applications such as compressive sensing, speech synthesis, speech enhancement, source separation, and so on. As a result, based on an explicit relationship between STFT magnitude and phase, this paper presents an efficient algorithm for audio signal reconstruction using phase retrieval from the STFT magnitude spectrum. The standard metrics in signal reconstruction, such as time domain segmental signal-to-noise ratio (segSNR), time-frequency domain signal-to-error ratio (SER), and cepstrum-related distance measures, such as log-likelihood ratio (LLR), Itakura-Saito distorsion (IS), and cepstrum distance, are used to perform an objective evaluation. The results support the proposed approach's improvement.","PeriodicalId":253898,"journal":{"name":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 19th International Multi-Conference on Systems, Signals & Devices (SSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSD54932.2022.9955795","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Theoretically, phase retrieval is an efficient method for signal reconstruction given only the magnitude spectrum of the short-term Fourier transform (STFT). This topic has recently regained popularity due to its utility in a variety of applications such as compressive sensing, speech synthesis, speech enhancement, source separation, and so on. As a result, based on an explicit relationship between STFT magnitude and phase, this paper presents an efficient algorithm for audio signal reconstruction using phase retrieval from the STFT magnitude spectrum. The standard metrics in signal reconstruction, such as time domain segmental signal-to-noise ratio (segSNR), time-frequency domain signal-to-error ratio (SER), and cepstrum-related distance measures, such as log-likelihood ratio (LLR), Itakura-Saito distorsion (IS), and cepstrum distance, are used to perform an objective evaluation. The results support the proposed approach's improvement.