{"title":"联合稀疏性和独立性约束下基于环境相位估计的立体信号初级环境提取","authors":"Xiyu Song;Teng Tian;Shiqi Wang;Fangzhi Yao;Hongbing Qiu;Mei Wang;Hongyan Jiang","doi":"10.1109/TCE.2025.3563989","DOIUrl":null,"url":null,"abstract":"Primary-ambient extraction (PAE) is a technique to enhance the user listening experience in spatial audio reproduction. This is achieved by extracting the primary and ambient components from the sound scene. The PAE approach of ambient phase estimation with a sparsity constraint (APES) leverages the magnitude consistency of ambient components and the sparsity of the primary components to refine the PAE performance. This approach demonstrates an improved extraction accuracy when the ambient component is relatively strong. However, APES suffers from severe extraction errors when the primary amplitudes are equal in two channels of a stereo signal, which is a common sound scene in stereo signals. In this paper, the limitations of APES are analyzed, and a novel ambient phase estimation method is proposed under the joint constraints of sparsity and independence, called APESI. This method uses the independence between the primary component and the ambient component to correct the ambient phase estimation condition. Both objective and subjective experimental results demonstrate that the proposed APESI outperforms the APES and other traditional approaches in terms of extraction accuracy and ambient spatial accuracy, especially when the primary amplitudes are equal.","PeriodicalId":13208,"journal":{"name":"IEEE Transactions on Consumer Electronics","volume":"71 2","pages":"2806-2813"},"PeriodicalIF":10.9000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Primary-Ambient Extraction Using Ambient Phase Estimate Under Joint Sparsity and Independence Constraints for Stereo Signals\",\"authors\":\"Xiyu Song;Teng Tian;Shiqi Wang;Fangzhi Yao;Hongbing Qiu;Mei Wang;Hongyan Jiang\",\"doi\":\"10.1109/TCE.2025.3563989\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Primary-ambient extraction (PAE) is a technique to enhance the user listening experience in spatial audio reproduction. This is achieved by extracting the primary and ambient components from the sound scene. The PAE approach of ambient phase estimation with a sparsity constraint (APES) leverages the magnitude consistency of ambient components and the sparsity of the primary components to refine the PAE performance. This approach demonstrates an improved extraction accuracy when the ambient component is relatively strong. However, APES suffers from severe extraction errors when the primary amplitudes are equal in two channels of a stereo signal, which is a common sound scene in stereo signals. In this paper, the limitations of APES are analyzed, and a novel ambient phase estimation method is proposed under the joint constraints of sparsity and independence, called APESI. This method uses the independence between the primary component and the ambient component to correct the ambient phase estimation condition. Both objective and subjective experimental results demonstrate that the proposed APESI outperforms the APES and other traditional approaches in terms of extraction accuracy and ambient spatial accuracy, especially when the primary amplitudes are equal.\",\"PeriodicalId\":13208,\"journal\":{\"name\":\"IEEE Transactions on Consumer Electronics\",\"volume\":\"71 2\",\"pages\":\"2806-2813\"},\"PeriodicalIF\":10.9000,\"publicationDate\":\"2025-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Consumer Electronics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10976234/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Consumer Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976234/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Primary-Ambient Extraction Using Ambient Phase Estimate Under Joint Sparsity and Independence Constraints for Stereo Signals
Primary-ambient extraction (PAE) is a technique to enhance the user listening experience in spatial audio reproduction. This is achieved by extracting the primary and ambient components from the sound scene. The PAE approach of ambient phase estimation with a sparsity constraint (APES) leverages the magnitude consistency of ambient components and the sparsity of the primary components to refine the PAE performance. This approach demonstrates an improved extraction accuracy when the ambient component is relatively strong. However, APES suffers from severe extraction errors when the primary amplitudes are equal in two channels of a stereo signal, which is a common sound scene in stereo signals. In this paper, the limitations of APES are analyzed, and a novel ambient phase estimation method is proposed under the joint constraints of sparsity and independence, called APESI. This method uses the independence between the primary component and the ambient component to correct the ambient phase estimation condition. Both objective and subjective experimental results demonstrate that the proposed APESI outperforms the APES and other traditional approaches in terms of extraction accuracy and ambient spatial accuracy, especially when the primary amplitudes are equal.
期刊介绍:
The main focus for the IEEE Transactions on Consumer Electronics is the engineering and research aspects of the theory, design, construction, manufacture or end use of mass market electronics, systems, software and services for consumers.