音频和声学应用中的多通道信号处理特刊导论

IEEE Trans. Speech Audio Process. Pub Date : 2004-08-16 DOI:10.1109/TSA.2004.833716

Walter Kellermann, M. Sondhi, D. DeVries

{"title":"音频和声学应用中的多通道信号处理特刊导论","authors":"Walter Kellermann, M. Sondhi, D. DeVries","doi":"10.1109/TSA.2004.833716","DOIUrl":null,"url":null,"abstract":"HE IEEE Signal Processing Society has its roots in an area where acoustics, speech, and signal processing converge, as was reflected in the former name of the society when it was founded in 1974. The interface between acoustics, speech, and signal processing is still an area of great interest to the society, with many fundamental problems still unsolved. Research is driven by applications where acoustic signals have to be captured, transmitted, and/or reproduced in an acoustic environment that includes echoes, noise, and reverberation Considering human/machine interfaces as a major area of applications, it is obvious that signal processing becomes more challenging as the distance between humans and the machines increases, as the signal bandwidth increases, and as the acoustic environment becomes more complex and hostile. Increasingly sophisticated algorithms have been developed since the mid-1970s and along with the availability of greatly increased and affordable computational power, multichannel signal processing algorithms naturally evolved for exploiting the spatial dimension of acoustic signals. The importance and popularity of this field was well reflected by the large number of submissions to this special issue. The volume of high-quality papers could not be fitted into the page budget allotted to us. Thus, we regrettably had to decide to publish some of them in a second section of this special issue as part of a regular issue of the TRANSACTIONS in early 2005. For sound reproduction, where we want to provide a pair of desired signals at the listeners’ ear drums, seamless human/machine interfaces based on multichannel techniques have been implemented since the invention of stereo systems. However, providing the true spatial sound experience in large listening spaces became possible only with new multichannel signal processing techniques, such as wavefield synthesis. Still, major challenges remain, especially phase-true equalization of listening room acoustics and the cancellation of local noise sources and interferers. On the other hand, acquisition of audio and speech signals has been a research topic since the invention of the microphone and still today presents major challenges for the signal processing community. Structurally the simplest problem, the acoustic feedback from loudspeakers into microphones is addressed by acoustic echo cancellation: From the single-channel case which has been investigated since the 1970s, research has moved on to stereo and multichannel reproduction, recently culminating in a new wave-domain adaptive filtering concept which has been presented for the first time at ICASSP 2004. For removing unwanted interference and noise from desired signals, multichannel techniques utilize spatial diversity to discriminate between desired and undesired components, either by exploiting different spatial coherence properties or by beamforming, which directs a beam of increased sensitivity towards the desired source. For traditional beamforming, source localization is necessary if the location of the source is not known a priori. Reflecting current research emphasis in these areas, the paper by Cohen addresses filtering of the","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"11 1","pages":"449-450"},"PeriodicalIF":0.0000,"publicationDate":"2004-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Introduction to the Special Issue on Multichannel Signal Processing for Audio and Acoustics Applications\",\"authors\":\"Walter Kellermann, M. Sondhi, D. DeVries\",\"doi\":\"10.1109/TSA.2004.833716\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"HE IEEE Signal Processing Society has its roots in an area where acoustics, speech, and signal processing converge, as was reflected in the former name of the society when it was founded in 1974. The interface between acoustics, speech, and signal processing is still an area of great interest to the society, with many fundamental problems still unsolved. Research is driven by applications where acoustic signals have to be captured, transmitted, and/or reproduced in an acoustic environment that includes echoes, noise, and reverberation Considering human/machine interfaces as a major area of applications, it is obvious that signal processing becomes more challenging as the distance between humans and the machines increases, as the signal bandwidth increases, and as the acoustic environment becomes more complex and hostile. Increasingly sophisticated algorithms have been developed since the mid-1970s and along with the availability of greatly increased and affordable computational power, multichannel signal processing algorithms naturally evolved for exploiting the spatial dimension of acoustic signals. The importance and popularity of this field was well reflected by the large number of submissions to this special issue. The volume of high-quality papers could not be fitted into the page budget allotted to us. Thus, we regrettably had to decide to publish some of them in a second section of this special issue as part of a regular issue of the TRANSACTIONS in early 2005. For sound reproduction, where we want to provide a pair of desired signals at the listeners’ ear drums, seamless human/machine interfaces based on multichannel techniques have been implemented since the invention of stereo systems. However, providing the true spatial sound experience in large listening spaces became possible only with new multichannel signal processing techniques, such as wavefield synthesis. Still, major challenges remain, especially phase-true equalization of listening room acoustics and the cancellation of local noise sources and interferers. On the other hand, acquisition of audio and speech signals has been a research topic since the invention of the microphone and still today presents major challenges for the signal processing community. Structurally the simplest problem, the acoustic feedback from loudspeakers into microphones is addressed by acoustic echo cancellation: From the single-channel case which has been investigated since the 1970s, research has moved on to stereo and multichannel reproduction, recently culminating in a new wave-domain adaptive filtering concept which has been presented for the first time at ICASSP 2004. For removing unwanted interference and noise from desired signals, multichannel techniques utilize spatial diversity to discriminate between desired and undesired components, either by exploiting different spatial coherence properties or by beamforming, which directs a beam of increased sensitivity towards the desired source. For traditional beamforming, source localization is necessary if the location of the source is not known a priori. Reflecting current research emphasis in these areas, the paper by Cohen addresses filtering of the\",\"PeriodicalId\":13155,\"journal\":{\"name\":\"IEEE Trans. Speech Audio Process.\",\"volume\":\"11 1\",\"pages\":\"449-450\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-08-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Trans. Speech Audio Process.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSA.2004.833716\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Trans. Speech Audio Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSA.2004.833716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

IEEE信号处理学会起源于声学、语音和信号处理融合的领域，正如1974年成立时该学会的前名称所反映的那样。声学、语音和信号处理之间的接口仍然是一个社会非常感兴趣的领域，许多基本问题仍未解决。研究是由声学信号必须在包括回声、噪声和混响的声学环境中捕获、传输和/或再现的应用驱动的。考虑到人机界面是一个主要的应用领域，很明显，随着人与机器之间距离的增加，信号带宽的增加，以及声学环境变得更加复杂和恶劣，信号处理变得更具挑战性。自20世纪70年代中期以来，越来越复杂的算法得到了发展，随着计算能力的大大提高和可负担得起，多通道信号处理算法自然演变为利用声学信号的空间维度。这一领域的重要性和受欢迎程度在本期特刊收到的大量稿件中得到了很好的反映。高质量论文的数量无法装入分配给我们的页数预算中。因此，遗憾的是，我们不得不决定在2005年初的《汇刊》特刊的第二部分中发表其中的一些文章。对于声音再现，我们希望在听众的耳鼓上提供一对所需的信号，自立体声系统发明以来，基于多声道技术的无缝人机接口已经实现。然而，只有采用新的多通道信号处理技术，如波场合成，才能在大的听音空间中提供真正的空间声音体验。然而，主要的挑战仍然存在，特别是听音室声学的相位真实均衡和局部噪声源和干扰的消除。另一方面，自麦克风发明以来，音频和语音信号的采集一直是一个研究课题，至今仍是信号处理领域面临的主要挑战。结构上最简单的问题，从扬声器到麦克风的声学反馈是通过声学回声消除来解决的:从20世纪70年代以来调查的单通道情况，研究已经转移到立体声和多通道再现，最近在ICASSP 2004上首次提出了一个新的波域自适应滤波概念。为了从期望的信号中去除不必要的干扰和噪声，多通道技术利用空间分集来区分期望的和不希望的分量，要么利用不同的空间相干特性，要么通过波束成形，将灵敏度更高的波束指向期望的源。对于传统的波束形成，如果不能先验地知道源的位置，则需要对源进行定位。反映当前的研究重点在这些领域，科恩的论文解决过滤

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Introduction to the Special Issue on Multichannel Signal Processing for Audio and Acoustics Applications

HE IEEE Signal Processing Society has its roots in an area where acoustics, speech, and signal processing converge, as was reflected in the former name of the society when it was founded in 1974. The interface between acoustics, speech, and signal processing is still an area of great interest to the society, with many fundamental problems still unsolved. Research is driven by applications where acoustic signals have to be captured, transmitted, and/or reproduced in an acoustic environment that includes echoes, noise, and reverberation Considering human/machine interfaces as a major area of applications, it is obvious that signal processing becomes more challenging as the distance between humans and the machines increases, as the signal bandwidth increases, and as the acoustic environment becomes more complex and hostile. Increasingly sophisticated algorithms have been developed since the mid-1970s and along with the availability of greatly increased and affordable computational power, multichannel signal processing algorithms naturally evolved for exploiting the spatial dimension of acoustic signals. The importance and popularity of this field was well reflected by the large number of submissions to this special issue. The volume of high-quality papers could not be fitted into the page budget allotted to us. Thus, we regrettably had to decide to publish some of them in a second section of this special issue as part of a regular issue of the TRANSACTIONS in early 2005. For sound reproduction, where we want to provide a pair of desired signals at the listeners’ ear drums, seamless human/machine interfaces based on multichannel techniques have been implemented since the invention of stereo systems. However, providing the true spatial sound experience in large listening spaces became possible only with new multichannel signal processing techniques, such as wavefield synthesis. Still, major challenges remain, especially phase-true equalization of listening room acoustics and the cancellation of local noise sources and interferers. On the other hand, acquisition of audio and speech signals has been a research topic since the invention of the microphone and still today presents major challenges for the signal processing community. Structurally the simplest problem, the acoustic feedback from loudspeakers into microphones is addressed by acoustic echo cancellation: From the single-channel case which has been investigated since the 1970s, research has moved on to stereo and multichannel reproduction, recently culminating in a new wave-domain adaptive filtering concept which has been presented for the first time at ICASSP 2004. For removing unwanted interference and noise from desired signals, multichannel techniques utilize spatial diversity to discriminate between desired and undesired components, either by exploiting different spatial coherence properties or by beamforming, which directs a beam of increased sensitivity towards the desired source. For traditional beamforming, source localization is necessary if the location of the source is not known a priori. Reflecting current research emphasis in these areas, the paper by Cohen addresses filtering of the

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Trans. Speech Audio Process.

自引率

0.00%

发文量