Multiple Speaker Localization in a Smart Room

M. M. Nejad, D. Mahmoodi, Salehe Zohroudi
{"title":"Multiple Speaker Localization in a Smart Room","authors":"M. M. Nejad, D. Mahmoodi, Salehe Zohroudi","doi":"10.1109/CMSP.2011.152","DOIUrl":null,"url":null,"abstract":"in recent years, there has been growing an interest in intelligent system. Human-machine interaction and the automatic analysis of meeting in smart room is an emerging research field. One of the most important tasks in a smart room is localization of multi-speaker that permits a wide spectrum of application. In this paper, by using the Combined of hyperbolae produced by time delay estimation (TDE) between several microphones pair and the head orientation information, a new acoustic multi-speaker localization function has been proposed that we call it OPROD-PHAT function. We implement a grid-based, multiple speaker localization method. On the multiple moving speaker location estimation, the new approach has been proposed, that to find number of active source in each time frame, the power of cross correlation function has been used. After find the loudest source present by maximizing the energy of a steered beamformer, in order to localize other source, the process is repeated by removing the contribution of the first source. The result of simulation show superior performance of proposed system.","PeriodicalId":309902,"journal":{"name":"2011 International Conference on Multimedia and Signal Processing","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on Multimedia and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CMSP.2011.152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

in recent years, there has been growing an interest in intelligent system. Human-machine interaction and the automatic analysis of meeting in smart room is an emerging research field. One of the most important tasks in a smart room is localization of multi-speaker that permits a wide spectrum of application. In this paper, by using the Combined of hyperbolae produced by time delay estimation (TDE) between several microphones pair and the head orientation information, a new acoustic multi-speaker localization function has been proposed that we call it OPROD-PHAT function. We implement a grid-based, multiple speaker localization method. On the multiple moving speaker location estimation, the new approach has been proposed, that to find number of active source in each time frame, the power of cross correlation function has been used. After find the loudest source present by maximizing the energy of a steered beamformer, in order to localize other source, the process is repeated by removing the contribution of the first source. The result of simulation show superior performance of proposed system.
智能房间中的多扬声器定位
近年来,人们对智能系统的兴趣日益浓厚。智能会议室的人机交互和会议自动分析是一个新兴的研究领域。智能房间中最重要的任务之一是多扬声器的定位,以实现广泛的应用范围。本文利用多个传声器对之间的时延估计(TDE)产生的双曲线与头部方向信息的组合,提出了一种新的声学多扬声器定位函数,我们称之为OPROD-PHAT函数。我们实现了一种基于网格的多说话人定位方法。在多运动扬声器的位置估计中,提出了一种新的方法,即利用互相关函数的幂来确定每个时间帧内的有源数量。在通过最大化定向波束形成器的能量找到存在的最大噪声源后,为了定位其他噪声源,通过去除第一个噪声源的贡献来重复该过程。仿真结果表明该系统具有良好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信