Multi speaker detection and tracking using audio and video sensors with gesture analysis

B. Hariharan, S. Hari, Uma Gopalakrishnan
Published in: 2013 Tenth International Conference on Wireless and Optical Communications Networks (WOCN)
Publication date: 2013-07-26
DOI: 10.1109/WOCN.2013.6616222 (https://doi.org/10.1109/WOCN.2013.6616222)
Citations: 2

Abstract

Video conferencing plays an important role in many corporate and educational settings. E-learning uses video conferencing for interaction between students and tutors in different locations. The tutor is physically present in a real classroom, and remote students view the tutor through video in a virtual classroom. Wireless microphones and video sensors are used to facilitate interaction between students and tutors, but this can be less efficient when there are multiple speakers. In that case, it would be helpful to identify the student who asks a question first, in either the virtual or the real classroom, by using audio and video sensors. To make an E-learning classroom as similar as possible to a real classroom, we propose a system that uses the professor's gestures to decide who can ask questions. This is particularly useful when there are several speakers in an E-learning classroom. The first student to ask a question is located using audio and video sensors in either the virtual or the real classroom. The student's raised hand, together with the student's voice, is used for localization. This method helps both the professor and the students get the experience of being in a real classroom.