Lip-reading by surveillance cameras

2017 Smart City Symposium Prague (SCSP) Pub Date : 2017-05-25 DOI:10.1109/SCSP.2017.7973348

L. Rothkrantz

引用次数: 10

Abstract

To increase the safety of citizens a network of surveillance cameras has been installed all over the city. These cameras enable analysis of behavior of people and objects. Aggressive behavior is from nature multimodal. Microphones attached to these cameras are not able to analyze speech in a noisy environments and if the speaker is too far away. Lip-movements of a talking mouth can be recorded and understood under limited conditions. From recent progress in the area of Artificial Intelligence it can be expected that large scale lip-reading will be possible next future. In this paper we report the state of the art of lip-reading for the Dutch language. We present a prototype developed at Delft University of Technology. The model is based on the Active Appearance model and Hidden Markov models. The results of experiments with the lip-reading will be represented too. The system has been successfully applied in trains to detect aggressive acts and violence against people and material.

查看原文本刊更多论文

通过监控摄像头读唇

为了提高市民的安全，全城都安装了监控摄像头网络。这些摄像头可以分析人和物体的行为。攻击行为是天生的多模式行为。连接在这些摄像头上的麦克风无法在嘈杂的环境中分析语音，如果扬声器离得太远。在有限的条件下，说话嘴的嘴唇运动可以被记录和理解。从人工智能领域的最新进展来看，可以预期大规模的唇读将在未来成为可能。在这篇论文中，我们报告了荷兰语唇读艺术的现状。我们展示了代尔夫特理工大学开发的一个原型。该模型基于活动外观模型和隐马尔可夫模型。唇读实验的结果也将被展示出来。该系统已成功应用于火车上，用于检测针对人员和物质的攻击行为和暴力行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 Smart City Symposium Prague (SCSP)

自引率

0.00%

发文量