基于联想神经网络的低分辨率视频识别方法

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005. Pub Date : 2005-12-27 DOI:10.1109/IJCNN.2005.1556419

Dmitry O. Gorodnichy

{"title":"基于联想神经网络的低分辨率视频识别方法","authors":"Dmitry O. Gorodnichy","doi":"10.1109/IJCNN.2005.1556419","DOIUrl":null,"url":null,"abstract":"Techniques developed for recognition of objects in photographs often fail when applied to recognition of the same objects in video. A critical example of such a situation is seen in face recognition, where many technologies are already intensively used for passport verification and where there is no technology that can reliably identify a person from a surveillance video. The reason for this is that video provides images of much lower quality and resolution than that of photographs. Besides, objects in video are normally captured in unconstrained environments, often under poor lighting, in motion and at a distance. This makes memorization of an object from a single video frame unreliable and recognition based on a single video frame very difficult if even possible. This paper introduces a neuro-associative approach to recognition which can both learn and identify an object from low-resolution low-quality video sequences. This approach is derived from a mathematical model of biological visual memory, in which correlation-based projection learning is used to memorize a face from a video sequence and attractor-based association is performed to recognize a face over several video frames. The approach is demonstrated using a video-based facial database and real-time video annotation of TV shows.","PeriodicalId":365690,"journal":{"name":"Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":"{\"title\":\"Associative neural networks as means for low-resolution video-based recognition\",\"authors\":\"Dmitry O. Gorodnichy\",\"doi\":\"10.1109/IJCNN.2005.1556419\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Techniques developed for recognition of objects in photographs often fail when applied to recognition of the same objects in video. A critical example of such a situation is seen in face recognition, where many technologies are already intensively used for passport verification and where there is no technology that can reliably identify a person from a surveillance video. The reason for this is that video provides images of much lower quality and resolution than that of photographs. Besides, objects in video are normally captured in unconstrained environments, often under poor lighting, in motion and at a distance. This makes memorization of an object from a single video frame unreliable and recognition based on a single video frame very difficult if even possible. This paper introduces a neuro-associative approach to recognition which can both learn and identify an object from low-resolution low-quality video sequences. This approach is derived from a mathematical model of biological visual memory, in which correlation-based projection learning is used to memorize a face from a video sequence and attractor-based association is performed to recognize a face over several video frames. The approach is demonstrated using a video-based facial database and real-time video annotation of TV shows.\",\"PeriodicalId\":365690,\"journal\":{\"name\":\"Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-12-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"26\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IJCNN.2005.1556419\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2005.1556419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 26

摘要

为识别照片中的物体而开发的技术在应用于识别视频中的相同物体时往往失败。这种情况的一个关键例子是面部识别，其中许多技术已经被广泛用于护照核查，并且没有技术可以从监控视频中可靠地识别一个人。这样做的原因是视频提供的图像质量和分辨率都比照片低得多。此外，视频中的物体通常是在不受约束的环境中拍摄的，通常是在光线不足、运动和距离较远的情况下。这使得从单个视频帧记忆对象变得不可靠，并且即使可能，基于单个视频帧的识别也非常困难。本文介绍了一种神经联想识别方法，可以从低分辨率、低质量的视频序列中学习和识别目标。该方法来源于生物视觉记忆的数学模型，其中使用基于相关性的投影学习来记忆视频序列中的人脸，并执行基于吸引子的关联来识别多个视频帧中的人脸。利用基于视频的人脸数据库和电视节目的实时视频注释对该方法进行了验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Associative neural networks as means for low-resolution video-based recognition

Techniques developed for recognition of objects in photographs often fail when applied to recognition of the same objects in video. A critical example of such a situation is seen in face recognition, where many technologies are already intensively used for passport verification and where there is no technology that can reliably identify a person from a surveillance video. The reason for this is that video provides images of much lower quality and resolution than that of photographs. Besides, objects in video are normally captured in unconstrained environments, often under poor lighting, in motion and at a distance. This makes memorization of an object from a single video frame unreliable and recognition based on a single video frame very difficult if even possible. This paper introduces a neuro-associative approach to recognition which can both learn and identify an object from low-resolution low-quality video sequences. This approach is derived from a mathematical model of biological visual memory, in which correlation-based projection learning is used to memorize a face from a video sequence and attractor-based association is performed to recognize a face over several video frames. The approach is demonstrated using a video-based facial database and real-time video annotation of TV shows.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.

自引率

0.00%

发文量