基于图切和动态模型的手语视频手部跟踪与分割

2012 IEEE 11th International Conference on Signal Processing Pub Date : 2012-10-01 DOI:10.1109/ICOSP.2012.6491778

Jun Wan, Q. Ruan, Gaoyun An, Wei Li

{"title":"基于图切和动态模型的手语视频手部跟踪与分割","authors":"Jun Wan, Q. Ruan, Gaoyun An, Wei Li","doi":"10.1109/ICOSP.2012.6491778","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a new method for hands tracking and segmentation based on augmented graph cuts and dynamic model in sign language videos. We focus on resolving three problems which are fast hand motion capture, hand over face and hand occlusions. At first, an effective dynamic model for state prediction is used. This dynamic model can correctly predict the location of hand which has a rapid movement and quick shape deformation. Then, new energy terms are augmented into the energy function in graph cuts. The additional terms are inspired by multi cues, such as color, motion and spatial-temporal information. Finally, we construct the graph and achieve the hand segmentation in successive frames using min-cut/max-flow algorithm. We evaluate our algorithm in a real American Sign Language video from Purdue ASL Database. Besides, our method can be easily extended to track objects with similar color.","PeriodicalId":143331,"journal":{"name":"2012 IEEE 11th International Conference on Signal Processing","volume":"11 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Hand tracking and segmentation via graph cuts and dynamic model in sign language videos\",\"authors\":\"Jun Wan, Q. Ruan, Gaoyun An, Wei Li\",\"doi\":\"10.1109/ICOSP.2012.6491778\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a new method for hands tracking and segmentation based on augmented graph cuts and dynamic model in sign language videos. We focus on resolving three problems which are fast hand motion capture, hand over face and hand occlusions. At first, an effective dynamic model for state prediction is used. This dynamic model can correctly predict the location of hand which has a rapid movement and quick shape deformation. Then, new energy terms are augmented into the energy function in graph cuts. The additional terms are inspired by multi cues, such as color, motion and spatial-temporal information. Finally, we construct the graph and achieve the hand segmentation in successive frames using min-cut/max-flow algorithm. We evaluate our algorithm in a real American Sign Language video from Purdue ASL Database. Besides, our method can be easily extended to track objects with similar color.\",\"PeriodicalId\":143331,\"journal\":{\"name\":\"2012 IEEE 11th International Conference on Signal Processing\",\"volume\":\"11 9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 11th International Conference on Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSP.2012.6491778\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 11th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2012.6491778","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

本文提出了一种基于增广图割和动态模型的手语视频手部跟踪与分割方法。我们重点解决了三个问题，即快速手部动作捕捉，手过脸和手闭塞。首先，采用有效的动态模型进行状态预测。该动态模型能够准确地预测手部运动速度快、形状变形快的位置。然后，将新的能量项增广到图割中的能量函数中。附加术语的灵感来自多种线索，如颜色、运动和时空信息。最后，利用最小切割/最大流量算法构造图形，实现连续帧的手部分割。我们在普渡大学手语数据库的一个真实的美国手语视频中评估了我们的算法。此外，我们的方法可以很容易地扩展到跟踪相似颜色的物体。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Hand tracking and segmentation via graph cuts and dynamic model in sign language videos

In this paper, we propose a new method for hands tracking and segmentation based on augmented graph cuts and dynamic model in sign language videos. We focus on resolving three problems which are fast hand motion capture, hand over face and hand occlusions. At first, an effective dynamic model for state prediction is used. This dynamic model can correctly predict the location of hand which has a rapid movement and quick shape deformation. Then, new energy terms are augmented into the energy function in graph cuts. The additional terms are inspired by multi cues, such as color, motion and spatial-temporal information. Finally, we construct the graph and achieve the hand segmentation in successive frames using min-cut/max-flow algorithm. We evaluate our algorithm in a real American Sign Language video from Purdue ASL Database. Besides, our method can be easily extended to track objects with similar color.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 IEEE 11th International Conference on Signal Processing

自引率

0.00%

发文量