面向可扩展的人体运动片段检索

2020 IEEE International Symposium on Multimedia (ISM) Pub Date : 2020-12-01 DOI:10.1109/ISM.2020.00015

Petra Budíková, J. Sedmidubský, J. Horvath, P. Zezula

{"title":"面向可扩展的人体运动片段检索","authors":"Petra Budíková, J. Sedmidubský, J. Horvath, P. Zezula","doi":"10.1109/ISM.2020.00015","DOIUrl":null,"url":null,"abstract":"With the increasing availability of human motion data captured in the form of 2D/3D skeleton sequences, more complex motion recordings need to be processed. In this paper, we focus on the similarity-based retrieval of motion episodes - medium-sized skeleton sequences that consist of multiple semantic actions and correspond to some logical motion unit (e.g., a figure skating performance). We examine two orthogonal approaches to the episode-matching task: (1) the deep learning approach that is traditionally used for processing short motion actions, and (2) the motion-word technique that transforms skeleton sequences into a text-like representation. Since the second approach is more promising, we propose a two-phase retrieval scheme that combines mature text-processing techniques with application-specific refinement methods. We demonstrate that this solution achieves promising results in both effectiveness and efficiency, and can be further indexed to implement scalable episode retrieval.","PeriodicalId":120972,"journal":{"name":"2020 IEEE International Symposium on Multimedia (ISM)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Towards Scalable Retrieval of Human Motion Episodes\",\"authors\":\"Petra Budíková, J. Sedmidubský, J. Horvath, P. Zezula\",\"doi\":\"10.1109/ISM.2020.00015\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the increasing availability of human motion data captured in the form of 2D/3D skeleton sequences, more complex motion recordings need to be processed. In this paper, we focus on the similarity-based retrieval of motion episodes - medium-sized skeleton sequences that consist of multiple semantic actions and correspond to some logical motion unit (e.g., a figure skating performance). We examine two orthogonal approaches to the episode-matching task: (1) the deep learning approach that is traditionally used for processing short motion actions, and (2) the motion-word technique that transforms skeleton sequences into a text-like representation. Since the second approach is more promising, we propose a two-phase retrieval scheme that combines mature text-processing techniques with application-specific refinement methods. We demonstrate that this solution achieves promising results in both effectiveness and efficiency, and can be further indexed to implement scalable episode retrieval.\",\"PeriodicalId\":120972,\"journal\":{\"name\":\"2020 IEEE International Symposium on Multimedia (ISM)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Symposium on Multimedia (ISM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISM.2020.00015\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Symposium on Multimedia (ISM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM.2020.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着以2D/3D骨骼序列形式捕获的人体运动数据的增加，需要处理更复杂的运动记录。在本文中，我们专注于基于相似度的运动片段检索-由多个语义动作组成的中等骨架序列，并对应于一些逻辑运动单元(例如，花样滑冰表演)。我们研究了情节匹配任务的两种正交方法:(1)传统上用于处理短动作的深度学习方法，以及(2)将骨架序列转换为类似文本表示的动作词技术。由于第二种方法更有前途，我们提出了一种两阶段检索方案，该方案将成熟的文本处理技术与特定于应用程序的细化方法相结合。我们证明了该解决方案在有效性和效率方面都取得了令人鼓舞的结果，并且可以进一步索引以实现可扩展的集检索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Towards Scalable Retrieval of Human Motion Episodes

With the increasing availability of human motion data captured in the form of 2D/3D skeleton sequences, more complex motion recordings need to be processed. In this paper, we focus on the similarity-based retrieval of motion episodes - medium-sized skeleton sequences that consist of multiple semantic actions and correspond to some logical motion unit (e.g., a figure skating performance). We examine two orthogonal approaches to the episode-matching task: (1) the deep learning approach that is traditionally used for processing short motion actions, and (2) the motion-word technique that transforms skeleton sequences into a text-like representation. Since the second approach is more promising, we propose a two-phase retrieval scheme that combines mature text-processing techniques with application-specific refinement methods. We demonstrate that this solution achieves promising results in both effectiveness and efficiency, and can be further indexed to implement scalable episode retrieval.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE International Symposium on Multimedia (ISM)

自引率

0.00%

发文量