SVQ-ACT: Querying for Actions over Videos

Daren Chao, Kaiwen Chen, Nick Koudas

2023 IEEE 39th International Conference on Data Engineering (ICDE), April 2023
DOI: 10.1109/ICDE55515.2023.00277
Citations: 1
Abstract
We present SVQ-ACT, a system capable of evaluating declarative action and object queries over input videos. Our approach is independent of the underlying object and action detection models utilized. Users may issue queries involving an action and specific objects (e.g., a human riding a bicycle, close to a traffic light, with a car left of the bicycle) and identify video clips that satisfy the query constraints. Our system operates in two main settings, namely online and offline. In the online setting, the user specifies a video source (e.g., a surveillance video) and a declarative query containing action and object predicates. Our system identifies and labels, in real time, all frame sequences that match the query. In the offline mode, the system accepts a video repository as input, preprocesses all videos in an offline manner, and extracts suitable metadata. Following this step, users can interactively execute any query they wish on the video repository (involving actions and objects supported by the underlying detection models) to identify sequences of frames from videos that satisfy the query. In this case, to limit the number of results produced, we introduce novel result ranking algorithms that produce the k most relevant results efficiently. We demonstrate that SVQ-ACT correctly captures the desired query semantics and executes queries efficiently, delivering a high degree of accuracy.
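To make the flavor of such queries concrete, the sketch below shows one way spatial object predicates like those in the example query ("close to a traffic light and a car left of the bicycle") could be evaluated over per-frame detections, with consecutive matching frames grouped into clips. This is a minimal illustration under assumed conventions (bounding-box format, distance threshold, label names), not SVQ-ACT's actual query language or API; the action predicate ("riding") would additionally require an action detection model and is omitted here.

```python
# Hypothetical sketch: checking spatial predicates over per-frame object
# detections and grouping consecutive matching frames into clips.
# Box format (x1, y1, x2, y2), labels, and thresholds are illustrative
# assumptions, not SVQ-ACT's actual interface.

def center(box):
    # Geometric center of an (x1, y1, x2, y2) bounding box.
    return ((box[0] + box[2]) / 2, (box[1] + box[3]) / 2)

def left_of(a, b):
    # True if a's center lies to the left of b's center.
    return center(a)[0] < center(b)[0]

def close_to(a, b, thresh=150.0):
    # True if the centers of a and b are within `thresh` pixels.
    ax, ay = center(a)
    bx, by = center(b)
    return ((ax - bx) ** 2 + (ay - by) ** 2) ** 0.5 <= thresh

def frame_matches(dets):
    # dets: mapping from object label to one bounding box in this frame.
    # Encodes the object part of the example query: a person and a
    # bicycle present, the bicycle close to a traffic light, and a car
    # left of the bicycle.
    needed = {"person", "bicycle", "traffic light", "car"}
    if not needed <= dets.keys():
        return False
    return (close_to(dets["bicycle"], dets["traffic light"])
            and left_of(dets["car"], dets["bicycle"]))

def matching_clips(frames, min_len=2):
    # Group consecutive matching frame indices into (start, end) clips
    # of at least `min_len` frames.
    clips, start = [], None
    for i, dets in enumerate(frames):
        if frame_matches(dets):
            if start is None:
                start = i
        else:
            if start is not None and i - start >= min_len:
                clips.append((start, i - 1))
            start = None
    if start is not None and len(frames) - start >= min_len:
        clips.append((start, len(frames) - 1))
    return clips
```

A query evaluator along these lines is model-independent in the same spirit as the paper describes: it consumes only detection outputs (labels and boxes), so the underlying detector can be swapped freely.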