A Temporal Convolutional Network for Weakly Supervised Action Segmentation

2021 7th IEEE International Conference on Network Intelligence and Digital Content (IC-NIDC). Pub Date: 2021-11-17. DOI: 10.1109/IC-NIDC54101.2021.9660442

Z. Zou, Jiaqi Zou, Junzhe Liu, Songlin Sun

Citations: 0

Abstract

Weakly supervised video action segmentation is a key problem in video content understanding. In this setting, the ground truth provides only the set of actions present, not frame-level labels. A popular line of work trains the prediction model with a neural network framework. Our key contribution is a new Hidden Markov Model (HMM) grounded on a Temporal Convolutional Network (TCN) that labels video frames and thereby generates a pseudo-ground truth for the subsequent pseudo-supervised training. At test time, we use the Viterbi algorithm to generate candidate temporal action sequences and select the maximum a posteriori sequence. We evaluate action segmentation on the Breakfast dataset, and experiments on this dataset show that our model performs well.
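The Viterbi step mentioned in the abstract can be illustrated with a minimal sketch. This is a generic textbook Viterbi decoder, not the authors' implementation: the function name `viterbi`, the two-state toy model, and all probabilities below are assumptions made for illustration. In a setup like the one described, the per-frame emission scores would presumably come from the TCN rather than from a hand-written table.

```python
import math


def viterbi(log_emis, log_trans, log_init):
    """Return the most probable state path and its log score.

    log_emis[t][s]  : log-likelihood of frame t under state s
    log_trans[p][s] : log-probability of moving from state p to state s
    log_init[s]     : log-probability of starting in state s
    """
    n_frames, n_states = len(log_emis), len(log_init)
    # score[s] = best log score of any path that ends in state s at the current frame
    score = [log_init[s] + log_emis[0][s] for s in range(n_states)]
    backptr = []  # backptr[t-1][s] = best predecessor of state s at frame t
    for t in range(1, n_frames):
        new_score, ptr = [], []
        for s in range(n_states):
            best_prev = max(range(n_states), key=lambda p: score[p] + log_trans[p][s])
            ptr.append(best_prev)
            new_score.append(score[best_prev] + log_trans[best_prev][s] + log_emis[t][s])
        score = new_score
        backptr.append(ptr)
    # Backtrack from the best final state to recover the full path
    last = max(range(n_states), key=lambda s: score[s])
    path = [last]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, score[last]


def to_log(rows):
    """Convert a table of probabilities to log-probabilities."""
    return [[math.log(p) for p in row] for row in rows]


# Hypothetical toy model: 2 action states, 4 frames (all values are made up)
log_init = [math.log(0.5), math.log(0.5)]
log_trans = to_log([[0.9, 0.1], [0.1, 0.9]])
log_emis = to_log([[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9]])

path, log_score = viterbi(log_emis, log_trans, log_init)
print(path)  # [0, 0, 1, 1]
```

In this toy example the decoder assigns the first two frames to state 0 and the last two to state 1, which is the frame labelling with the highest joint probability under the assumed model. Working in log space avoids numerical underflow on long videos.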
