Turbo Training with Token Dropout

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference Pub Date : 2022-10-10 DOI:10.48550/arXiv.2210.04889

Tengda Han, Weidi Xie, Andrew Zisserman

引用次数: 2

Abstract

The objective of this paper is an efficient training method for video tasks. We make three contributions: (1) We propose Turbo training, a simple and versatile training paradigm for Transformers on multiple video tasks. (2) We illustrate the advantages of Turbo training on action classification, video-language representation learning, and long-video activity classification, showing that Turbo training can largely maintain competitive performance while achieving almost 4X speed-up and significantly less memory consumption. (3) Turbo training enables long-schedule video-language training and end-to-end long-video training, delivering competitive or superior performance than previous works, which were infeasible to train under limited resources.

查看原文本刊更多论文

涡轮训练与令牌辍学

本文的目标是为视频任务提供一种有效的训练方法。我们做出了三个贡献:(1)我们提出Turbo训练，这是一个简单而通用的训练范式，用于多个视频任务的变形金刚。(2)我们说明了Turbo训练在动作分类、视频语言表征学习和长视频活动分类上的优势，表明Turbo训练在很大程度上保持了竞争性能，同时实现了近4倍的加速和显著减少的内存消耗。(3) Turbo训练可以实现长时间的视频语言训练和端到端的长视频训练，比以前的作品有竞争力或更好的表现，这在资源有限的情况下是不可训练的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMVC : proceedings of the British Machine Vision Conference. British Machine Vision Conference

自引率

0.00%

发文量