Methodology for Translation of Video Content Activates into Text Description: Three Object Activities Action

Author: Ramesh M. Kagalkar
DOI: 10.5815/ijigsp.2022.04.05 (https://doi.org/10.5815/ijigsp.2022.04.05)
Journal: International Journal of Image, Graphics and Signal Processing
Publication date: 2022-08-08 (Journal Article)
Platform: Semanticscholar
Citations: 0
Abstract
This paper presents a method for generating natural language text descriptions from video content activities. The system analyzes a video to identify the objects it contains, tracks the actions and activities taking place, matches them against an action model, and then generates a grammatically correct English description. The approach has two phases: training and testing. In the training phase, a database of subject-verb-object assignments is maintained and image features are extracted; in the testing phase, text descriptions are generated automatically from video content. The implemented system translates complex video content, up to one minute in duration and involving three distinct objects, into text descriptions. For evaluation, a standard YouTube database of 250 samples drawn from 50 different domains is used. The overall system achieves an accuracy of 93%.
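The generation step the abstract describes (mapping detected objects and matched actions to a grammatical English sentence via subject-verb-object assignments) can be illustrated with a minimal templating sketch. The triple format, function name, and example detections below are assumptions for illustration only, not the paper's actual implementation.

```python
# Minimal sketch of SVO-based sentence generation, assuming the upstream
# detector/tracker has already produced (subject, verb, object) triples.
# The triple format and the present-continuous template are illustrative
# assumptions, not the method described in the paper.
from typing import Optional


def describe(subject: str, verb: str, obj: Optional[str] = None) -> str:
    """Render one subject-verb-object triple as an English sentence."""
    if obj is not None:
        return f"A {subject} is {verb} a {obj}."
    return f"A {subject} is {verb}."


# Hypothetical detections from a one-minute clip with three tracked objects.
triples = [
    ("man", "riding", "bicycle"),
    ("dog", "running", None),
    ("woman", "holding", "umbrella"),
]

description = " ".join(describe(s, v, o) for s, v, o in triples)
print(description)
```

A real system would populate the triples from object detection and action-model matching rather than a hard-coded list, and would need richer grammar handling (tense, articles, plurals) than a single template provides.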