Clustering scenes in cooking video guided by object access

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) Pub Date : 2015-07-30 DOI:10.1109/ICMEW.2015.7169812

Yuki Matsumura, Atsushi Hashimoto, Shinsuke Mori, M. Mukunoki, M. Minoh

引用次数: 4

Abstract

We propose a method in which scenes in a cooking video are clustered for every type of food processing, such as cutting or stir-frying. To extract motion feature, the method first divides the video into segments. The obtained segments are then clustered based on the similarity of the extracted motion feature. The key point is how to divide the video at the first step of the method. Though a simple approach is to divide the video into segments with the same length, this approach cannot deal with the difference of food processing techniques in cooking. Instead, we propose an approach based on object access, namely the moments when a chef picks up or puts down objects. It is expected to obtain segments reflecting such difference. We compare our method with methods using fixed lengths on three cooking videos in the KUSK Dataset, and evaluate the performance for clustering.

查看原文本刊更多论文

基于对象访问的烹饪视频场景聚类

我们提出了一种方法，该方法将烹饪视频中的场景聚集在一起，用于每种类型的食品加工，例如切割或煸炒。为了提取运动特征，该方法首先将视频分割成多个片段。然后根据提取的运动特征的相似度对得到的片段进行聚类。在该方法的第一步，关键是如何分割视频。虽然简单的方法是将视频分成相同长度的片段，但这种方法无法处理烹饪中食物加工技术的差异。相反，我们提出了一种基于对象访问的方法，即厨师拿起或放下对象的时刻。期望得到反映这种差异的片段。我们将我们的方法与使用固定长度的方法在KUSK数据集中的三个烹饪视频上进行比较，并评估聚类的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW)

自引率

0.00%

发文量