Dynamic Clustering Algorithm for Video Summarization on VSUMM Dataset

物联网技术 (Internet of Things Technology) · Pub Date: 2023-01-05 · DOI: 10.1109/IDCIoT56793.2023.10053559

Venkata Ravi Teja Jaladanki, Rajeswara Rao Duvvada, Hari Venkata Samba Siva Rao Badugu


Citations: 0

Abstract



The volume of video produced worldwide each day is enormous, and in the past few years it has grown to an unprecedented level. Extracting information from a video, however, is more difficult than extracting it from an image: a viewer must watch the entire video to understand its context. Beyond context awareness, a universally applicable summary is nearly impossible to produce because each viewer prefers different keyframes. Many approaches have been proposed to tackle this problem, including supervised and unsupervised learning techniques, some of them based on deep learning. A supervised approach to video summarization, however, would require a significant amount of individualized data labelling. In this paper, we develop an unsupervised algorithm for video summarization based on dynamic clustering of projected frame histograms. We test the approach on VSUMM, a benchmark dataset, and show that dynamic clustering performs competitively with some existing approaches.
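The abstract names the ingredients of the method (frame histograms, a projection, dynamic clustering) but not their details, so the following is only a minimal sketch of how such a pipeline could look, not the authors' implementation. Everything specific here is an assumption made for illustration: the per-channel intensity histogram, the SVD-based projection, the sequential distance-threshold rule for opening new clusters, the choice of the frame nearest the centroid as keyframe, the function names, and the synthetic "video" used as input.

```python
import numpy as np


def frame_histogram(frame, bins=8):
    """Concatenate per-channel intensity histograms, normalized to sum to 1."""
    channels = [
        np.histogram(frame[..., c], bins=bins, range=(0, 256))[0]
        for c in range(frame.shape[-1])
    ]
    hist = np.concatenate(channels).astype(np.float64)
    return hist / hist.sum()


def project(histograms, rank=3):
    """Project centered histograms onto their top `rank` singular directions
    (assumed projection; the abstract does not say which one is used)."""
    centered = histograms - histograms.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:rank].T


def dynamic_clusters(points, threshold):
    """Visit frames in temporal order; open a new cluster whenever a frame is
    farther than `threshold` from the running centroid of the current cluster.
    The number of clusters is therefore not fixed in advance."""
    clusters = [[0]]
    centroid = points[0].copy()
    for i in range(1, len(points)):
        if np.linalg.norm(points[i] - centroid) <= threshold:
            clusters[-1].append(i)
            centroid = points[clusters[-1]].mean(axis=0)
        else:
            clusters.append([i])
            centroid = points[i].copy()
    return clusters


def keyframes(points, clusters):
    """Pick, for each cluster, the frame closest to the cluster centroid."""
    picks = []
    for members in clusters:
        centroid = points[members].mean(axis=0)
        dists = np.linalg.norm(points[members] - centroid, axis=1)
        picks.append(members[int(np.argmin(dists))])
    return picks


def synthetic_video(seed=0):
    """Hypothetical input: three 'shots' of ten 16x16 RGB frames each,
    every shot drawn from a distinct brightness range."""
    rng = np.random.default_rng(seed)
    shots = [(0, 60), (100, 160), (200, 256)]
    return [
        rng.integers(lo, hi, size=(16, 16, 3), dtype=np.uint8)
        for lo, hi in shots
        for _ in range(10)
    ]


frames = synthetic_video()
hists = np.stack([frame_histogram(f) for f in frames])
points = project(hists, rank=3)
clusters = dynamic_clusters(points, threshold=0.3)
summary = keyframes(points, clusters)
print("cluster sizes:", [len(c) for c in clusters])
print("keyframe indices:", summary)
```

The threshold of 0.3 is tuned to this toy input, where the three brightness ranges are far apart relative to within-shot noise; real footage would need a threshold chosen from the data, and evaluating against VSUMM would additionally require comparing the selected keyframes with the dataset's user summaries.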

