Venkata Ravi Teja Jaladanki, Rajeswara Rao Duvvada, Hari Venkata Samba Siva Rao Badugu
{"title":"基于VSUMM数据集的视频摘要动态聚类算法","authors":"Venkata Ravi Teja Jaladanki, Rajeswara Rao Duvvada, Hari Venkata Samba Siva Rao Badugu","doi":"10.1109/IDCIoT56793.2023.10053559","DOIUrl":null,"url":null,"abstract":"The extent of videos being produced per day across the world is enormous, and in the past few years it has increased to an unprecedented level. Information extraction from a video, however, is more difficult than information extraction from an image. A viewer must see the entire video in order to understand its context. Aside from context awareness, it is nearly impossible to make a universally applicable summary video because each person has a different preferred keyframe. A number of approaches came into existence for tackling this problem which include supervised and unsupervised learning techniques, and some associated with Deep Learning techniques. However, it would require a significant amount of individualized data labelling if we attempted to approach problem video summarizing via a supervised learning method. In this paper, we developed an algorithm based on Dynamic Clustering of projected frame histograms approach to address the challenge of video summarization using unsupervised learning. We have tested the performance of the approach on the VSUMM, a benchmark dataset and showcased that using dynamic clustering algorithm has been proven to perform competitively with some existing approaches.","PeriodicalId":60583,"journal":{"name":"物联网技术","volume":"60 1","pages":"831-837"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dynamic Clustering Algorithm for Video Summarization on VSUMM Dataset\",\"authors\":\"Venkata Ravi Teja Jaladanki, Rajeswara Rao Duvvada, Hari Venkata Samba Siva Rao Badugu\",\"doi\":\"10.1109/IDCIoT56793.2023.10053559\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The extent of videos being produced per day across the world is enormous, and in the past few years it has increased to an unprecedented level. Information extraction from a video, however, is more difficult than information extraction from an image. A viewer must see the entire video in order to understand its context. Aside from context awareness, it is nearly impossible to make a universally applicable summary video because each person has a different preferred keyframe. A number of approaches came into existence for tackling this problem which include supervised and unsupervised learning techniques, and some associated with Deep Learning techniques. However, it would require a significant amount of individualized data labelling if we attempted to approach problem video summarizing via a supervised learning method. In this paper, we developed an algorithm based on Dynamic Clustering of projected frame histograms approach to address the challenge of video summarization using unsupervised learning. We have tested the performance of the approach on the VSUMM, a benchmark dataset and showcased that using dynamic clustering algorithm has been proven to perform competitively with some existing approaches.\",\"PeriodicalId\":60583,\"journal\":{\"name\":\"物联网技术\",\"volume\":\"60 1\",\"pages\":\"831-837\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"物联网技术\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.1109/IDCIoT56793.2023.10053559\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"物联网技术","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/IDCIoT56793.2023.10053559","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dynamic Clustering Algorithm for Video Summarization on VSUMM Dataset
The extent of videos being produced per day across the world is enormous, and in the past few years it has increased to an unprecedented level. Information extraction from a video, however, is more difficult than information extraction from an image. A viewer must see the entire video in order to understand its context. Aside from context awareness, it is nearly impossible to make a universally applicable summary video because each person has a different preferred keyframe. A number of approaches came into existence for tackling this problem which include supervised and unsupervised learning techniques, and some associated with Deep Learning techniques. However, it would require a significant amount of individualized data labelling if we attempted to approach problem video summarizing via a supervised learning method. In this paper, we developed an algorithm based on Dynamic Clustering of projected frame histograms approach to address the challenge of video summarization using unsupervised learning. We have tested the performance of the approach on the VSUMM, a benchmark dataset and showcased that using dynamic clustering algorithm has been proven to perform competitively with some existing approaches.