Jayanta Paul , Anuska Roy , Abhijit Mitra, Jaya Sil
{"title":"HyV-Summ: Social media video summarization on custom dataset using hybrid techniques","authors":"Jayanta Paul , Anuska Roy , Abhijit Mitra, Jaya Sil","doi":"10.1016/j.neucom.2024.128852","DOIUrl":null,"url":null,"abstract":"<div><div>The proliferation of social networking platforms such as YouTube, Facebook, Instagram, and X has led to an exponential growth in multimedia content, with billions of videos uploaded every hour. Efficient management of such vast amount of data necessitates advanced summarization techniques in order to eliminate irrelevant and redundant information. A summarized video, containing the most distinct frames or key frames, provides a concise representation of the original content. Existing deep learning and non-deep learning techniques for video summarization have certain limitations. Deep learning methods are complex and resource-intensive, while non-deep learning algorithms often fail to extract informative features from vast social media videos. This paper addresses the issue by proposing a novel hybrid technique, named Hybrid Video Summarization (<strong>HyV-Summ</strong>), which integrates deep and non-deep learning techniques to leverage their respective strengths by focusing only on social media content. We developed a custom dataset, <strong>SocialSum</strong> to train our proposed model <strong>HyV-Summ</strong>, since existing benchmark datasets like TVSum and SumMe contain diverse types of content not specific to social media videos. We provide a comparative analysis of existing techniques and datasets with our proposed techniques and dataset. The results demonstrate that HyV-Summ outperforms existing techniques, such as Long Short Term Memory (LSTM)-based and Generative Adversarial Network (GAN)-based summarization by achieving higher F1-scores while applied on both the SocialSum dataset and available datasets.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"614 ","pages":"Article 128852"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224016230","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The proliferation of social networking platforms such as YouTube, Facebook, Instagram, and X has led to an exponential growth in multimedia content, with billions of videos uploaded every hour. Efficient management of such vast amount of data necessitates advanced summarization techniques in order to eliminate irrelevant and redundant information. A summarized video, containing the most distinct frames or key frames, provides a concise representation of the original content. Existing deep learning and non-deep learning techniques for video summarization have certain limitations. Deep learning methods are complex and resource-intensive, while non-deep learning algorithms often fail to extract informative features from vast social media videos. This paper addresses the issue by proposing a novel hybrid technique, named Hybrid Video Summarization (HyV-Summ), which integrates deep and non-deep learning techniques to leverage their respective strengths by focusing only on social media content. We developed a custom dataset, SocialSum to train our proposed model HyV-Summ, since existing benchmark datasets like TVSum and SumMe contain diverse types of content not specific to social media videos. We provide a comparative analysis of existing techniques and datasets with our proposed techniques and dataset. The results demonstrate that HyV-Summ outperforms existing techniques, such as Long Short Term Memory (LSTM)-based and Generative Adversarial Network (GAN)-based summarization by achieving higher F1-scores while applied on both the SocialSum dataset and available datasets.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.