Efficient Content Based Video Retrieval System by Applying AlexNet on Key Frames

IF 1.7 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

ADCAIJ-Advances in Distributed Computing and Artificial Intelligence Journal Pub Date : 2022-10-21 DOI:10.14201/adcaij.27430

Altaf Hussain, Mehtab Ahmad, Tariq Hussain, Ijaz Ullah

{"title":"Efficient Content Based Video Retrieval System by Applying AlexNet on Key Frames","authors":"Altaf Hussain, Mehtab Ahmad, Tariq Hussain, Ijaz Ullah","doi":"10.14201/adcaij.27430","DOIUrl":null,"url":null,"abstract":"The video retrieval system refers to the task of retrieving the most relevant video collection, given a user query. By applying some feature extraction models the contents of the video can be extracted. With the exponential increase in video data in online and offline databases as well as a huge implementation of multiple applications in health, military, social media, and art, the Content-Based Video Retrieval (CBVR) system has emerged. The CBVR system takes the inner contents of the video frame and analyses features of each frame, through which similar videos are retrieved from the database. However, searching and retrieving the same clips from huge video collection is a hard job because of the presence of complex properties of visual data. Video clips have many frames and every frame has multiple properties that have many visual properties like color, shape, and texture. In this research, an efficient content-based video retrieval system using the AlexNet model of Convolutional Neural Network (CNN) on the keyframes system has been proposed. Firstly, select the keyframes from the video. Secondly, the color histogram is then calculated. Then the features of the color histogram are compared and analyzed for CBVR. The proposed system is based on the AlexNet model of CNN and color histogram, and extracted features from the frames are together to store in the feature vector. From MATLAB simulation results, the proposed method has been evaluated on benchmark dataset UCF101 which has 13320 videos from 101 action categories. The experiments of our system give a better performance as compared to the other state-of-the-art techniques. In contrast to the existing work, the proposed video retrieval system has shown a dramatic and outstanding performance by using accuracy and loss as performance evaluation parameters.","PeriodicalId":42597,"journal":{"name":"ADCAIJ-Advances in Distributed Computing and Artificial Intelligence Journal","volume":"129 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ADCAIJ-Advances in Distributed Computing and Artificial Intelligence Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14201/adcaij.27430","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The video retrieval system refers to the task of retrieving the most relevant video collection, given a user query. By applying some feature extraction models the contents of the video can be extracted. With the exponential increase in video data in online and offline databases as well as a huge implementation of multiple applications in health, military, social media, and art, the Content-Based Video Retrieval (CBVR) system has emerged. The CBVR system takes the inner contents of the video frame and analyses features of each frame, through which similar videos are retrieved from the database. However, searching and retrieving the same clips from huge video collection is a hard job because of the presence of complex properties of visual data. Video clips have many frames and every frame has multiple properties that have many visual properties like color, shape, and texture. In this research, an efficient content-based video retrieval system using the AlexNet model of Convolutional Neural Network (CNN) on the keyframes system has been proposed. Firstly, select the keyframes from the video. Secondly, the color histogram is then calculated. Then the features of the color histogram are compared and analyzed for CBVR. The proposed system is based on the AlexNet model of CNN and color histogram, and extracted features from the frames are together to store in the feature vector. From MATLAB simulation results, the proposed method has been evaluated on benchmark dataset UCF101 which has 13320 videos from 101 action categories. The experiments of our system give a better performance as compared to the other state-of-the-art techniques. In contrast to the existing work, the proposed video retrieval system has shown a dramatic and outstanding performance by using accuracy and loss as performance evaluation parameters.

查看原文本刊更多论文

基于关键帧AlexNet的高效视频检索系统

视频检索系统是指在给定用户查询的情况下，检索最相关的视频集合。通过应用一些特征提取模型，可以提取视频的内容。随着在线和离线数据库中视频数据的指数级增长，以及在医疗、军事、社交媒体和艺术等领域的广泛应用，基于内容的视频检索(CBVR)系统应运而生。CBVR系统获取视频帧的内部内容，分析每一帧的特征，通过这些特征从数据库中检索出相似的视频。然而，由于视觉数据的复杂属性，从海量的视频集合中搜索和检索相同的片段是一项艰巨的工作。视频剪辑有很多帧，每一帧都有多个属性，这些属性有很多视觉属性，比如颜色、形状和纹理。本文提出了一种基于关键帧系统的基于卷积神经网络(CNN)的AlexNet模型的高效视频检索系统。首先，从视频中选择关键帧。其次，计算颜色直方图。然后对CBVR的颜色直方图特征进行了比较和分析。该系统基于CNN的AlexNet模型和颜色直方图，并将从帧中提取的特征放在一起存储在特征向量中。MATLAB仿真结果表明，该方法已在基准数据集UCF101上进行了评估，该数据集包含来自101个动作类别的13320个视频。实验结果表明，与其他先进技术相比，我们的系统具有更好的性能。与已有的工作相比，本文提出的视频检索系统以精度和损失为性能评价参数，表现出了惊人的优异性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊