Design and Implementation of Multimodal Video Retrieval System

2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA) Pub Date : 2021-09-01 DOI:10.1109/ICDSBA53075.2021.00036

Bin Qi

引用次数: 0

Abstract

With the application of deep learning in various fields, it replaces the extraction of video by manual design. Through the analysis and research of modal video retrieval methods, this paper aims to use text, image or video to carry out different ways of video retrieval through multi-modal video retrieval, and strive to meet the needs of different scenes, different users of video retrieval, and maximize the accuracy and effectiveness of video retrieval. This paper designs and implements a deep learning-based multimodal video retrieval system based on Windows. The system is based on the design concept of program modularization, using MySQL as the database and PyQT4 as the development tool of system interface. It mainly realizes three functional modules of video retrieval based on text, video retrieval based on image and video retrieval based on video segment.

查看原文本刊更多论文

多模式视频检索系统的设计与实现

随着深度学习在各个领域的应用，它取代了手工设计的视频提取。通过对模态视频检索方法的分析和研究，本文旨在通过多模态视频检索，利用文本、图像或视频进行不同方式的视频检索，力求满足不同场景、不同用户对视频检索的需求，最大限度地提高视频检索的准确性和有效性。本文设计并实现了一个基于Windows的基于深度学习的多模式视频检索系统。本系统基于程序模块化的设计理念，采用MySQL作为数据库，PyQT4作为系统接口的开发工具。主要实现了基于文本的视频检索、基于图像的视频检索和基于视频片段的视频检索三个功能模块。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 5th Annual International Conference on Data Science and Business Analytics (ICDSBA)

自引率

0.00%

发文量