{"title":"Comparison and Evaluation of Video Retrieval Approaches Using Query Sentences","authors":"K. Ueki, Takayuki Hori","doi":"10.1145/3399637.3399657","DOIUrl":null,"url":null,"abstract":"Following are two mainstream approaches of video retrieval from large-scale video data using query sentences: (1) an approach to find pre-trained concepts such as objects, persons, scenes, and activities corresponding to a query sentence, and (2) an approach to map a query sentence and images/videos into the same feature space and directly search for images/videos that match the query sentence. In this study, we analyze the advantages and disadvantages of these two approaches using a large-scale video database of TRECVID benchmark and confirm whether the fusion of these approaches can improve video retrieval performance.","PeriodicalId":248664,"journal":{"name":"Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 2nd International Conference on Intelligent Medicine and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3399637.3399657","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Following are two mainstream approaches of video retrieval from large-scale video data using query sentences: (1) an approach to find pre-trained concepts such as objects, persons, scenes, and activities corresponding to a query sentence, and (2) an approach to map a query sentence and images/videos into the same feature space and directly search for images/videos that match the query sentence. In this study, we analyze the advantages and disadvantages of these two approaches using a large-scale video database of TRECVID benchmark and confirm whether the fusion of these approaches can improve video retrieval performance.