{"title":"Transformed Deep Spatio Temporal-Features with Fused Distance for Efficient Video Retrieval","authors":"A. Banerjee, Ela Kumar, Ravinder M","doi":"10.1109/AIST55798.2022.10064821","DOIUrl":null,"url":null,"abstract":"For the goal of video retrieval, this research proposes wavelet transformations on deep spatiotemporal characteristics. The component-wise similarities between the query video feature and prototype video feature are calculated because level 1 wavelets extract two components from any signal or feature vector. The ultimate dissimilarity for determining the top 1 and top 5 accuracy is created by fusing these differences. The outcomes demonstrate that the suggested technique performs better than a baseline strategy. The following strategy for improvement can be investigated further by employing fast learning networks that are trained on the training sets of both data sets to provide better classification of the query as well as the prototype feature vectors, which would enhance the retrieval accuracy.","PeriodicalId":360351,"journal":{"name":"2022 4th International Conference on Artificial Intelligence and Speech Technology (AIST)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 4th International Conference on Artificial Intelligence and Speech Technology (AIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIST55798.2022.10064821","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
For the goal of video retrieval, this research proposes wavelet transformations on deep spatiotemporal characteristics. The component-wise similarities between the query video feature and prototype video feature are calculated because level 1 wavelets extract two components from any signal or feature vector. The ultimate dissimilarity for determining the top 1 and top 5 accuracy is created by fusing these differences. The outcomes demonstrate that the suggested technique performs better than a baseline strategy. The following strategy for improvement can be investigated further by employing fast learning networks that are trained on the training sets of both data sets to provide better classification of the query as well as the prototype feature vectors, which would enhance the retrieval accuracy.