A Late Fusion Framework with Multiple Optimization Methods for Media Interestingness

Conference and Labs of the Evaluation Forum Pub Date : 2022-07-11 DOI:10.48550/arXiv.2207.04762

M. Shoukat, Khubaib Ahmad, Naina Said, Nasir Ahmad, Mohammed Hasanuzzaman, Kashif Ahmad

{"title":"A Late Fusion Framework with Multiple Optimization Methods for Media Interestingness","authors":"M. Shoukat, Khubaib Ahmad, Naina Said, Nasir Ahmad, Mohammed Hasanuzzaman, Kashif Ahmad","doi":"10.48550/arXiv.2207.04762","DOIUrl":null,"url":null,"abstract":"The recent advancement in Multimedia Analytical, Computer Vision (CV), and Artificial Intelligence (AI) algorithms resulted in several interesting tools allowing an automatic analysis and retrieval of multimedia content of users' interests. However, retrieving the content of interest generally involves analysis and extraction of semantic features, such as emotions and interestingness-level. The extraction of such meaningful information is a complex task and generally, the performance of individual algorithms is very low. One way to enhance the performance of the individual algorithms is to combine the predictive capabilities of multiple algorithms using fusion schemes. This allows the individual algorithms to complement each other, leading to improved performance. This paper proposes several fusion methods for the media interestingness score prediction task introduced in CLEF Fusion 2022. The proposed methods include both a naive fusion scheme, where all the inducers are treated equally and a merit-based fusion scheme where multiple weight optimization methods are employed to assign weights to the individual inducers. In total, we used six optimization methods including a Particle Swarm Optimization (PSO), a Genetic Algorithm (GA), Nelder Mead, Trust Region Constrained (TRC), and Limited-memory Broyden Fletcher Goldfarb Shanno Algorithm (LBFGSA), and Truncated Newton Algorithm (TNA). Overall better results are obtained with PSO and TNA achieving 0.109 mean average precision at 10. The task is complex and generally, scores are low. We believe the presented analysis will provide a baseline for future research in the domain.","PeriodicalId":232729,"journal":{"name":"Conference and Labs of the Evaluation Forum","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference and Labs of the Evaluation Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2207.04762","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The recent advancement in Multimedia Analytical, Computer Vision (CV), and Artificial Intelligence (AI) algorithms resulted in several interesting tools allowing an automatic analysis and retrieval of multimedia content of users' interests. However, retrieving the content of interest generally involves analysis and extraction of semantic features, such as emotions and interestingness-level. The extraction of such meaningful information is a complex task and generally, the performance of individual algorithms is very low. One way to enhance the performance of the individual algorithms is to combine the predictive capabilities of multiple algorithms using fusion schemes. This allows the individual algorithms to complement each other, leading to improved performance. This paper proposes several fusion methods for the media interestingness score prediction task introduced in CLEF Fusion 2022. The proposed methods include both a naive fusion scheme, where all the inducers are treated equally and a merit-based fusion scheme where multiple weight optimization methods are employed to assign weights to the individual inducers. In total, we used six optimization methods including a Particle Swarm Optimization (PSO), a Genetic Algorithm (GA), Nelder Mead, Trust Region Constrained (TRC), and Limited-memory Broyden Fletcher Goldfarb Shanno Algorithm (LBFGSA), and Truncated Newton Algorithm (TNA). Overall better results are obtained with PSO and TNA achieving 0.109 mean average precision at 10. The task is complex and generally, scores are low. We believe the presented analysis will provide a baseline for future research in the domain.

查看原文本刊更多论文

基于多优化方法的媒体兴趣度后期融合框架

多媒体分析、计算机视觉(CV)和人工智能(AI)算法的最新进展产生了一些有趣的工具，可以自动分析和检索用户感兴趣的多媒体内容。然而，检索感兴趣的内容通常涉及情感和兴趣水平等语义特征的分析和提取。这种有意义的信息的提取是一项复杂的任务，一般来说，单个算法的性能很低。提高单个算法性能的一种方法是使用融合方案将多个算法的预测能力结合起来。这使得各个算法可以相互补充，从而提高性能。针对CLEF fusion 2022中引入的媒体兴趣度评分预测任务，提出了几种融合方法。所提出的方法既包括朴素融合方案，其中所有诱导因子被平等对待，也包括基于优点的融合方案，其中采用多种权重优化方法为单个诱导因子分配权重。我们总共使用了6种优化方法，包括粒子群优化(PSO)、遗传算法(GA)、Nelder Mead、信任域约束(TRC)和有限内存Broyden Fletcher Goldfarb Shanno算法(LBFGSA)和截断牛顿算法(TNA)。总体而言，PSO和TNA获得了更好的结果，在10时平均精度为0.109。这个任务很复杂，一般来说分数都很低。我们相信所提出的分析将为该领域的未来研究提供基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference and Labs of the Evaluation Forum

自引率

0.00%

发文量