Semantic Video Concept Detection using Novel Mixed-Hybrid-Fusion Approach for Multi-Label Data

Q4 Computer Science

Electronic Letters on Computer Vision and Image Analysis Pub Date : 2017-10-31 DOI:10.5565/REV/ELCVIA.927

N. Janwe, K. Bhoyar

{"title":"Semantic Video Concept Detection using Novel Mixed-Hybrid-Fusion Approach for Multi-Label Data","authors":"N. Janwe, K. Bhoyar","doi":"10.5565/REV/ELCVIA.927","DOIUrl":null,"url":null,"abstract":"The performance of the semantic concept detection method depends on, the selection of the low-level visual features used to represent key-frames of a shot and the selection of the feature-fusion method used. This paper proposes a set of low-level visual features of considerably smaller size and also proposes novel ‘hybrid-fusion’ and ‘mixed-hybrid-fusion’, approaches which are formulated by combining early and late-fusion strategies proposed in the literature. In the initially proposed hybrid-fusion approach, the features from the same feature group are combined using early-fusion before classifier training; and the concept probability scores from multiple classifiers are merged using late-fusion approach to get final detection scores. A feature group is defined as the features from the same feature family such as color moment. The hybrid-fusion approach is refined and the “mixed-hybrid-fusion” approach is proposed to further improve detection rate. This paper presents a novel video concept detection system for multi-label data using a proposed mixed-hybrid-fusion approach. Support Vector Machine (SVM) is used to build classifiers that produce concept probabilities for a test frame. The proposed approaches are evaluated on multi-label TRECVID2007 development dataset. Experimental results show that, the proposed mixed-hybrid-fusion approach performs better than other proposed hybrid-fusion approach and outperforms all conventional early-fusion and late-fusion approaches by large margins with respect to feature set dimensionality and Mean Average Precision (MAP) values.","PeriodicalId":38711,"journal":{"name":"Electronic Letters on Computer Vision and Image Analysis","volume":"10 1","pages":"14-29"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Electronic Letters on Computer Vision and Image Analysis","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5565/REV/ELCVIA.927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 3

Abstract

The performance of the semantic concept detection method depends on, the selection of the low-level visual features used to represent key-frames of a shot and the selection of the feature-fusion method used. This paper proposes a set of low-level visual features of considerably smaller size and also proposes novel ‘hybrid-fusion’ and ‘mixed-hybrid-fusion’, approaches which are formulated by combining early and late-fusion strategies proposed in the literature. In the initially proposed hybrid-fusion approach, the features from the same feature group are combined using early-fusion before classifier training; and the concept probability scores from multiple classifiers are merged using late-fusion approach to get final detection scores. A feature group is defined as the features from the same feature family such as color moment. The hybrid-fusion approach is refined and the “mixed-hybrid-fusion” approach is proposed to further improve detection rate. This paper presents a novel video concept detection system for multi-label data using a proposed mixed-hybrid-fusion approach. Support Vector Machine (SVM) is used to build classifiers that produce concept probabilities for a test frame. The proposed approaches are evaluated on multi-label TRECVID2007 development dataset. Experimental results show that, the proposed mixed-hybrid-fusion approach performs better than other proposed hybrid-fusion approach and outperforms all conventional early-fusion and late-fusion approaches by large margins with respect to feature set dimensionality and Mean Average Precision (MAP) values.

查看原文本刊更多论文

基于混合-混合融合的多标签数据语义视频概念检测

语义概念检测方法的性能取决于用于表示镜头关键帧的低级视觉特征的选择以及所使用的特征融合方法的选择。本文提出了一组小得多的低级视觉特征，并提出了新的“混合融合”和“混合混合融合”方法，这些方法是通过结合文献中提出的早期和晚期融合策略制定的。在最初提出的混合融合方法中，在分类器训练之前使用早期融合将同一特征组的特征组合在一起;采用后期融合方法对多个分类器的概念概率分数进行融合，得到最终的检测分数。特征组被定义为来自相同特征族的特征，如颜色矩。对混合融合方法进行了改进，提出了“混合混合融合”方法，进一步提高了检测率。本文提出了一种基于混合融合的多标签数据视频概念检测系统。支持向量机(SVM)用于构建分类器，生成测试帧的概念概率。在多标签TRECVID2007开发数据集上对所提出的方法进行了评估。实验结果表明，所提出的混合融合方法在特征集维数和平均精度(MAP)值方面优于其他混合融合方法，并且在很大程度上优于所有传统的早期融合和后期融合方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊