{"title":"基于灰色关联分析的图像和视频字幕评价指标","authors":"Miao Ma, Bolong Wang","doi":"10.1109/GSIS.2017.8077673","DOIUrl":null,"url":null,"abstract":"Aiming at the performance evaluation on image captioning and video captioning, this paper discusses the existing performance metrics and then suggests a novel overall performance metric based on grey relational analysis of Grey System Theory. In our metric, all the available performance metrics of each captioning model is used to extract a comparative sequence. Meanwhile, a reference sequence is constructed by combining all the available performance metrics involved. Then an overall metric is obtained by computing the grey relational degrees between the two kinds of sequences. Experimental results on the most widely-used image dataset and video dataset show that the proposed metric is fast and effective that helps for spurring improvements and measuring progress in the state of the art of image captioning and video captioning.","PeriodicalId":425920,"journal":{"name":"2017 International Conference on Grey Systems and Intelligent Services (GSIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A grey relational analysis based evaluation metric for image captioning and video captioning\",\"authors\":\"Miao Ma, Bolong Wang\",\"doi\":\"10.1109/GSIS.2017.8077673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the performance evaluation on image captioning and video captioning, this paper discusses the existing performance metrics and then suggests a novel overall performance metric based on grey relational analysis of Grey System Theory. In our metric, all the available performance metrics of each captioning model is used to extract a comparative sequence. Meanwhile, a reference sequence is constructed by combining all the available performance metrics involved. Then an overall metric is obtained by computing the grey relational degrees between the two kinds of sequences. Experimental results on the most widely-used image dataset and video dataset show that the proposed metric is fast and effective that helps for spurring improvements and measuring progress in the state of the art of image captioning and video captioning.\",\"PeriodicalId\":425920,\"journal\":{\"name\":\"2017 International Conference on Grey Systems and Intelligent Services (GSIS)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Grey Systems and Intelligent Services (GSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GSIS.2017.8077673\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Grey Systems and Intelligent Services (GSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GSIS.2017.8077673","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A grey relational analysis based evaluation metric for image captioning and video captioning
Aiming at the performance evaluation on image captioning and video captioning, this paper discusses the existing performance metrics and then suggests a novel overall performance metric based on grey relational analysis of Grey System Theory. In our metric, all the available performance metrics of each captioning model is used to extract a comparative sequence. Meanwhile, a reference sequence is constructed by combining all the available performance metrics involved. Then an overall metric is obtained by computing the grey relational degrees between the two kinds of sequences. Experimental results on the most widely-used image dataset and video dataset show that the proposed metric is fast and effective that helps for spurring improvements and measuring progress in the state of the art of image captioning and video captioning.