基于目标领域语义约束的中文图像描述评价方法

International Conference on Image Processing and Intelligent Control Pub Date : 2023-08-09 DOI:10.1117/12.3000808

Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang

{"title":"基于目标领域语义约束的中文图像描述评价方法","authors":"Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang","doi":"10.1117/12.3000808","DOIUrl":null,"url":null,"abstract":"To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.","PeriodicalId":210802,"journal":{"name":"International Conference on Image Processing and Intelligent Control","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chinese image description evaluation method based on target domain semantic constraints\",\"authors\":\"Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang\",\"doi\":\"10.1117/12.3000808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.\",\"PeriodicalId\":210802,\"journal\":{\"name\":\"International Conference on Image Processing and Intelligent Control\",\"volume\":\"68 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Image Processing and Intelligent Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.3000808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Image Processing and Intelligent Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3000808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

针对当前中文图像描述领域准确度不足、应用难度大的问题，提出了一种基于目标域语义约束的评价方法。与以往的研究不同，该方法作用于模型的输出阶段，基于目标应用领域关键语义的提取，不受该领域宏观语义空间或引入其他视觉任务外部语义信息的约束。实验表明，该方法有效地提高了模型输出描述句子与目标域输入图像之间的语义一致性，有助于特定领域图像描述的实际应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Chinese image description evaluation method based on target domain semantic constraints

To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Image Processing and Intelligent Control

自引率

0.00%

发文量