{"title":"基于目标领域语义约束的中文图像描述评价方法","authors":"Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang","doi":"10.1117/12.3000808","DOIUrl":null,"url":null,"abstract":"To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.","PeriodicalId":210802,"journal":{"name":"International Conference on Image Processing and Intelligent Control","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chinese image description evaluation method based on target domain semantic constraints\",\"authors\":\"Zhenhao Wang, Wenyi Sun, Zhengsong Wang, Le Yang\",\"doi\":\"10.1117/12.3000808\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.\",\"PeriodicalId\":210802,\"journal\":{\"name\":\"International Conference on Image Processing and Intelligent Control\",\"volume\":\"68 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Image Processing and Intelligent Control\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.3000808\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Image Processing and Intelligent Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3000808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Chinese image description evaluation method based on target domain semantic constraints
To address the problems of insufficient accuracy and difficulty of application in the current Chinese image description field, this paper proposes an evaluation method based on semantic constraints in the target domain. Unlike previous research, this method acts on the output stage of the model, and based on the extraction of key semantics in the target application domain, it is constrained by the macroscopic semantic space of that domain or by introducing external semantic information from other visual tasks. The experiments show that the proposed method effectively improves the semantic coherence between the model output description sentences and the input images in the target domain, and is helpful for the practical application of image description in specific domains.