{"title":"软注意神经图像字幕中嵌入通道激活的改进","authors":"Yanke Li","doi":"10.1145/3271553.3271592","DOIUrl":null,"url":null,"abstract":"The paper dives into the topic of image captioning with the soft attention algorithm. We first review relevant works on the captioned topic in terms of background introduction and then explains the original model in details. On top of the plain soft attention model, we propose two approaches for further improvements: SE attention model which adds an extra channel-wise activation layer, and bi-directional attention model that explores two-way attention order feasibility. We implement both methods under limited experiment conditions and in addition swap the original encoder with state-of-art structure. Quantitative results and example demonstrations show that our proposed methods have achieved better performance than baselines. In the end, some suggestions of future work on top of proposed are summarized for a purpose of completeness.","PeriodicalId":414782,"journal":{"name":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","volume":"227 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improvement of Embedding Channel-Wise Activation in Soft-Attention Neural Image Captioning\",\"authors\":\"Yanke Li\",\"doi\":\"10.1145/3271553.3271592\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The paper dives into the topic of image captioning with the soft attention algorithm. We first review relevant works on the captioned topic in terms of background introduction and then explains the original model in details. On top of the plain soft attention model, we propose two approaches for further improvements: SE attention model which adds an extra channel-wise activation layer, and bi-directional attention model that explores two-way attention order feasibility. We implement both methods under limited experiment conditions and in addition swap the original encoder with state-of-art structure. Quantitative results and example demonstrations show that our proposed methods have achieved better performance than baselines. 
In the end, some suggestions of future work on top of proposed are summarized for a purpose of completeness.\",\"PeriodicalId\":414782,\"journal\":{\"name\":\"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing\",\"volume\":\"227 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3271553.3271592\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Vision, Image and Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3271553.3271592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improvement of Embedding Channel-Wise Activation in Soft-Attention Neural Image Captioning
This paper addresses image captioning with the soft-attention algorithm. We first review relevant prior work as background and then explain the original model in detail. On top of the plain soft-attention model, we propose two approaches for further improvement: an SE attention model, which adds an extra channel-wise activation layer, and a bi-directional attention model, which explores the feasibility of a two-way attention order. We implement both methods under limited experimental conditions and additionally swap the original encoder for a state-of-the-art architecture. Quantitative results and example demonstrations show that the proposed methods outperform the baselines. Finally, suggestions for future work building on the proposed methods are summarized for completeness.
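To illustrate the channel-wise activation the abstract refers to, below is a minimal PyTorch sketch of a Squeeze-and-Excitation style gate applied to CNN encoder features, as might sit in front of a soft-attention layer. The class name SEChannelGate, the reduction ratio, and the gate's exact placement in the captioning pipeline are assumptions; the abstract does not give implementation details.

```python
import torch
import torch.nn as nn


class SEChannelGate(nn.Module):
    """Squeeze-and-Excitation style channel-wise activation (Hu et al., 2018).

    Hypothetical sketch: the reduction ratio and where the gate sits
    relative to the soft-attention layer are not specified in the abstract.
    """

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),  # squeeze
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),  # excite
            nn.Sigmoid(),                                # per-channel gates in (0, 1)
        )

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        # features: (batch, channels, height, width) CNN encoder activations
        b, c, _, _ = features.shape
        squeezed = features.mean(dim=(2, 3))        # global average pool -> (b, c)
        gates = self.fc(squeezed).view(b, c, 1, 1)  # channel-wise weights
        return features * gates                     # re-scale each channel


if __name__ == "__main__":
    gate = SEChannelGate(channels=512)
    feats = torch.randn(2, 512, 14, 14)  # e.g. encoder output for two images
    print(gate(feats).shape)             # torch.Size([2, 512, 14, 14])
```

The gated features keep the shape of the encoder output, so a gate like this can be dropped between the CNN encoder and the spatial soft-attention step without changing the rest of the pipeline.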