TasvirEt: A benchmark dataset for automatic Turkish description generation from images

2016 24th Signal Processing and Communication Application Conference (SIU) Pub Date : 2016-05-16 DOI:10.1109/SIU.2016.7496155

Mesut Erhan Unal, Begum Citamak, Semih Yagcioglu, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Ruken Cakici

引用次数: 25

Abstract

Automatically describing images with natural sentences is considered to be a challenging research problem that has recently been explored. Although the number of methods proposed to solve this problem increases over time, since the datasets used commonly in this field contain only English descriptions, the studies have mostly been limited to single language, namely English. In this study, for the first time in the literature, a new dataset is proposed which enables generating Turkish descriptions from images, which can be used as a benchmark for this purpose. Furthermore, two approaches are proposed, again for the first time in the literature, for image captioning in Turkish with the dataset we named as TasvirEt. Our findings indicate that the new Turkish dataset and the approaches used here can be successfully used for automatically describing images in Turkish.

查看原文本刊更多论文

TasvirEt:从图像中自动生成土耳其语描述的基准数据集

用自然语句自动描述图像被认为是一个具有挑战性的研究问题，近年来一直在探索。尽管随着时间的推移，提出的解决这一问题的方法越来越多，但由于该领域常用的数据集仅包含英语描述，因此研究大多局限于单一语言，即英语。在这项研究中，在文献中首次提出了一个新的数据集，可以从图像中生成土耳其语描述，这可以用作此目的的基准。此外，在文献中首次提出了两种方法，用于使用我们命名为TasvirEt的数据集进行土耳其语图像字幕。我们的研究结果表明，新的土耳其语数据集和这里使用的方法可以成功地用于自动描述土耳其语图像。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 24th Signal Processing and Communication Application Conference (SIU)

自引率

0.00%

发文量