Detection and Recognition of Object for Image Captioning

ASIAN JOURNAL OF CONVERGENCE IN TECHNOLOGY Pub Date : 2021-08-18 DOI:10.33130/ajct.2021v07i02.009

Prashant Yadav, Vishal Vishwakarma, Aakash Tiwari

引用次数: 0

Abstract

As one of the most intelligent beings on the planet, we are equipped with the most powerful visual and language system as it is easy for us to extract visual information from a given image and transform it into proper linguistic description. Image Caption Generator deals with generating captions for a given image. The capturing mechanism involves a tiring task that collaborates both computer vision and image processing. The mechanism must detect and establish relationships between objects, people, and animals. The aim of this paper is to detect, recognize and generate worthwhile captions for a given image. We use transfer learning CNN on sentences, and extract image representation with Neural Networks.

查看原文本刊更多论文

图像标题中目标的检测与识别

作为地球上最聪明的生物之一，我们拥有最强大的视觉和语言系统，因为我们很容易从给定的图像中提取视觉信息并将其转化为适当的语言描述。Image Caption Generator处理为给定图像生成标题。捕获机制涉及一项累人的任务，它需要计算机视觉和图像处理两方面的协作。该机制必须检测并建立物体、人和动物之间的关系。本文的目的是检测、识别和生成给定图像的有价值的标题。我们在句子上使用迁移学习CNN，并使用神经网络提取图像表示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ASIAN JOURNAL OF CONVERGENCE IN TECHNOLOGY

自引率

0.00%

发文量