Detection and Recognition of Object for Image Captioning
Authors: Prashant Yadav, Vishal Vishwakarma, Aakash Tiwari
Journal: ASIAN JOURNAL OF CONVERGENCE IN TECHNOLOGY
Publication type: Journal Article
Published: 2021-08-18
DOI: 10.33130/ajct.2021v07i02.009 (https://doi.org/10.33130/ajct.2021v07i02.009)
Citation count: 0
Abstract
Humans have powerful visual and language systems: we can easily extract visual information from an image and turn it into a fitting linguistic description. An image caption generator produces captions for a given image. Captioning is a demanding task that combines computer vision and image processing. The system must detect objects, people, and animals and establish the relationships between them. The aim of this paper is to detect and recognize the objects in a given image and to generate meaningful captions for it. We apply a CNN with transfer learning to sentences, and extract the image representation with neural networks.