M. Sailaja, K. Harika, B. Sridhar, Rajan Singh, V. Charitha, Koppula Srinivas Rao
{"title":"Image Caption Generator using Deep Learning","authors":"M. Sailaja, K. Harika, B. Sridhar, Rajan Singh, V. Charitha, Koppula Srinivas Rao","doi":"10.1109/ASSIC55218.2022.10088345","DOIUrl":null,"url":null,"abstract":"Over the last few years deep neural network made image captioning conceivable. Image caption generator provides an appropriate title for an applied input image based on the dataset. The present work proposes a model based on deep learning and utilizes it to generate caption for the input image. The model takes an image as input and frame the sentence related to the given input image by using some algorithms like CNN and LSTM. This CNN model is used to identify the objects that are present in the image and Long Short-Term Memory (LSTM) model will not only generate the sentence but summarize the text and generate the caption that is suitable for the project. So, the proposed model mainly focuses on identify the objects and generating the most appropriate title for the input images.","PeriodicalId":441406,"journal":{"name":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancements in Smart, Secure and Intelligent Computing (ASSIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASSIC55218.2022.10088345","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Over the last few years deep neural network made image captioning conceivable. Image caption generator provides an appropriate title for an applied input image based on the dataset. The present work proposes a model based on deep learning and utilizes it to generate caption for the input image. The model takes an image as input and frame the sentence related to the given input image by using some algorithms like CNN and LSTM. This CNN model is used to identify the objects that are present in the image and Long Short-Term Memory (LSTM) model will not only generate the sentence but summarize the text and generate the caption that is suitable for the project. So, the proposed model mainly focuses on identify the objects and generating the most appropriate title for the input images.