CNN-LSTM based Social Media Post Caption Generator

Anand Singh, Dinesh Vij
{"title":"CNN-LSTM based Social Media Post Caption Generator","authors":"Anand Singh, Dinesh Vij","doi":"10.1109/iciptm54933.2022.9754189","DOIUrl":null,"url":null,"abstract":"In artificial intelligence automatically captioning an image is a challenging problem that brings computer vision and natural language processing together, but the problem becomes more challenging when artificial intelligence has to caption an Instagram post. This is challenging because the caption of an Instagram post is not a simple description of the image but it contains some abstract features like jokes, sarcasm, references, etc. There has been a lot of work on image captioning but there is hard work on captioning social media posts. So, I propose a deep learning model that uses an amalgamation of Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN). The first major task was to get a dataset but there is no such popular dataset available. The option to scrap Instagram was explored but turns out scraping Instagram is prohibited, luckily, I was able to get a dataset available on Kaggle containing 35,000 records but due to computation limitations, I used 5000 images and captions for training. Eventually, the trained model was not very accurate due to fewer numbers of epochs i.e 20, small training dataset and majorly because of the low-quality dataset. Even though the model did not show better results but the methodology to generate captions for Instagram posts was established.","PeriodicalId":6810,"journal":{"name":"2022 2nd International Conference on Innovative Practices in Technology and Management (ICIPTM)","volume":"50 1","pages":"205-209"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 2nd International Conference on Innovative Practices in Technology and Management (ICIPTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iciptm54933.2022.9754189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In artificial intelligence automatically captioning an image is a challenging problem that brings computer vision and natural language processing together, but the problem becomes more challenging when artificial intelligence has to caption an Instagram post. This is challenging because the caption of an Instagram post is not a simple description of the image but it contains some abstract features like jokes, sarcasm, references, etc. There has been a lot of work on image captioning but there is hard work on captioning social media posts. So, I propose a deep learning model that uses an amalgamation of Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) Recurrent Neural Network (RNN). The first major task was to get a dataset but there is no such popular dataset available. The option to scrap Instagram was explored but turns out scraping Instagram is prohibited, luckily, I was able to get a dataset available on Kaggle containing 35,000 records but due to computation limitations, I used 5000 images and captions for training. Eventually, the trained model was not very accurate due to fewer numbers of epochs i.e 20, small training dataset and majorly because of the low-quality dataset. Even though the model did not show better results but the methodology to generate captions for Instagram posts was established.
基于CNN-LSTM的社交媒体帖子标题生成器
在人工智能中,自动为图像配上文字是一个具有挑战性的问题,它将计算机视觉和自然语言处理结合在一起,但当人工智能必须为Instagram帖子配上文字时,这个问题就变得更具挑战性了。这是一个挑战,因为Instagram帖子的标题不是对图片的简单描述,而是包含一些抽象的特征,如笑话、讽刺、引用等。在图片标题上已经做了很多工作,但在社交媒体帖子上做标题却很困难。因此,我提出了一个深度学习模型,该模型使用卷积神经网络(CNN)和长短期记忆(LSTM)循环神经网络(RNN)的融合。第一个主要任务是获得一个数据集,但没有这样流行的数据集可用。我们探索了废弃Instagram的选项,但结果发现禁止抓取Instagram,幸运的是,我能够在Kaggle上获得包含35,000条记录的数据集,但由于计算限制,我使用了5000张图像和说明文字进行训练。最终,由于epoch数较少(即20个),训练数据集较小,主要是因为数据集质量较低,训练模型不是很准确。尽管该模型没有显示出更好的结果,但建立了为Instagram帖子生成标题的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信