信息性推文的分类和摘要

S. Roy, Sumit Mishra, Rakesh Matam
{"title":"信息性推文的分类和摘要","authors":"S. Roy, Sumit Mishra, Rakesh Matam","doi":"10.1109/SCEECS48394.2020.128","DOIUrl":null,"url":null,"abstract":"Microblogging websites like twitter, facebook, etc. has become a substantive platform for the people to publicize their feelings, requirements, etc. It allows users to post short messages for their online audience. These messages are the fusion of blogging and minute messaging, consisting of images, videos, or voice notes. We have primarily focused on information provided by microblogging sites for achieving real-time informational data. Microblogging websites are widely used around the globe by people for portraying what has been happening around their normal living. So, data through these sites eventually helps us getting non-manipulated data directly from the user. In this paper, a disaster dataset (Fani Cyclone dataset) is considered, which consists of the tweets related to a Cyclone named \"Fani\". The tweets are pre-processed and then classified into two categories – informative and non-informative. We have been able to achieve a classification accuracy of 74:268% when pre-processed data is being considered. As we are dealing with disaster dataset, so in the end, we have summarized the informative tweets for the concerned authorities, which would help them to have an overview of the data.","PeriodicalId":167175,"journal":{"name":"2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Classification and Summarization for Informative Tweets\",\"authors\":\"S. Roy, Sumit Mishra, Rakesh Matam\",\"doi\":\"10.1109/SCEECS48394.2020.128\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Microblogging websites like twitter, facebook, etc. has become a substantive platform for the people to publicize their feelings, requirements, etc. It allows users to post short messages for their online audience. These messages are the fusion of blogging and minute messaging, consisting of images, videos, or voice notes. We have primarily focused on information provided by microblogging sites for achieving real-time informational data. Microblogging websites are widely used around the globe by people for portraying what has been happening around their normal living. So, data through these sites eventually helps us getting non-manipulated data directly from the user. In this paper, a disaster dataset (Fani Cyclone dataset) is considered, which consists of the tweets related to a Cyclone named \\\"Fani\\\". The tweets are pre-processed and then classified into two categories – informative and non-informative. We have been able to achieve a classification accuracy of 74:268% when pre-processed data is being considered. As we are dealing with disaster dataset, so in the end, we have summarized the informative tweets for the concerned authorities, which would help them to have an overview of the data.\",\"PeriodicalId\":167175,\"journal\":{\"name\":\"2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCEECS48394.2020.128\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCEECS48394.2020.128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

摘要

微博网站,如twitter、facebook等,已经成为人们表达自己的感受、要求等的实质性平台。它允许用户为他们的在线受众发布短消息。这些信息是博客和即时消息的融合,由图像、视频或语音笔记组成。我们主要关注微博网站提供的信息,以实现实时信息数据。微博网站在全球范围内被人们广泛使用,用来描述他们日常生活中发生的事情。因此,通过这些网站的数据最终帮助我们直接从用户那里获得未经操纵的数据。本文考虑一个灾害数据集(Fani Cyclone数据集),该数据集由与名为“Fani”的气旋相关的推文组成。这些推文经过预处理,然后分为两类——信息性和非信息性。当考虑预处理数据时,我们已经能够实现74:268%的分类精度。由于我们正在处理灾难数据集,所以最后,我们为有关当局总结了信息丰富的推文,这将有助于他们对数据有一个概述。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Classification and Summarization for Informative Tweets
Microblogging websites like twitter, facebook, etc. has become a substantive platform for the people to publicize their feelings, requirements, etc. It allows users to post short messages for their online audience. These messages are the fusion of blogging and minute messaging, consisting of images, videos, or voice notes. We have primarily focused on information provided by microblogging sites for achieving real-time informational data. Microblogging websites are widely used around the globe by people for portraying what has been happening around their normal living. So, data through these sites eventually helps us getting non-manipulated data directly from the user. In this paper, a disaster dataset (Fani Cyclone dataset) is considered, which consists of the tweets related to a Cyclone named "Fani". The tweets are pre-processed and then classified into two categories – informative and non-informative. We have been able to achieve a classification accuracy of 74:268% when pre-processed data is being considered. As we are dealing with disaster dataset, so in the end, we have summarized the informative tweets for the concerned authorities, which would help them to have an overview of the data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信