面向目标的自然灾害微博监测系统

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS) Pub Date : 2017-05-01 DOI:10.1109/ICIS.2017.7959984

S. Win, Than Nwe Aung

{"title":"面向目标的自然灾害微博监测系统","authors":"S. Win, Than Nwe Aung","doi":"10.1109/ICIS.2017.7959984","DOIUrl":null,"url":null,"abstract":"Twitter, Social Networking Site, becomes most popular microblogging service and people have started publishing data on the use of it in natural disasters. Twitter has also created the opportunities for first responders to know the critical information and work effective reactions for impacted communities. This paper introduces the tweet monitoring system to identify the messages that people updated during natural disasters into a set of information categories and provide user desired target information type automatically. In this system, classification is done at tweet level with three labels by using LibLinear classifier. This system intended to extract the small number of informational and actionable tweets from large amounts of raw tweets on Twitter using machine learning and natural language processing (NLP). Feature extraction of this work exploited only linguistic features, sentiment lexicon based features and especially disaster lexicon based features. The annotation system also creates disaster related corpus with new tweets collected from Twitter API and annotation is done on real time manner. The performance of this system is evaluated based on four publicly available annotated datasets. The experiments showed the classification accuracy on the proposed features set is higher than the classifier based on neural word embeddings and standard bag-of-words models. This system automatically annotated the Myanmar_Earthquake_2016 dataset at 75% accuracy on average.","PeriodicalId":301467,"journal":{"name":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"Target oriented tweets monitoring system during natural disasters\",\"authors\":\"S. Win, Than Nwe Aung\",\"doi\":\"10.1109/ICIS.2017.7959984\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Twitter, Social Networking Site, becomes most popular microblogging service and people have started publishing data on the use of it in natural disasters. Twitter has also created the opportunities for first responders to know the critical information and work effective reactions for impacted communities. This paper introduces the tweet monitoring system to identify the messages that people updated during natural disasters into a set of information categories and provide user desired target information type automatically. In this system, classification is done at tweet level with three labels by using LibLinear classifier. This system intended to extract the small number of informational and actionable tweets from large amounts of raw tweets on Twitter using machine learning and natural language processing (NLP). Feature extraction of this work exploited only linguistic features, sentiment lexicon based features and especially disaster lexicon based features. The annotation system also creates disaster related corpus with new tweets collected from Twitter API and annotation is done on real time manner. The performance of this system is evaluated based on four publicly available annotated datasets. The experiments showed the classification accuracy on the proposed features set is higher than the classifier based on neural word embeddings and standard bag-of-words models. This system automatically annotated the Myanmar_Earthquake_2016 dataset at 75% accuracy on average.\",\"PeriodicalId\":301467,\"journal\":{\"name\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIS.2017.7959984\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIS.2017.7959984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

摘要

社交网站Twitter成为最受欢迎的微博服务，人们开始发布在自然灾害中使用Twitter的数据。Twitter也为急救人员提供了了解关键信息的机会，并为受影响的社区做出有效的反应。本文介绍了推特监测系统，将人们在自然灾害中更新的消息识别为一组信息类别，并自动提供用户想要的目标信息类型。在该系统中，使用LibLinear分类器在tweet级别上使用三个标签进行分类。该系统旨在使用机器学习和自然语言处理(NLP)从Twitter上的大量原始tweet中提取少量信息和可操作的tweet。本工作的特征提取只利用了语言特征、基于情感词汇的特征，特别是基于灾难词汇的特征。注释系统还利用从Twitter API中收集的新tweet创建与灾难相关的语料库，并实时进行注释。该系统的性能基于四个公开可用的带注释的数据集进行评估。实验表明，基于神经词嵌入和标准词袋模型的分类器在特征集上的分类准确率更高。该系统以平均75%的准确率自动标注了myanmar_seisake_2016数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Target oriented tweets monitoring system during natural disasters

Twitter, Social Networking Site, becomes most popular microblogging service and people have started publishing data on the use of it in natural disasters. Twitter has also created the opportunities for first responders to know the critical information and work effective reactions for impacted communities. This paper introduces the tweet monitoring system to identify the messages that people updated during natural disasters into a set of information categories and provide user desired target information type automatically. In this system, classification is done at tweet level with three labels by using LibLinear classifier. This system intended to extract the small number of informational and actionable tweets from large amounts of raw tweets on Twitter using machine learning and natural language processing (NLP). Feature extraction of this work exploited only linguistic features, sentiment lexicon based features and especially disaster lexicon based features. The annotation system also creates disaster related corpus with new tweets collected from Twitter API and annotation is done on real time manner. The performance of this system is evaluated based on four publicly available annotated datasets. The experiments showed the classification accuracy on the proposed features set is higher than the classifier based on neural word embeddings and standard bag-of-words models. This system automatically annotated the Myanmar_Earthquake_2016 dataset at 75% accuracy on average.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2017 IEEE/ACIS 16th International Conference on Computer and Information Science (ICIS)

自引率

0.00%

发文量