Twitter数据的文本分类和聚类在商业分析中的应用

2018 Majan International Conference (MIC) Pub Date : 2018-03-19 DOI:10.1109/MINTC.2018.8363162

Alrence S. Halibas, Abubucker Samsudeen Shaffi, Mohamed Mohamed

{"title":"Twitter数据的文本分类和聚类在商业分析中的应用","authors":"Alrence S. Halibas, Abubucker Samsudeen Shaffi, Mohamed Mohamed","doi":"10.1109/MINTC.2018.8363162","DOIUrl":null,"url":null,"abstract":"In the recent years, social networks in business are gaining unprecedented popularity because of their potential for business growth. Companies can know more about consumers' sentiments towards their products and services, and use it to better understand the market and improve their brand. Thus, companies regularly reinvent their marketing strategies and campaigns to fit consumers' preferences. Social analysis harnesses and utilizes the vast volume of data in social networks to mine critical data for strategic decision making. It uses machine learning techniques and tools in determining patterns and trends to gain actionable insights. This paper selected a popular food brand to evaluate a given stream of customer comments on Twitter. Several metrics in classification and clustering of data were used for analysis. A Twitter API is used to collect twitter corpus and feed it to a Binary Tree classifier that will discover the polarity lexicon of English tweets, whether positive or negative. A k-means clustering technique is used to group together similar words in tweets in order to discover certain business value. This paper attempts to discuss the technical and business perspectives of text mining analysis of Twitter data and recommends appropriate future opportunities in developing this emerging field.","PeriodicalId":250088,"journal":{"name":"2018 Majan International Conference (MIC)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":"{\"title\":\"Application of text classification and clustering of Twitter data for business analytics\",\"authors\":\"Alrence S. Halibas, Abubucker Samsudeen Shaffi, Mohamed Mohamed\",\"doi\":\"10.1109/MINTC.2018.8363162\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the recent years, social networks in business are gaining unprecedented popularity because of their potential for business growth. Companies can know more about consumers' sentiments towards their products and services, and use it to better understand the market and improve their brand. Thus, companies regularly reinvent their marketing strategies and campaigns to fit consumers' preferences. Social analysis harnesses and utilizes the vast volume of data in social networks to mine critical data for strategic decision making. It uses machine learning techniques and tools in determining patterns and trends to gain actionable insights. This paper selected a popular food brand to evaluate a given stream of customer comments on Twitter. Several metrics in classification and clustering of data were used for analysis. A Twitter API is used to collect twitter corpus and feed it to a Binary Tree classifier that will discover the polarity lexicon of English tweets, whether positive or negative. A k-means clustering technique is used to group together similar words in tweets in order to discover certain business value. This paper attempts to discuss the technical and business perspectives of text mining analysis of Twitter data and recommends appropriate future opportunities in developing this emerging field.\",\"PeriodicalId\":250088,\"journal\":{\"name\":\"2018 Majan International Conference (MIC)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"31\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Majan International Conference (MIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MINTC.2018.8363162\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Majan International Conference (MIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MINTC.2018.8363162","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 31

摘要

近年来，商业社交网络因其潜在的商业增长潜力而获得前所未有的普及。企业可以更多地了解消费者对其产品和服务的看法，并利用它来更好地了解市场，提升品牌。因此，公司经常重新设计他们的营销策略和活动，以适应消费者的偏好。社交分析利用社交网络中的大量数据来挖掘关键数据以进行战略决策。它使用机器学习技术和工具来确定模式和趋势，以获得可操作的见解。本文选择了一个受欢迎的食品品牌来评估Twitter上给定的客户评论流。使用数据分类和聚类的几个指标进行分析。Twitter API用于收集Twitter语料库并将其提供给二进制树分类器，该分类器将发现英语tweet的极性词典，无论是积极的还是消极的。使用k-means聚类技术将tweet中的相似词分组，以发现某些商业价值。本文试图讨论Twitter数据的文本挖掘分析的技术和商业角度，并建议适当的未来发展这一新兴领域的机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Application of text classification and clustering of Twitter data for business analytics

In the recent years, social networks in business are gaining unprecedented popularity because of their potential for business growth. Companies can know more about consumers' sentiments towards their products and services, and use it to better understand the market and improve their brand. Thus, companies regularly reinvent their marketing strategies and campaigns to fit consumers' preferences. Social analysis harnesses and utilizes the vast volume of data in social networks to mine critical data for strategic decision making. It uses machine learning techniques and tools in determining patterns and trends to gain actionable insights. This paper selected a popular food brand to evaluate a given stream of customer comments on Twitter. Several metrics in classification and clustering of data were used for analysis. A Twitter API is used to collect twitter corpus and feed it to a Binary Tree classifier that will discover the polarity lexicon of English tweets, whether positive or negative. A k-means clustering technique is used to group together similar words in tweets in order to discover certain business value. This paper attempts to discuss the technical and business perspectives of text mining analysis of Twitter data and recommends appropriate future opportunities in developing this emerging field.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 Majan International Conference (MIC)

自引率

0.00%

发文量