{"title":"基于词嵌入和词典的Twitter数据情感分类","authors":"R. Raj, Prasanjeet Das, P. Sahu","doi":"10.1109/CSNT48778.2020.9115750","DOIUrl":null,"url":null,"abstract":"Twitter is one of the leading social media platforms for its users for tweeting 280 characters in a single tweet. Social media influence the users to share data, promoting advertisement and posting useful information to the followers. The social media data helps the individuals and business personals to take decision based on the analysis with the data. Twitter sentimental analysis is important for identifying the similar text patterns present in the given input text. The analysis further classified with emotional, positive and negative tweets. Existing approaches limited in accuracy, therefore, the word embedding and lexicon based approach has been introduced for increasing the accuracy. The twitter data stream is taken as an input and preprocessed by removing stop words, hash tags and urls. Further, the data is tokenized and applied with the word embedding method to detect the location and lexicon based approach to segregate the sentimental and emotional tweets. The system has been tested with the live data set as well as offline dataset, and the result shows very promising.","PeriodicalId":131745,"journal":{"name":"2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Emotion Classification on Twitter Data Using Word Embedding and Lexicon Based Approach\",\"authors\":\"R. Raj, Prasanjeet Das, P. Sahu\",\"doi\":\"10.1109/CSNT48778.2020.9115750\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Twitter is one of the leading social media platforms for its users for tweeting 280 characters in a single tweet. Social media influence the users to share data, promoting advertisement and posting useful information to the followers. The social media data helps the individuals and business personals to take decision based on the analysis with the data. Twitter sentimental analysis is important for identifying the similar text patterns present in the given input text. The analysis further classified with emotional, positive and negative tweets. Existing approaches limited in accuracy, therefore, the word embedding and lexicon based approach has been introduced for increasing the accuracy. The twitter data stream is taken as an input and preprocessed by removing stop words, hash tags and urls. Further, the data is tokenized and applied with the word embedding method to detect the location and lexicon based approach to segregate the sentimental and emotional tweets. The system has been tested with the live data set as well as offline dataset, and the result shows very promising.\",\"PeriodicalId\":131745,\"journal\":{\"name\":\"2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CSNT48778.2020.9115750\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CSNT48778.2020.9115750","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Emotion Classification on Twitter Data Using Word Embedding and Lexicon Based Approach
Twitter is one of the leading social media platforms for its users for tweeting 280 characters in a single tweet. Social media influence the users to share data, promoting advertisement and posting useful information to the followers. The social media data helps the individuals and business personals to take decision based on the analysis with the data. Twitter sentimental analysis is important for identifying the similar text patterns present in the given input text. The analysis further classified with emotional, positive and negative tweets. Existing approaches limited in accuracy, therefore, the word embedding and lexicon based approach has been introduced for increasing the accuracy. The twitter data stream is taken as an input and preprocessed by removing stop words, hash tags and urls. Further, the data is tokenized and applied with the word embedding method to detect the location and lexicon based approach to segregate the sentimental and emotional tweets. The system has been tested with the live data set as well as offline dataset, and the result shows very promising.