{"title":"基于长短期记忆和快速文本的Twitter社交媒体情感检测","authors":"M. Riza, N. Charibaldi","doi":"10.25139/IJAIR.V3I1.3827","DOIUrl":null,"url":null,"abstract":"Emotion detection is important in various fields such as education, business, employee recruitment. In this study, emotions will be detected with text that comes from Twitter because social media makes users tend to express emotions through text posts. One of the social media that has the highest user growth rate in Indonesia is Twitter. This study will use the LSTM method because this method is proven to be better than previous studies. Word embedding fast text will also be used in this study to improve Word2Vec and GloVe that cannot handle the problem of out of vocabulary (OOV). This research produces the best accuracy for each word embedding as follows, Word2Vec produces an accuracy of 73,15%, GloVe produces an accuracy of 60,10%, fast text produces an accuracy of 73,15%. The conclusion in this study is the best accuracy was obtained by Word2Vec and fast text. The fast text has the advantage of handling the problem of out of vocabulary (OOV), but in this study, it cannot improve the accuracy of word 2vec. This study has not been able to produce very good accuracy. This is because of the data used. In future works, to get even better results, it is expected to apply other deep learning methods, such as CNN, BiLSTM, etc. It is hoped that more data will be used in future studies.","PeriodicalId":365842,"journal":{"name":"International Journal of Artificial Intelligence & Robotics (IJAIR)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Emotion Detection in Twitter Social Media Using Long Short-Term Memory (LSTM) and Fast Text\",\"authors\":\"M. Riza, N. Charibaldi\",\"doi\":\"10.25139/IJAIR.V3I1.3827\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Emotion detection is important in various fields such as education, business, employee recruitment. In this study, emotions will be detected with text that comes from Twitter because social media makes users tend to express emotions through text posts. One of the social media that has the highest user growth rate in Indonesia is Twitter. This study will use the LSTM method because this method is proven to be better than previous studies. Word embedding fast text will also be used in this study to improve Word2Vec and GloVe that cannot handle the problem of out of vocabulary (OOV). This research produces the best accuracy for each word embedding as follows, Word2Vec produces an accuracy of 73,15%, GloVe produces an accuracy of 60,10%, fast text produces an accuracy of 73,15%. The conclusion in this study is the best accuracy was obtained by Word2Vec and fast text. The fast text has the advantage of handling the problem of out of vocabulary (OOV), but in this study, it cannot improve the accuracy of word 2vec. This study has not been able to produce very good accuracy. This is because of the data used. In future works, to get even better results, it is expected to apply other deep learning methods, such as CNN, BiLSTM, etc. It is hoped that more data will be used in future studies.\",\"PeriodicalId\":365842,\"journal\":{\"name\":\"International Journal of Artificial Intelligence & Robotics (IJAIR)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Artificial Intelligence & Robotics (IJAIR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.25139/IJAIR.V3I1.3827\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Artificial Intelligence & Robotics (IJAIR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25139/IJAIR.V3I1.3827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
摘要
情感检测在教育、商业、员工招聘等各个领域都很重要。在本研究中,情感将通过来自Twitter的文本进行检测,因为社交媒体使用户倾向于通过文本帖子来表达情感。Twitter是印尼用户增长率最高的社交媒体之一。本研究将使用LSTM方法,因为该方法被证明比以往的研究更好。本研究还将使用单词嵌入快速文本来改进Word2Vec和GloVe无法处理out of vocabulary (OOV)的问题。本研究得出的每个词嵌入的最佳准确率如下,Word2Vec产生的准确率为73,15%,GloVe产生的准确率为66,10%,fast text产生的准确率为73,15%。本研究的结论是使用Word2Vec和fast text获得了最好的准确率。快速文本在处理词汇量不足(OOV)问题上具有优势,但在本研究中,它并不能提高单词2vec的准确率。这项研究还不能产生很好的准确性。这是因为所使用的数据。在未来的工作中,为了得到更好的结果,我们希望应用其他的深度学习方法,如CNN、BiLSTM等。希望在未来的研究中使用更多的数据。
Emotion Detection in Twitter Social Media Using Long Short-Term Memory (LSTM) and Fast Text
Emotion detection is important in various fields such as education, business, employee recruitment. In this study, emotions will be detected with text that comes from Twitter because social media makes users tend to express emotions through text posts. One of the social media that has the highest user growth rate in Indonesia is Twitter. This study will use the LSTM method because this method is proven to be better than previous studies. Word embedding fast text will also be used in this study to improve Word2Vec and GloVe that cannot handle the problem of out of vocabulary (OOV). This research produces the best accuracy for each word embedding as follows, Word2Vec produces an accuracy of 73,15%, GloVe produces an accuracy of 60,10%, fast text produces an accuracy of 73,15%. The conclusion in this study is the best accuracy was obtained by Word2Vec and fast text. The fast text has the advantage of handling the problem of out of vocabulary (OOV), but in this study, it cannot improve the accuracy of word 2vec. This study has not been able to produce very good accuracy. This is because of the data used. In future works, to get even better results, it is expected to apply other deep learning methods, such as CNN, BiLSTM, etc. It is hoped that more data will be used in future studies.