Sentimental Analysis on Social Media Comments with Recurring Models and Pretrained Word Embeddings in Portuguese

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval Pub Date : 2022-12-16 DOI:10.1145/3582768.3582805

Cristian Muoz Villalobos, Leonardo Mendoza Forero, Harold De Mello, Cesar Valencia, Alvaro Orjuela, R. Tanscheit, Marco Pacheco Cavalcanti

{"title":"Sentimental Analysis on Social Media Comments with Recurring Models and Pretrained Word Embeddings in Portuguese","authors":"Cristian Muoz Villalobos, Leonardo Mendoza Forero, Harold De Mello, Cesar Valencia, Alvaro Orjuela, R. Tanscheit, Marco Pacheco Cavalcanti","doi":"10.1145/3582768.3582805","DOIUrl":null,"url":null,"abstract":"Natural Language Processing (NLP) techniques are increasingly powerful for interpreting a person’s feelings and reaction to a product or service. Sentiment analysis has become a fundamental tool for this interpretation, and it has studies in languages other than English. This type of application is uncommon and unheard of in Portuguese. This article presents a sentiment analysis classification based on Portuguese social media comments. Representation of word embeddings with both pre-trained Glove and Word2Vec models were generated through a corpus entirely in Portuguese. This article presents a set of results with different models of pre-trained layers and deep learning models exclusive to the Portuguese language on social networks. Two classification models were used and compared: (i) Bidirectional Long Short-Term Memory (BI-LSTM) and (ii) Bidirectional Gated Recurrent Unit (BI-GRU), achieving accuracy results of 99.1","PeriodicalId":315721,"journal":{"name":"Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3582768.3582805","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Natural Language Processing (NLP) techniques are increasingly powerful for interpreting a person’s feelings and reaction to a product or service. Sentiment analysis has become a fundamental tool for this interpretation, and it has studies in languages other than English. This type of application is uncommon and unheard of in Portuguese. This article presents a sentiment analysis classification based on Portuguese social media comments. Representation of word embeddings with both pre-trained Glove and Word2Vec models were generated through a corpus entirely in Portuguese. This article presents a set of results with different models of pre-trained layers and deep learning models exclusive to the Portuguese language on social networks. Two classification models were used and compared: (i) Bidirectional Long Short-Term Memory (BI-LSTM) and (ii) Bidirectional Gated Recurrent Unit (BI-GRU), achieving accuracy results of 99.1

查看原文本刊更多论文

基于循环模型和预训练词嵌入的葡萄牙语社交媒体评论情感分析

自然语言处理(NLP)技术在解释人们对产品或服务的感受和反应方面越来越强大。情感分析已经成为这种解释的基本工具，并且在英语以外的语言中也有研究。这种类型的应用在葡萄牙语中是不常见和闻所未闻的。本文介绍了一种基于葡萄牙社交媒体评论的情感分析分类。用预训练的Glove和Word2Vec模型通过一个完全用葡萄牙语的语料库生成词嵌入的表示。这篇文章展示了一组不同的预训练层模型和深度学习模型的结果，这些模型只针对社交网络上的葡萄牙语。使用两种分类模型进行比较:(i)双向长短期记忆(BI-LSTM)和(ii)双向门控循环单元(BI-GRU)，准确率达到99.1

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval

自引率

0.00%

发文量