为希腊语开发一个特定领域的词典

Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas
{"title":"为希腊语开发一个特定领域的词典","authors":"Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas","doi":"10.1145/3575879.3576004","DOIUrl":null,"url":null,"abstract":"We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.","PeriodicalId":164036,"journal":{"name":"Proceedings of the 26th Pan-Hellenic Conference on Informatics","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing a Domain-Specific Lexicon for the Greek Language\",\"authors\":\"Kyriakos Skoularikis, I. Savvas, G. Garani, George Kakarontzas\",\"doi\":\"10.1145/3575879.3576004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.\",\"PeriodicalId\":164036,\"journal\":{\"name\":\"Proceedings of the 26th Pan-Hellenic Conference on Informatics\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 26th Pan-Hellenic Conference on Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3575879.3576004\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 26th Pan-Hellenic Conference on Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3575879.3576004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

我们生活在一个每天在在线社交网络平台上产生大量数据的社会。这些庞大的数据包含了重要的意见相关信息,许多公司和其他科学和商业行业正试图利用这些信息谋取利益。为此,需要进行情绪分析。情感分析或观点挖掘是数据分析的一个分支,用于从用户对特定主题表达的消息中提取情感。虽然在过去的几年里,对英语语言进行了相当多的研究,但由于用户基数较小,希腊语的情感分析作品并不那么受欢迎。在这项工作中,我们提供了一种方法来创建给定希腊语tweet语料库的特定领域字典。在这些lexicon中,我们通过引入新属性Weightw来考虑每个单词在特定领域的重要性。此外,我们部署了一个混合框架,该框架利用新创建的特定于领域的Lexicon和Naïve贝叶斯分类器来分析和预测每个tweet的情绪。我们的框架能够更好地融合两个基本概念,词典和机器学习方法,并展示了特定领域词典中单词的重要性,以便在执行情感分析时获得最佳结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Developing a Domain-Specific Lexicon for the Greek Language
We live in a society where a massive quantity of data is generated daily on online social network platforms. This enormous data contains vital opinion-related information that many companies and other scientific and commercial industries are trying to exploit for their benefits. For that purpose, sentiment analysis is required. Sentiment analysis or opinion mining is the branch of data analytics for extracting sentiments from messages expressed by users on a particular subject. Although, in the past years a considerable research has been made for the English language, the works of Sentiment Analysis in Greek language is not so popular, due to smaller user base. In this work, we provide a method to create domain-specific dictionaries given a corpus of tweets in the Greek language. In those Lexicons, we take into consideration the significance of each word for the specific domain, by introducing a new attribute Weightw. Also, we deploy a hybrid framework which utilizes the newly created domain-specific Lexicon with the Naïve Bayes classifier to analyze and predict the sentiment of each tweet. Our framework has the ability to merge the better of the two basic concepts, the Lexicon and Machine Learning method, and demonstrates the significance of the words for domain-specific Lexicon, for achieving optimal results when performing Sentiment Analysis.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信