Bharathi Raja Chakravarthi, R. Priyadharshini, V. Muralidaran, Shardul Suryawanshi, Navya Jose, E. Sherly, John P. McCrae
{"title":"Overview of the track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text","authors":"Bharathi Raja Chakravarthi, R. Priyadharshini, V. Muralidaran, Shardul Suryawanshi, Navya Jose, E. Sherly, John P. McCrae","doi":"10.1145/3441501.3441515","DOIUrl":null,"url":null,"abstract":"Sentiment analysis of Dravidian languages has received attention in recent years. However, most social media text is code-mixed and there is no research available on sentiment analysis of code-mixed Dravidian languages. The Dravidian-CodeMix-FIRE 2020, a track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text, focused on creating a platform for researchers to come together and investigate the problem. There were two languages for this track: (i) Tamil, and (ii) Malayalam. The participants were given a dataset of YouTube comments and the goal of the shared task submissions was to recognise the sentiment of each comment by classifying them into positive, negative, neutral, mixed-feeling classes or by recognising whether the comment is not in the intended language. The performance of the systems was evaluated by weighted-F1 score.","PeriodicalId":415985,"journal":{"name":"Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation","volume":" 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"127","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th Annual Meeting of the Forum for Information Retrieval Evaluation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3441501.3441515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 127
Abstract
Sentiment analysis of Dravidian languages has received attention in recent years. However, most social media text is code-mixed and there is no research available on sentiment analysis of code-mixed Dravidian languages. The Dravidian-CodeMix-FIRE 2020, a track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text, focused on creating a platform for researchers to come together and investigate the problem. There were two languages for this track: (i) Tamil, and (ii) Malayalam. The participants were given a dataset of YouTube comments and the goal of the shared task submissions was to recognise the sentiment of each comment by classifying them into positive, negative, neutral, mixed-feeling classes or by recognising whether the comment is not in the intended language. The performance of the systems was evaluated by weighted-F1 score.
德拉威语的情感分析近年来备受关注。然而,大多数社交媒体文本是代码混合的,没有关于代码混合的德拉威语情感分析的研究。德拉威语- codemix - fire 2020是一篇关于德拉威语在代码混合文本中的情感分析的文章,专注于为研究人员创建一个平台,让他们聚集在一起调查这个问题。这条赛道有两种语言:(i)泰米尔语和(ii)马拉雅拉姆语。参与者得到了一个YouTube评论的数据集,提交共享任务的目标是通过将每个评论分为积极、消极、中立、混合情绪类,或者通过识别评论是否使用预期语言来识别每个评论的情绪。采用f1加权评分对系统的性能进行评价。