A Survey on Large Scale Corpora and Emotion Corpora

Q4 Computer Science

Journal of Information Processing. Pub Date: 2014-01-01. DOI: 10.11185/IMT.9.429

M. Ptaszynski, Rafal Rzepka, S. Oyama, M. Kurihara, K. Araki


Citations: 1

Abstract


A Survey on Large Scale Corpora and Emotion Corpora

In this paper we present a survey of natural language corpora, with particular focus on large-scale corpora and those applicable to sentiment analysis. Natural language corpora are crucial for training various software engineering applications, from part-of-speech taggers and dependency parsers to dialog systems and sentiment analysis software. We compare several natural language corpora created for different languages and analyze their distinctive features and the amount of additional annotation provided by their developers.


Source journal

Journal of Information Processing Computer Science-Computer Science (all)

CiteScore

1.20

Self-citation rate

0.00%

Number of publications