Slavic Corpus and Computational Linguistics

Impact factor: 0.4 (Language & Linguistics)
Dagmar Divjak, Serge Sharoff, Tomaž Erjavec
Journal: Journal of Slavic Linguistics
Publication type: Journal Article
Published: 2018-02-22
DOI: 10.1353/JSL.2017.0008 (https://doi.org/10.1353/JSL.2017.0008)
Citations: 5

Abstract: In this paper we focus on corpus-linguistic studies that address theoretical questions and on computational-linguistic work on corpus annotation that makes corpora useful for linguistic analysis. First, we discuss why the corpus-linguistic approach was discredited by generative linguists in the second half of the 20th century, how it made a comeback through advances in computing, and how it was finally adopted by usage-based linguistics at the beginning of the 21st century. Then we give an overview of necessary and common annotation layers and the issues encountered in automatic annotation, with special emphasis on Slavic languages. Finally, we survey the types of corpus-based research that Slavic linguists are involved in worldwide and the resources they have at their disposal.
Source journal
Journal of Slavic Linguistics (Language & Linguistics)
CiteScore: 0.50
Self-citation rate: 0.00%
Articles published: 0
About the journal: Journal of Slavic Linguistics, or JSL, is the official journal of the Slavic Linguistics Society. JSL publishes research articles and book reviews that address the description and analysis of Slavic languages and that are of general interest to linguists. Published papers deal with any aspect of synchronic or diachronic Slavic linguistics – phonetics, phonology, morphology, syntax, semantics, or pragmatics – which raises substantive problems of broad theoretical concern or proposes significant descriptive generalizations. Comparative studies and formal analyses are also published. Different theoretical orientations are represented in the journal. One volume (two issues) is published per year, ca. 360 pp.