基于语料库的类型学:应用、挑战和一些解决方案。

IF 1.7 2区 文学 0 LANGUAGE & LINGUISTICS
Linguistic Typology Pub Date : 2022-05-25 Epub Date: 2021-03-30 DOI:10.1515/lingty-2020-0118
Natalia Levshina
{"title":"基于语料库的类型学:应用、挑战和一些解决方案。","authors":"Natalia Levshina","doi":"10.1515/lingty-2020-0118","DOIUrl":null,"url":null,"abstract":"<p><p>Over the last few years, the number of corpora that can be used for language comparison has dramatically increased. The corpora are so diverse in their structure, size and annotation style, that a novice might not know where to start. The present paper charts this new and changing territory, providing a few landmarks, warning signs and safe paths. Although no corpus at present can replace the traditional type of typological data based on language description in reference grammars, corpora can help with diverse tasks, being particularly well suited for investigating probabilistic and gradient properties of languages and for discovering and interpreting cross-linguistic generalizations based on processing and communicative mechanisms. At the same time, the use of corpora for typological purposes has not only advantages and opportunities, but also numerous challenges. This paper also contains an empirical case study addressing two pertinent problems: the role of text types in language comparison and the problem of the word as a comparative concept.</p>","PeriodicalId":45834,"journal":{"name":"Linguistic Typology","volume":"26 1","pages":"129-160"},"PeriodicalIF":1.7000,"publicationDate":"2022-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/lingty-2020-0118","citationCount":"12","resultStr":"{\"title\":\"Corpus-based typology: applications, challenges and some solutions.\",\"authors\":\"Natalia Levshina\",\"doi\":\"10.1515/lingty-2020-0118\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Over the last few years, the number of corpora that can be used for language comparison has dramatically increased. The corpora are so diverse in their structure, size and annotation style, that a novice might not know where to start. The present paper charts this new and changing territory, providing a few landmarks, warning signs and safe paths. Although no corpus at present can replace the traditional type of typological data based on language description in reference grammars, corpora can help with diverse tasks, being particularly well suited for investigating probabilistic and gradient properties of languages and for discovering and interpreting cross-linguistic generalizations based on processing and communicative mechanisms. At the same time, the use of corpora for typological purposes has not only advantages and opportunities, but also numerous challenges. This paper also contains an empirical case study addressing two pertinent problems: the role of text types in language comparison and the problem of the word as a comparative concept.</p>\",\"PeriodicalId\":45834,\"journal\":{\"name\":\"Linguistic Typology\",\"volume\":\"26 1\",\"pages\":\"129-160\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2022-05-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1515/lingty-2020-0118\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Linguistic Typology\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1515/lingty-2020-0118\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2021/3/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Linguistic Typology","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/lingty-2020-0118","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/3/30 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 12

摘要

在过去的几年里,可用于语言比较的语料库数量急剧增加。语料库在结构、大小和注释风格上如此多样化,以至于新手可能不知道从哪里开始。本文描绘了这个新的和不断变化的领域,提供了一些地标,警告标志和安全路径。虽然目前没有语料库可以取代基于参考语法中语言描述的传统类型数据,但语料库可以帮助完成各种任务,特别适合于调查语言的概率和梯度特性,以及发现和解释基于处理和交际机制的跨语言概括。同时,使用语料库进行类型学研究既有优势和机遇,也有诸多挑战。本文还包含了一个实证案例研究,解决了两个相关问题:文本类型在语言比较中的作用和单词作为比较概念的问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Corpus-based typology: applications, challenges and some solutions.

Corpus-based typology: applications, challenges and some solutions.

Corpus-based typology: applications, challenges and some solutions.

Over the last few years, the number of corpora that can be used for language comparison has dramatically increased. The corpora are so diverse in their structure, size and annotation style, that a novice might not know where to start. The present paper charts this new and changing territory, providing a few landmarks, warning signs and safe paths. Although no corpus at present can replace the traditional type of typological data based on language description in reference grammars, corpora can help with diverse tasks, being particularly well suited for investigating probabilistic and gradient properties of languages and for discovering and interpreting cross-linguistic generalizations based on processing and communicative mechanisms. At the same time, the use of corpora for typological purposes has not only advantages and opportunities, but also numerous challenges. This paper also contains an empirical case study addressing two pertinent problems: the role of text types in language comparison and the problem of the word as a comparative concept.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.70
自引率
5.00%
发文量
13
期刊介绍: Linguistic Typology provides a forum for all work of relevance to the study of language typology and cross-linguistic variation. It welcomes work taking a typological perspective on all domains of the structure of spoken and signed languages, including historical change, language processing, and sociolinguistics. Diverse descriptive and theoretical frameworks are welcomed so long as they have a clear bearing on the study of cross-linguistic variation. We welcome cross-disciplinary approaches to the study of linguistic diversity, as well as work dealing with just one or a few languages, as long as it is typologically informed and typologically and theoretically relevant, and contains new empirical evidence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信