Basic word order typology revisited: a crosslinguistic quantitative study based on UD and WALS

IF 1.1; Zone 2, Literature; JCR category: LANGUAGE & LINGUISTICS
Jianwei Yan, Haitao Liu
{"title":"Basic word order typology revisited: a crosslinguistic quantitative study based on UD and WALS","authors":"Jianwei Yan, Haitao Liu","doi":"10.1515/lingvan-2021-0001","DOIUrl":null,"url":null,"abstract":"Abstract This study quantitatively examines the first five universals of Greenberg’s basic word order typology based on 74 large-scale annotated corpora from two perspectives. The results show that (1) the dominant orders extracted from corpora concur with those retrieved from the World Atlas of Language Structures (henceforth, WALS) and provide knowledge of dominant orders to languages absent in the WALS, demonstrating the feasibility of adopting corpora to determine dominant orders in typological studies; (2) approaching word order as a discrete variable suggests that the relative order of adjective and noun cannot be predicted by the relative orders of object and verb and genitive and noun, which means the violation of Greenberg’s related universal; (3) approaching word order as a continuous variable also indicates the violation of this universal; and (4) the language samples based on the annotated corpora database further demonstrates that languages that are in line with this universal are rare and internally heterogeneous. 
Our findings suggest the possibility of drawing typological conclusions based on the frequencies and probabilities extracted from corpora materials and demonstrate that a more cautious adoption of the well-known universals is needed, indicating the importance of viewing word order features from various perspectives to better capture the characteristics of natural languages.","PeriodicalId":55960,"journal":{"name":"Linguistics Vanguard","volume":" ","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Linguistics Vanguard","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/lingvan-2021-0001","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
Citation count: 0

Abstract

This study quantitatively examines the first five universals of Greenberg’s basic word order typology from two perspectives, based on 74 large-scale annotated corpora. The results show that (1) the dominant orders extracted from corpora concur with those retrieved from the World Atlas of Language Structures (henceforth, WALS) and supply dominant orders for languages absent from WALS, demonstrating the feasibility of using corpora to determine dominant orders in typological studies; (2) treating word order as a discrete variable suggests that the relative order of adjective and noun cannot be predicted from the relative orders of object and verb and of genitive and noun, which violates Greenberg’s related universal; (3) treating word order as a continuous variable also indicates a violation of this universal; and (4) the language samples drawn from the annotated corpora further demonstrate that languages in line with this universal are rare and internally heterogeneous. Our findings suggest that typological conclusions can be drawn from the frequencies and probabilities extracted from corpus materials, and that the well-known universals should be adopted more cautiously. They also indicate the importance of viewing word order features from various perspectives to better capture the characteristics of natural languages.
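The contrast between the discrete and the continuous view of word order can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes a treebank in the Universal Dependencies CoNLL-U format, uses the `amod` relation as a stand-in for the adjective-noun pair, and classifies an order as dominant when it is at least twice as frequent as its alternative. The function names, the toy sample, and the `ratio` threshold are all hypothetical choices made for illustration.

```python
def order_proportion(conllu_text, deprel):
    """Continuous view: share of `deprel` dependents that precede their head."""
    before = total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only ordinary tokens: 10 columns, integer ID and integer HEAD.
        # This skips multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit() or not cols[6].isdigit():
            continue
        if cols[7].split(":")[0] != deprel:  # ignore relation subtypes
            continue
        total += 1
        if int(cols[0]) < int(cols[6]):
            before += 1
    return before / total if total else None


def dominant_order(proportion, ratio=2.0):
    """Discrete view: collapse a proportion into a category.

    Assumption: an order counts as dominant when it is at least `ratio`
    times as frequent as the alternative.
    """
    if proportion is None:
        return "no data"
    if proportion >= ratio * (1 - proportion):
        return "dependent-head"
    if (1 - proportion) >= ratio * proportion:
        return "head-dependent"
    return "no dominant order"


def _row(idx, form, upos, head, rel):
    # Build one CoNLL-U token line; unused columns are filled with "_".
    return "\t".join([str(idx), form, "_", upos, "_", "_", str(head), rel, "_", "_"])


# Toy data invented for this sketch (three short English phrases).
SAMPLE = "\n".join([
    "# text = the red car",
    _row(1, "the", "DET", 3, "det"),
    _row(2, "red", "ADJ", 3, "amod"),
    _row(3, "car", "NOUN", 0, "root"),
    "",
    "# text = something blue",
    _row(1, "something", "PRON", 0, "root"),
    _row(2, "blue", "ADJ", 1, "amod"),
    "",
    "# text = a big old house",
    _row(1, "a", "DET", 4, "det"),
    _row(2, "big", "ADJ", 4, "amod"),
    _row(3, "old", "ADJ", 4, "amod"),
    _row(4, "house", "NOUN", 0, "root"),
])

share = order_proportion(SAMPLE, "amod")
print(share, dominant_order(share))
```

Keeping the proportion alongside the category shows what the discrete view discards: two languages can share the same dominant label while differing considerably in how consistently they follow it.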
Source journal
CiteScore: 2.00
Self-citation rate: 18.20%
Articles published: 105
Journal description: Linguistics Vanguard is a new channel for high-quality articles and innovative approaches in all major fields of linguistics. This multimodal journal is published solely online and provides an accessible platform supporting both traditional and new kinds of publications. Linguistics Vanguard seeks to publish concise and up-to-date reports on the state of the art in linguistics as well as cutting-edge research papers. With its topical breadth of coverage and anticipated quick rate of production, it is one of the leading platforms for scientific exchange in linguistics. Its broad theoretical range, international scope, and diversity of article formats engage students and scholars alike. All topics within linguistics are welcome. The journal especially encourages submissions that take advantage of its new multimodal platform, which is designed to integrate interactive content, including audio and video, images, maps, software code, raw data, and any other media that enhance the traditional written word. The novel platform and concise article format allow for rapid turnaround of submissions. Full peer review assures quality and enables authors to receive appropriate credit for their work. The journal publishes general submissions as well as special collections. Ideas for special collections may be submitted to the editors for consideration.