Natiolectal Variation in Dutch Morphosyntax: A Large-Scale, Data-Driven Perspective

IF 0.7 3区文学 0 LANGUAGE & LINGUISTICS

Journal of Germanic Linguistics Pub Date : 2023-02-08 DOI:10.1017/S1470542722000071

Robbert De Troij, S. Grondelaers, D. Speelman

{"title":"Natiolectal Variation in Dutch Morphosyntax: A Large-Scale, Data-Driven Perspective","authors":"Robbert De Troij, S. Grondelaers, D. Speelman","doi":"10.1017/S1470542722000071","DOIUrl":null,"url":null,"abstract":"Abstract In this article, we report a large-scale corpus study aimed at tackling the (controversial) question to what extent the European national varieties of Dutch, that is, Belgian and Netherlandic Dutch, exhibit morpho-syntactic differences. Instead of relying on a manual selection of cases of morphosyntactic variation, we first marshal large bilingual parallel corpora and machine translation software to identify semiautomatically, in an extensively data-driven fashion, loci of variation from various “corners” of Dutch grammar. We then gauge the distribution of con-structional alternatives in a nationally as well as stylistically stratified corpus for a representative selection of twenty alternation patterns. We find that natiolectal variation in the grammar of Dutch is far more prevalent than often assumed, especially in less edited text types, and that it shows up in inflection phenomena, lexically conditioned syntactic variation, and pure word order permutations. Another key finding is that many cases of synchronic probabilistic asymmetries reflect a diachronic difference between the two varieties: Netherlandic Dutch often tends to be ahead in cases of ongoing grammatical change, with Belgian Dutch holding on somewhat longer to obsolescent features of the grammar.*","PeriodicalId":42927,"journal":{"name":"Journal of Germanic Linguistics","volume":"35 1","pages":"1 - 68"},"PeriodicalIF":0.7000,"publicationDate":"2023-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Germanic Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1017/S1470542722000071","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract In this article, we report a large-scale corpus study aimed at tackling the (controversial) question to what extent the European national varieties of Dutch, that is, Belgian and Netherlandic Dutch, exhibit morpho-syntactic differences. Instead of relying on a manual selection of cases of morphosyntactic variation, we first marshal large bilingual parallel corpora and machine translation software to identify semiautomatically, in an extensively data-driven fashion, loci of variation from various “corners” of Dutch grammar. We then gauge the distribution of con-structional alternatives in a nationally as well as stylistically stratified corpus for a representative selection of twenty alternation patterns. We find that natiolectal variation in the grammar of Dutch is far more prevalent than often assumed, especially in less edited text types, and that it shows up in inflection phenomena, lexically conditioned syntactic variation, and pure word order permutations. Another key finding is that many cases of synchronic probabilistic asymmetries reflect a diachronic difference between the two varieties: Netherlandic Dutch often tends to be ahead in cases of ongoing grammatical change, with Belgian Dutch holding on somewhat longer to obsolescent features of the grammar.*

查看原文本刊更多论文

荷兰语形态句法中的本土方言变异：一个大规模的、数据驱动的视角

摘要在本文中，我们报道了一项大规模的语料库研究，旨在解决（有争议的）问题，即荷兰语的欧洲国家变体，即比利时和荷兰荷兰语，在多大程度上表现出形态句法差异。我们不再依赖于手动选择形态句法变异的案例，而是首先整理大型双语平行语料库和机器翻译软件，以广泛的数据驱动方式半自动识别荷兰语法各个“角落”的变异位点。然后，我们对20种交替模式的代表性选择，在全国性和风格分层的语料库中衡量结构替代品的分布。我们发现，荷兰语语法中的国家方言变异比通常假设的要普遍得多，尤其是在编辑较少的文本类型中，它表现在屈折现象、词汇条件句法变异和纯词序排列中。另一个关键发现是，许多共时概率不对称的情况反映了这两个变体之间的历时差异：荷兰荷兰语在语法变化的情况下往往领先，而比利时荷兰语则保留了更长的语法特征*

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Germanic Linguistics Multiple-

CiteScore

1.10

自引率

20.00%

发文量