Parsed Corpus as a Source for Testing Generalizations in Japanese Syntax
H. Kishimoto, Prashant Pardeshi
Linguistic Issues in Language Technology, 18(1), published 2019-08-13
DOI: https://doi.org/10.33011/lilt.v18i.1431
Abstract
In this paper, we discuss constituent ordering generalizations in Japanese. Japanese has SOV as its basic order, but it permits a significant range of argument order variations brought about by ‘scrambling’. Although scrambling induces few semantic effects, it is conceivable that marked orders are derived from the unmarked order for pragmatic or other reasons. The difference between basic and derived orders is not reflected in native speakers’ grammaticality judgments, but we suggest that intuitions about the ordering of arguments may be attested in corpus data. Using the Keyaki treebank (a proper subset of which is the NINJAL Parsed Corpus of Modern Japanese (NPCMJ)), we show that naturally occurring corpus data confirm that marked orderings of arguments are less frequent than their unmarked counterparts. We suggest some possible motivations behind the argument order variations.
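The kind of frequency comparison described above can be illustrated with a minimal sketch. It is not the authors' procedure and does not read the Keyaki treebank: it assumes each clause has already been reduced to the left-to-right sequence of its argument roles, and the role labels `SBJ` and `OBJ`, the toy data, and the function names are all hypothetical placeholders chosen for illustration.

```python
from collections import Counter

# Toy clauses: each list gives the left-to-right order of argument roles
# preceding the verb. "SBJ" and "OBJ" are illustrative placeholder labels,
# not the treebank's actual tag set, and the data are invented.
TOY_CLAUSES = [
    ["SBJ", "OBJ"],
    ["SBJ", "OBJ"],
    ["OBJ", "SBJ"],
    ["SBJ"],
    ["SBJ", "OBJ"],
]


def classify_order(roles):
    """Return 'unmarked' for subject-before-object, 'marked' for the
    reverse, and None when the clause lacks either argument."""
    if "SBJ" not in roles or "OBJ" not in roles:
        return None
    return "unmarked" if roles.index("SBJ") < roles.index("OBJ") else "marked"


def count_orders(clauses):
    """Tally marked and unmarked orders over clauses that have both arguments."""
    counts = Counter()
    for roles in clauses:
        label = classify_order(roles)
        if label is not None:
            counts[label] += 1
    return counts


counts = count_orders(TOY_CLAUSES)
total = sum(counts.values())
for label in ("unmarked", "marked"):
    print(label, counts[label], f"{counts[label] / total:.2f}")
```

On a real parsed corpus, the role sequences would have to be extracted from the tree annotation first; that extraction step is deliberately left out here because it depends on the treebank's own labelling scheme.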