{"title":"汉语句法选择预测——基于语料库的ba句和SVO句分析","authors":"Yu Fang, Haitao Liu","doi":"10.1515/cog-2020-0005","DOIUrl":null,"url":null,"abstract":"Abstract This paper investigates the effects of 10 factors on the choice between alternative ba sentences and SVO sentences in Mandarin Chinese. These factors are givenness, definiteness, animacy and pronominality of NP2s, NP2 length, VP length, verb sense, syntactic parallelism, dependency distance, and surprisal. Using corpus data and mixed-effects logistic regression modeling, we find that on the one hand, givenness, syntactic parallelism, and the log-transformed ratio of NP2 length and VP length are significant predictors of the choice between ba sentences and SVO sentences. A new NP2, a large length ratio and a parallel construction predict an SVO sentence rather than a ba sentence. On the other hand, dependency distance and surprisal estimated by the trigram model are effective in predicting the choice between naturally occurring ba/SVO sentences and their alternatives. Naturally occurring sentences are more likely to have shorter dependency distances and smaller surprisal values than the converted sentences. The effects of these five factors on syntactic choice are congruent with results of previous studies, which suggests that some determinants of syntactic choice are shared among languages.","PeriodicalId":51530,"journal":{"name":"Cognitive Linguistics","volume":"32 1","pages":"219 - 250"},"PeriodicalIF":1.8000,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cog-2020-0005","citationCount":"6","resultStr":"{\"title\":\"Predicting syntactic choice in Mandarin Chinese: a corpus-based analysis of ba sentences and SVO sentences\",\"authors\":\"Yu Fang, Haitao Liu\",\"doi\":\"10.1515/cog-2020-0005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract This paper investigates the effects of 10 factors on the choice between alternative ba sentences and SVO sentences in Mandarin Chinese. These factors are givenness, definiteness, animacy and pronominality of NP2s, NP2 length, VP length, verb sense, syntactic parallelism, dependency distance, and surprisal. Using corpus data and mixed-effects logistic regression modeling, we find that on the one hand, givenness, syntactic parallelism, and the log-transformed ratio of NP2 length and VP length are significant predictors of the choice between ba sentences and SVO sentences. A new NP2, a large length ratio and a parallel construction predict an SVO sentence rather than a ba sentence. On the other hand, dependency distance and surprisal estimated by the trigram model are effective in predicting the choice between naturally occurring ba/SVO sentences and their alternatives. Naturally occurring sentences are more likely to have shorter dependency distances and smaller surprisal values than the converted sentences. The effects of these five factors on syntactic choice are congruent with results of previous studies, which suggests that some determinants of syntactic choice are shared among languages.\",\"PeriodicalId\":51530,\"journal\":{\"name\":\"Cognitive Linguistics\",\"volume\":\"32 1\",\"pages\":\"219 - 250\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2021-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1515/cog-2020-0005\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Linguistics\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1515/cog-2020-0005\",\"RegionNum\":1,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1515/cog-2020-0005","RegionNum":1,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
Predicting syntactic choice in Mandarin Chinese: a corpus-based analysis of ba sentences and SVO sentences
Abstract This paper investigates the effects of 10 factors on the choice between alternative ba sentences and SVO sentences in Mandarin Chinese. These factors are givenness, definiteness, animacy and pronominality of NP2s, NP2 length, VP length, verb sense, syntactic parallelism, dependency distance, and surprisal. Using corpus data and mixed-effects logistic regression modeling, we find that on the one hand, givenness, syntactic parallelism, and the log-transformed ratio of NP2 length and VP length are significant predictors of the choice between ba sentences and SVO sentences. A new NP2, a large length ratio and a parallel construction predict an SVO sentence rather than a ba sentence. On the other hand, dependency distance and surprisal estimated by the trigram model are effective in predicting the choice between naturally occurring ba/SVO sentences and their alternatives. Naturally occurring sentences are more likely to have shorter dependency distances and smaller surprisal values than the converted sentences. The effects of these five factors on syntactic choice are congruent with results of previous studies, which suggests that some determinants of syntactic choice are shared among languages.
期刊介绍:
Cognitive Linguistics presents a forum for linguistic research of all kinds on the interaction between language and cognition. The journal focuses on language as an instrument for organizing, processing and conveying information. Cognitive Linguistics is a peer-reviewed journal of international scope and seeks to publish only works that represent a significant advancement to the theory or methods of cognitive linguistics, or that present an unknown or understudied phenomenon. Topics the structural characteristics of natural language categorization (such as prototypicality, cognitive models, metaphor, and imagery); the functional principles of linguistic organization, as illustrated by iconicity; the conceptual interface between syntax and semantics; the experiential background of language-in-use, including the cultural background; the relationship between language and thought, including matters of universality and language specificity.