{"title":"Frontmatter","authors":"","doi":"10.1515/cllt-2020-frontmatter2","DOIUrl":"https://doi.org/10.1515/cllt-2020-frontmatter2","url":null,"abstract":"","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2020-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2020-frontmatter2","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44212864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Light verb variations and varieties of Mandarin Chinese: Comparable corpus driven approaches to grammatical variations","authors":"Hongzhi Xu, M. Jiang, Jingxia Lin, Chu-Ren Huang","doi":"10.1515/cllt-2019-0049","DOIUrl":"https://doi.org/10.1515/cllt-2019-0049","url":null,"abstract":"Abstract This article presents a classification and clustering based study to account for the differences among five Chinese light verbs (congshi, gao, jiayi, jinxing, and zuo) as well as their variations in Mainland China Mandarin (ML) and Taiwan Mandarin (TW). Based on 13 linguistic features, both competition and co-development of these light verbs are studied in terms of their distinct and shared collocates. The proposed method discovers significant new grammatical differences in addition to confirming previously reported ones. Most significant discoveries include selectional restrictions differentiating deverbal nominals and event nouns, and degrees of transitivity of VO compounds. We also find that most variations between Mainland China Mandarin and Taiwan Mandarin are in fact differences in tendencies or preferences in contexts of usage of shared grammatical rules.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2020-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2019-0049","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66858392","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Syntactico-semantic realizations of pronouns in the English transitive construction: A corpus-based analysis","authors":"Haerim Hwang","doi":"10.1515/cllt-2019-0061","DOIUrl":"https://doi.org/10.1515/cllt-2019-0061","url":null,"abstract":"Abstract Pronouns serve as early linguistic cues for the acquisition of the English transitive construction (TC), but previous research has been limited to first language (L1) settings. This study focuses on TC input in the English as a foreign language (EFL) context, investigating syntactico-semantic differences in realizations of TC arguments, particularly pronouns, between L1 parental input and Korean EFL input. To this end, four corpora were created by collecting spoken data from L1-English parents talking to their children, L1-Korean EFL teachers, L1-English EFL teachers, and auditory EFL textbooks. From these corpora, transitive clauses were extracted so that their arguments could be categorized. Mixed-effects negative binomial regression analyses and hierarchical cluster analyses (preceded by principal component analyses) showed that in the realization of TC arguments, Korean EFL input differs syntactically and semantically from L1-English parental input, both for the subjects and objects of TCs. The syntactic difference was particularly pronounced for objects, where fewer pronouns were observed in the EFL input than in the L1-English parental input. Semantically, co-occurrence regularities between transitive verbs and arguments were identified only in the L1-English input and not in the EFL input. Pedagogical implications of the findings are also discussed.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2020-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2019-0061","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41382450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A corpus-based analysis of meaning variations in German tag questions Evidence from spoken and written conversational corpora","authors":"Yulia Clausen, Tatjana Scheffler","doi":"10.1515/cllt-2019-0060","DOIUrl":"https://doi.org/10.1515/cllt-2019-0060","url":null,"abstract":"Abstract This paper addresses semantic/pragmatic variability of tag questions in German and makes three main contributions. First, we document the prevalence and variety of question tags in German across three different types of conversational corpora. Second, by annotating question tags according to their syntactic and semantic context, discourse function, and pragmatic effect, we demonstrate the existing overlap and differences between the individual tag variants. Finally, we distinguish several groups of question tags by identifying the factors that influence the speakers’ choices of tags in the conversational context, such as clause type, function, speaker/hearer knowledge, as well as conversation type and medium. These factors provide the limits of variability by constraining certain question tags in German against occurring in specific contexts or with individual functions.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2020-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2019-0060","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42822164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reconceptualizing register in a continuous situational space","authors":"D. Biber, Jesse Egbert, D. Keller","doi":"10.1515/cllt-2018-0086","DOIUrl":"https://doi.org/10.1515/cllt-2018-0086","url":null,"abstract":"Abstract Corpus-based methods for the quantitative linguistic description of registers are well established. In contrast, situational analyses of registers have been based on qualitative descriptions of categorical situational characteristics. In the present study, we address this inconsistency by describing the variation among texts and registers in a continuous (quantitative) situational space. We describe “registers” as categorical constructs – culturally recognized categories of texts – but propose that they should be described in continuous terms. Such descriptions allow quantitative comparisons of registers, as well as analysis of the extent to which a register is well-delimited in terms of its situational characteristics. Applying this analytical framework, we also explore a deeper issue: the possibility that some texts are not instantiations of any culturally-recognized register category. Both issues are tackled through analysis of a corpus of web documents. We first identify quantitative situational dimensions of variation, employing the methods of multi-dimensional (MD) analysis. We then describe how the situational characteristics of texts and registers can be analyzed in a continuous MD space. And finally, we propose analysis of situational text types – categories that are statistically well-defined in their situational characteristics – as an approach to describing all texts, including texts that do not belong to a culturally recognized register category.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2020-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2018-0086","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41709273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The alternative negative constructions in spoken and written Korean: Logistic regression analysis","authors":"Beom-mo Kang","doi":"10.1515/cllt-2016-0021","DOIUrl":"https://doi.org/10.1515/cllt-2016-0021","url":null,"abstract":"Abstract Adopting quantitative corpus-based methods, this paper focuses on the alternative negative constructions in Korean, [an V] and [V anhda]. Logistic regression analyses for a mixed-effects model were carried out on data drawn from the Sejong Korean Corpus. Certain features of the verb or adjective in negative constructions significantly affect the use of the two negative constructions. A relevant factor is register/medium (spoken or written), among other significant interactions of factors. Furthermore, the fact that frequency is consistent with other relevant factors, together with certain diachronic facts of Korean, supports the claim that frequency of use plays an important role in linguistic changes. Another finding is that, notwithstanding noticeable differences between spoken and written language, the factors influencing the use of the two negative constructions in Korean are largely similar in the spoken and written registers.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2019-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2016-0021","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46783539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prototype-driven alternations: The case of German weak nouns","authors":"R. Schäfer","doi":"10.1515/cllt-2015-0051","DOIUrl":"https://doi.org/10.1515/cllt-2015-0051","url":null,"abstract":"Abstract Over the past years, multifactorial corpus-based explorations of alternations in grammar have become an accepted major tool in cognitively oriented corpus linguistics. For example, prototype theory as a theory of similarity-based and inherently probabilistic linguistic categorization has received support from studies showing that alternating constructions and items often occur with probabilities influenced by prototypical formal, semantic or contextual factors. In this paper, I analyze a low-frequency alternation effect in German noun inflection in terms of prototype theory, based on strong hypotheses from the existing literature that I integrate into an established theoretical framework of usage-based probabilistic morphology, which allows us to account for similarity effects even in seemingly regular areas of the grammar. Specifically, the so-called weak masculine nouns in German, which follow an unusual pattern of case marking and often have characteristic lexical properties, sporadically occur in forms of the dominant strong masculine nouns. Using data from the nine-billion-token DECOW12A web corpus of contemporary German, I demonstrate that the probability of the alternation is influenced by the presence or absence of semantic, phonotactic, and paradigmatic features. Token frequency is also shown to have an effect on the alternation, in line with common assumptions about the relation between frequency and entrenchment. I use a version of prototype theory with weighted features and polycentric categories, but I also discuss the question of whether such corpus data can be taken as strong evidence for or against specific models of cognitive representation (prototypes vs. exemplars).","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2019-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2015-0051","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48125671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Against statistical significance testing in corpus linguistics","authors":"Alexander Koplenig","doi":"10.1515/cllt-2016-0036","DOIUrl":"https://doi.org/10.1515/cllt-2016-0036","url":null,"abstract":"Abstract In the first volume of Corpus Linguistics and Linguistic Theory, Gries (2005. Null-hypothesis significance testing of word frequencies: A follow-up on Kilgarriff. Corpus Linguistics and Linguistic Theory 1(2). doi:10.1515/cllt.2005.1.2.277. http://www.degruyter.com/view/j/cllt.2005.1.issue-2/cllt.2005.1.2.277/cllt.2005.1.2.277.xml: 285) asked whether corpus linguists should abandon null-hypothesis significance testing. In this paper, I want to revive this discussion by defending the argument that the assumptions that allow inferences about a given population – in this case about the studied languages – based on results observed in a sample – in this case a collection of naturally occurring language data – are not fulfilled. As a consequence, corpus linguists should indeed abandon null-hypothesis significance testing.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2019-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2016-0036","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41582550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Definite article bridging relations in L2: A learner corpus study","authors":"P. Crosthwaite","doi":"10.1515/cllt-2015-0058","DOIUrl":"https://doi.org/10.1515/cllt-2015-0058","url":null,"abstract":"Abstract Bridging relations are used when the identity of a discourse-new entity can be inferred via lexical relations from an antecedent (e. g. a cake … the slice) or non-lexically via reference to world knowledge or discourse structure (e. g. a war … the survivors). Such relations are marked in English via the definite article, which is considered a difficult feature of the English language for L2 learners to acquire, particularly for L1 speakers of article-less languages. This paper provides an Integrated Contrastive Model (e. g. Granger 1996) of the L1 and L2 production of definite article bridging relations using L2 English learner corpus data produced by native Mandarin and Korean speakers at four L2 proficiency levels, alongside comparative native English data. The data is taken from the International Corpus Network of Asian Learners of English (ICNALE, Ishikawa 2011, 2013), totalling just under 400,000 words with over 1500 bridging NPs identified. Results suggest subtle but significant differences between L1-L2 and L2-L2 groupings in terms of the frequency of particular bridging relation types and lemmatised wordings identified in the data, although there was little evidence of pseudo-longitudinal development. Such differences may suggest an effect of L1-L2 linguistic relativity, influencing the selection of relational links between given/new discourse entities during L2 production.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2019-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2015-0058","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45865903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Grammatical construction of function words between old and modern written Arabic: A corpus-based analysis","authors":"Sultan Almujaiwel","doi":"10.1515/cllt-2016-0069","DOIUrl":"https://doi.org/10.1515/cllt-2016-0069","url":null,"abstract":"Abstract This paper argues that Arabic function words (FWs) vary in usage between old and modern Arabic, thus prompting an experimental investigation into their changeability. This investigation is carried out by testing classical Arabic (CA) in Arabic heritage language (AHL) texts – those labeled as archistratum – and the modern standard Arabic (MSA) of Arabic newspaper texts (ANT), each group of which contains randomly collected 5 million (M) word texts. The linguistic theory of the grammar of Arabic FWs is explained through the differences between CA and MSA, despite Arabic FW changes and the unlearnability and/or unusability of some FW constructions between in these two eras of Arabic usage. The dispersion/distribution of the construction grammar (CxG) of FWs and the number (n) of word attractions/repulsions between the two distinct eras is explored using the very latest and most sophisticated Arabic corpus processing tools, and Sketch Engine’s SkeEn gramrels operators. The analysis of a 5 M word corpus from each era of Arabic serves to prove the non-existence of rigorous Arabic CxG. The approach in this study adopts a technique which, by contrasting AHL with ANT, relies on analyzing the frequency distributions of FWs, the co-occurrences of FWs in a span of 2n-grams collocational patterning, and some cases of FW usage changes in terms of lexical cognition (FW grammatical relationships). The results show that the frequencies of FWs, in addition to the case studies, are not the same, and this implies that FWs and their associations with the main part of speech class in a fusion language like Arabic have grammatically changed in MSA. Their constructional changes are neglected in Arabic grammar.","PeriodicalId":45605,"journal":{"name":"Corpus Linguistics and Linguistic Theory","volume":null,"pages":null},"PeriodicalIF":1.6,"publicationDate":"2019-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1515/cllt-2016-0069","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46255286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}