On regression modeling in varieties research

IF 1.3 | Region 2 (Literature) | JCR 0 | LANGUAGE & LINGUISTICS

World Englishes | Pub Date: 2024-07-18 | DOI: 10.1111/weng.12694

Stefan Th. Gries


Citations: 0

Abstract


On regression modeling in varieties research

One particularly prominent methodological development in linguistics is what has been termed the “quantitative turn”: Not only are more and more studies using statistical tools to explore data and to test hypotheses, the complexity of the statistical methods employed is growing as well. This development is particularly prominent in all kinds of corpus‐linguistic studies: 20 years ago chi‐squared tests, t‐tests, and Pearson's r reigned supreme, but now more and more corpus studies are using multivariate exploratory tools and, for hypothesis testing, multifactorial predictive modeling techniques, in particular regression models (and, increasingly, tree‐based methods). However welcome this development is, it, and especially its pace as well as the fact that few places offer rigorous training in statistical methods, comes with its own risks, chief among them that analytical methods are misapplied, which can lead to imprecise, incomplete, or wrong analyses. In this paper, I will revisit a recent regression‐analytic study in the research area of English varieties (on clause‐final also and only in three Asian Englishes) to:

highlight in particular three fundamental yet frequent mistakes that it exemplifies;

discuss why and how each of these mistakes should be addressed;

reanalyze the data (as far as is possible with what is available) and show briefly how that affects the analysis's results and interpretation.


Source journal

World Englishes Multiple-

CiteScore

3.90

Self-citation rate

12.50%

Number of publications

About the journal: World Englishes is integrative in its scope and includes theoretical and applied studies on language, literature and English teaching, with emphasis on cross-cultural perspectives and identities. The journal provides recent research, critical and evaluative papers, and reviews from Africa, Asia, Europe, Oceania and the Americas. Thematic special issues and colloquia appear regularly. Special sections such as "Comments / Replies" and "Forum" promote open discussions and debate.