使用R使用回归模型计数结果教程。

Q2 Social Sciences

Practical Assessment, Research and Evaluation Pub Date : 2016-02-01 DOI:10.7275/PJ8C-H254

A Alexander Beaujean, G. Morgan

{"title":"使用R使用回归模型计数结果教程。","authors":"A Alexander Beaujean, G. Morgan","doi":"10.7275/PJ8C-H254","DOIUrl":null,"url":null,"abstract":"Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares) either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix. Count variables such as number of times a student reached a goal, discipline referrals, and absences are ubiquitous in school settings. After a review of published single-case design studies Shadish and Sullivan (2011) recently concluded that nearly all outcome variables were some form of a count. Yet, most analyses they reviewed used traditional data analysis methods designed for normally-distributed continuous data.","PeriodicalId":20361,"journal":{"name":"Practical Assessment, Research and Evaluation","volume":"37 1","pages":"1-19"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"67","resultStr":"{\"title\":\"Tutorial on Using Regression Models with Count Outcomes Using R.\",\"authors\":\"A Alexander Beaujean, G. Morgan\",\"doi\":\"10.7275/PJ8C-H254\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares) either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix. Count variables such as number of times a student reached a goal, discipline referrals, and absences are ubiquitous in school settings. After a review of published single-case design studies Shadish and Sullivan (2011) recently concluded that nearly all outcome variables were some form of a count. Yet, most analyses they reviewed used traditional data analysis methods designed for normally-distributed continuous data.\",\"PeriodicalId\":20361,\"journal\":{\"name\":\"Practical Assessment, Research and Evaluation\",\"volume\":\"37 1\",\"pages\":\"1-19\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"67\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Practical Assessment, Research and Evaluation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.7275/PJ8C-H254\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Practical Assessment, Research and Evaluation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7275/PJ8C-H254","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 67

摘要

教育研究人员经常研究计数变量，如学生达到目标的次数，纪律推荐和缺勤。大多数研究这些变量的研究人员使用典型的回归方法(即普通最小二乘)，要么转换计数变量，要么不转换计数变量。在任何一种情况下，对计数数据使用典型回归都可能产生有偏差的参数估计，从而减少从这些数据得出的任何推论。由于在培训项目中很少教授计数变量回归模型，我们提出了一个教程来帮助教育研究者在他们自己的研究中使用这些方法。我们演示了使用泊松、负二项、零膨胀泊松和零膨胀负二项回归模型分析和解释计数数据。以学生逃课次数为例，介绍了计数回归方法。本示例的数据可以免费获得，附录中包含了运行示例分析所使用的R语法。计算变量，如学生达到目标的次数，纪律推荐和缺勤在学校环境中无处不在。在回顾了已发表的单例设计研究后，Shadish和Sullivan(2011)最近得出结论，几乎所有的结果变量都是某种形式的计数。然而，他们回顾的大多数分析都使用了传统的数据分析方法，这些方法是为正态分布的连续数据设计的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Tutorial on Using Regression Models with Count Outcomes Using R.

Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares) either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix. Count variables such as number of times a student reached a goal, discipline referrals, and absences are ubiquitous in school settings. After a review of published single-case design studies Shadish and Sullivan (2011) recently concluded that nearly all outcome variables were some form of a count. Yet, most analyses they reviewed used traditional data analysis methods designed for normally-distributed continuous data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Practical Assessment, Research and Evaluation Social Sciences-Education

CiteScore

2.60

自引率

0.00%

发文量