{"title":"Comparison of beginning R students' perceptions of peer-made plots created in two plotting systems: a randomized experiment.","authors":"Leslie Myint, Aboozar Hadavand, Leah Jager, Jeffrey Leek","doi":"10.1080/10691898.2019.1695554","DOIUrl":null,"url":null,"abstract":"<p><p>We performed an empirical study of the perceived quality of scientific graphics produced by beginning R users in two plotting systems: the base graphics package (\"base R\") and the ggplot2 add-on package. In our experiment, students taking a data science course on the Coursera platform were randomized to complete identical plotting exercises using either base R or ggplot2. This exercise involved creating two plots: one bivariate scatterplot and one plot of a multivariate relationship that necessitated using color or panels. Students evaluated their peers on visual characteristics key to clear scientific communication, including plot clarity and sufficient labeling. We observed that graphics created with the two systems rated similarly on many characteristics. However, ggplot2 graphics were generally perceived by students to be slightly more clear overall with respect to presentation of a scientific relationship. This increase was more pronounced for the multivariate relationship. Through expert analysis of submissions, we also find that certain concrete plot features (e.g., trend lines, axis labels, legends, panels, and color) tend to be used more commonly in one system than the other. These observations may help educators emphasize the use of certain plot features targeted to correct common student mistakes.</p>","PeriodicalId":45775,"journal":{"name":"Journal of Statistics Education","volume":"28 1","pages":"98-108"},"PeriodicalIF":2.2000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10691898.2019.1695554","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Statistics Education","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10691898.2019.1695554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2019/12/23 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 5
Abstract
We performed an empirical study of the perceived quality of scientific graphics produced by beginning R users in two plotting systems: the base graphics package ("base R") and the ggplot2 add-on package. In our experiment, students taking a data science course on the Coursera platform were randomized to complete identical plotting exercises using either base R or ggplot2. This exercise involved creating two plots: one bivariate scatterplot and one plot of a multivariate relationship that necessitated using color or panels. Students evaluated their peers on visual characteristics key to clear scientific communication, including plot clarity and sufficient labeling. We observed that graphics created with the two systems rated similarly on many characteristics. However, ggplot2 graphics were generally perceived by students to be slightly more clear overall with respect to presentation of a scientific relationship. This increase was more pronounced for the multivariate relationship. Through expert analysis of submissions, we also find that certain concrete plot features (e.g., trend lines, axis labels, legends, panels, and color) tend to be used more commonly in one system than the other. These observations may help educators emphasize the use of certain plot features targeted to correct common student mistakes.
期刊介绍:
The "Datasets and Stories" department of the Journal of Statistics Education provides a forum for exchanging interesting datasets and discussing ways they can be used effectively in teaching statistics. This section of JSE is described fully in the article "Datasets and Stories: Introduction and Guidelines" by Robin H. Lock and Tim Arnold (1993). The Journal of Statistics Education maintains a Data Archive that contains the datasets described in "Datasets and Stories" articles, as well as additional datasets useful to statistics teachers. Lock and Arnold (1993) describe several criteria that will be considered before datasets are placed in the JSE Data Archive.