{"title":"Reproducible Data Science with Python: An Open Learning Resource","authors":"V. Danchev","doi":"10.21105/jose.00156","DOIUrl":"https://doi.org/10.21105/jose.00156","url":null,"abstract":"Summary This paper describes a computational learning resource on Reproducible Data Science with Python. The resource provides an accessible, hands-on introduction to data science techniques, skills, and workflows necessary to perform open, reproducible, and ethical data analysis. By using research problems of real-world relevance (such as vaccine hesitancy and the impact of COVID-19 lockdown measures on human mobility) and real-world social data (including anonymised mobility data from digital sources and recent COVID-19 survey data), the resource encourages students to use open-source tools and coding to learn from diverse and large social data sources. The learning resource aims to minimise barriers to entry for students from social sciences, public health, and related fields. With no software installation and setup requirements, students can start coding from their web browser using free and open-source software (FOSS), including the Python programming language, Jupyter notebook, and Markdown. Through real-world data applications, students are introduced to the open source Python ecosystem of libraries for data science—including pandas (McKinney, 2010), seaborn (Waskom, 2021), scikit-learn (Pedregosa et al., 2011), statsmodels (Seabold & Perk-told, 2010), and networkX (Hagberg et al., 2008)—and learn about open and reproducible workflow, data wrangling, data exploration and visualization, pattern discovery (e.g., clustering), prediction and machine learning, causal inference, network analysis, and data ethics.","PeriodicalId":75094,"journal":{"name":"The Journal of open source education","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41456634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Moodie, B. Carlson, B. Foreman, J. Kwang, K. Naito, J. Nittrouer
{"title":"SedEdu: software organizing sediment-related educational modules","authors":"A. Moodie, B. Carlson, B. Foreman, J. Kwang, K. Naito, J. Nittrouer","doi":"10.21105/jose.00129","DOIUrl":"https://doi.org/10.21105/jose.00129","url":null,"abstract":"1 Rice University (Houston, TX, USA) 2 University of Texas at Austin (Austin, TX, USA) 3 University of Colorado Boulder (Boulder, CO, USA) 4 Western Washington University (Bellingham, WA, USA) 5 University of Illinois at Urbana-Champaign (Urbana, IL, USA) 6 University of Massachusetts Amherst (Amherst, MA, USA) 7 Universidad de Ingeniería y Tecnología (Lima, Peru) 8 Texas Tech University (Lubbock, TX, USA)","PeriodicalId":75094,"journal":{"name":"The Journal of open source education","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41390686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abhijna Parigi, Marisa Lim, Saranya Canchi, Jose Sanchez, Jeremy Walter, Rayna M. Harris, Amanda L Charbonneau, C. Brown
{"title":"A virtual training module for introducing the use of Amazon Web Services","authors":"Abhijna Parigi, Marisa Lim, Saranya Canchi, Jose Sanchez, Jeremy Walter, Rayna M. Harris, Amanda L Charbonneau, C. Brown","doi":"10.21105/jose.00167","DOIUrl":"https://doi.org/10.21105/jose.00167","url":null,"abstract":"We present our lesson material and resources for introducing the use of Amazon Web Services (AWS, https://aws.amazon.com/) for cloud computation. This lesson was developed for the Common Fund Data Ecosystem (CFDE), an NIH initiative that aims to promote data re-use and cloud computing for biomedical research. The lesson materials, technology set-up instructions, and our instructional experiences can serve as a resource for prospective instructors who wish to re-use and remix our materials for teaching AWS and cloud computing.","PeriodicalId":75094,"journal":{"name":"The Journal of open source education","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43060772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Villano, K. Harris, Judit Bergfalk, Raphael Hatami, Francis Vititoe, Julia Johnston
{"title":"The Data Behind Dark Matter: Exploring Galactic\u0000Rotation","authors":"A. Villano, K. Harris, Judit Bergfalk, Raphael Hatami, Francis Vititoe, Julia Johnston","doi":"10.21105/jose.00184","DOIUrl":"https://doi.org/10.21105/jose.00184","url":null,"abstract":"Dark matter is estimated to make up ~84% of all normal/baryonic matter, but cannot be directly imaged. Despite the fact that dark matter cannot be directly observed yet, its influence on the motion of stars and gas in spiral galaxies have been detected. One way to show motion in galaxies are rotation curves that are plots of velocity measurements of how fast stars and gas move in a galaxy around the center of mass. According to Newton's Law of Gravitation, the rotational velocity is an indication of the amount of visible and non-visible mass in the galaxy. Given that the visible matter is measurable using photometry, dark matter mass can therefore be estimated, offering an insight into the size distribution in galaxies. In order to gain a greater appreciation of the research scientists' findings about dark matter, their method should be easily reproduced by any curious individual. Our interactive workshop is an excellent educational tool to investigate how dark matter impacts the rotation of visible matter by providing a guide to produce galactic rotation curves. The Python-based notebooks are set up to walk you through the whole process of producing rotation curves using an online database (SPARC) and to allow you to learn about each component of the galaxy. The three steps of the rotation curve building process is plotting the measured velocity data, constructing the rotation curves for each component, and fitting the total velocity to the measured values.","PeriodicalId":75094,"journal":{"name":"The Journal of open source education","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42617307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}