{"title":"Data science tools in the analysis of developing inquiry skills and computational thinking within the “IT Academy” Project","authors":"J. Hanč, M. Hančová, V. Jurková, D. Sveda","doi":"10.1109/ICETA51985.2020.9379266","DOIUrl":null,"url":null,"abstract":"We present our ideas how to apply modern data science technology and methodology to effectively prepare and statistically analyze large educational datasets which in our case map inquiry skills and computational thinking developed by students in primary and secondary schools at the Slovak national scale, within the project “IT Academy - Education for the 21st Century”. Combining the top two open-source data science tools, Python (within Jupyter notebooks) and R (within R studio software), we illustrate some of results from data preprocessing (cleaning, wrangling) for the diagnostic primary-school test of inquiry skills where Python tools (e.g. Pandas library) became more advantageous. As for the subsequent intensive statistical analysis, R environment was more suitable. We demonstrate summary results of the statistical analysis of the computer thinking diagnostic test for primary and secondary schools, finally cross-checked in SPSS software. Due to the current COVID-19 situation, we are still collecting data from ITA project impacts for which we finally show how we plan to implement further methods for data collecting, analysis and implementation in collaboration with our colleagues - education researchers.","PeriodicalId":149716,"journal":{"name":"2020 18th International Conference on Emerging eLearning Technologies and Applications (ICETA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 18th International Conference on Emerging eLearning Technologies and Applications (ICETA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETA51985.2020.9379266","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We present our ideas how to apply modern data science technology and methodology to effectively prepare and statistically analyze large educational datasets which in our case map inquiry skills and computational thinking developed by students in primary and secondary schools at the Slovak national scale, within the project “IT Academy - Education for the 21st Century”. Combining the top two open-source data science tools, Python (within Jupyter notebooks) and R (within R studio software), we illustrate some of results from data preprocessing (cleaning, wrangling) for the diagnostic primary-school test of inquiry skills where Python tools (e.g. Pandas library) became more advantageous. As for the subsequent intensive statistical analysis, R environment was more suitable. We demonstrate summary results of the statistical analysis of the computer thinking diagnostic test for primary and secondary schools, finally cross-checked in SPSS software. Due to the current COVID-19 situation, we are still collecting data from ITA project impacts for which we finally show how we plan to implement further methods for data collecting, analysis and implementation in collaboration with our colleagues - education researchers.