Text Mining Analyses of Programming Education Articles Since the 1970s
Takahisa Furuta, Gerald Knezek
Information and Technology in Education and Learning, 2023
DOI: https://doi.org/10.12937/itel.3.1.reg.p001
Abstract
To assess the extent to which text-mining techniques can yield insights into a particular topic area, we apply hierarchical word clustering and the Term Frequency-Inverse Document Frequency (TF-IDF) measure to articles on computer programming published since the 1970s, a period whose research articles on teaching programming are now readily available as PDF files. Study 1 compares two sets of papers, published before and after the concept of Computational Thinking was introduced in 2006, to highlight the changes between them. To ensure the quality of the sample, the target articles were selected from those mentioned in frequently cited review papers. Study 2 extends the sample pool to a broader range of papers published after the 1970s, allowing us to examine the stability of the conceptual structures identified in Study 1. In both studies, the obtained word clusters, or concepts, align with known research trends in the programming-education literature. The significance and potential of text-mining techniques are also discussed.
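The two techniques named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The abstract does not specify tokenization, the TF-IDF variant, the distance measure, or the linkage rule, so everything below is an assumption: the toy corpus is invented, TF is a count normalized by document length, IDF is log(N/df), and words are clustered by average linkage over cosine distance between their per-document count vectors.

```python
import math
from collections import Counter

# Hypothetical toy corpus: each "article" is already tokenized.
docs = [
    ["logo", "children", "logo", "programming"],
    ["logo", "children", "programming"],
    ["scratch", "computational", "thinking", "programming"],
    ["computational", "thinking", "scratch", "programming"],
]

def tf_idf(docs):
    """Return one {term: weight} dict per document (assumed variant:
    tf = count / doc length, idf = log(N / document frequency))."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

def cosine_distance(u, v):
    """1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm if norm else 1.0

def cluster_words(docs, n_clusters):
    """Agglomerative (bottom-up) word clustering with average linkage."""
    vocab = sorted({term for doc in docs for term in doc})
    # Represent each word by its raw count in every document.
    vectors = {w: [doc.count(w) for doc in docs] for w in vocab}
    clusters = [[w] for w in vocab]

    def linkage(a, b):
        # Mean pairwise distance between the members of two clusters.
        total = sum(cosine_distance(vectors[x], vectors[y]) for x in a for y in b)
        return total / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Merge the closest pair of clusters.
        pairs = ((i, j) for i in range(len(clusters))
                 for j in range(i + 1, len(clusters)))
        i, j = min(pairs, key=lambda p: linkage(clusters[p[0]], clusters[p[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

weights = tf_idf(docs)
word_clusters = cluster_words(docs, n_clusters=3)
```

On this toy corpus, "programming" receives a TF-IDF weight of zero because it occurs in every document, while words that co-occur in the same documents (for example "computational", "thinking", and "scratch") end up in the same cluster. A real analysis would need stop-word removal, a much larger vocabulary, and a library implementation of hierarchical clustering in place of this quadratic loop.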