High school students building babyGPTs: Engaging in data practices and addressing ethical issues through the construction of generative language models
Luis Morales-Navarro, Daniel J. Noh, Yasmin B. Kafai
International Journal of Child-Computer Interaction, Volume 45, Article 100769. Published 2025-08-21.
DOI: 10.1016/j.ijcci.2025.100769
https://www.sciencedirect.com/science/article/pii/S2212868925000509
Abstract
As generative language models have gained popularity, high school students are increasingly using them in their everyday lives. While most current research has focused on examining youth as productive users of generative language model-powered systems, far fewer efforts have focused on how to engage high school students as designers of these models to foster a better understanding of how these systems work. Building on the rich legacy of research that positions youth as designers of computing systems, we explore how to support high school students in designing very small-scale generative language models, which we call babyGPTs. Through an in-depth case study of three teenagers building a babyGPT screenplay generator, we illustrate how the team defined a design problem, developed a model, and reflected while engaging in AI/ML data practices and addressing ethical issues. This paper contributes a case study showing how students engage in data practices and ethical considerations in the construction of generative language models and outlines directions for future research on construction activities and tools to support youth in designing generative language models.