大型语言模式在教育中的实践和伦理挑战：一项系统的范围界定综述

IF 8.1 1区教育学 Q1 EDUCATION & EDUCATIONAL RESEARCH

British Journal of Educational Technology Pub Date : 2023-08-06 DOI:10.1111/bjet.13370

Lixiang Yan, Lele Sha, Linxuan Zhao, Yuheng Li, Roberto Martinez-Maldonado, Guanliang Chen, Xinyu Li, Yueqiao Jin, Dragan Gašević

{"title":"大型语言模式在教育中的实践和伦理挑战：一项系统的范围界定综述","authors":"Lixiang Yan, Lele Sha, Linxuan Zhao, Yuheng Li, Roberto Martinez-Maldonado, Guanliang Chen, Xinyu Li, Yueqiao Jin, Dragan Gašević","doi":"10.1111/bjet.13370","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <p>Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs-based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer-reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state-of-the-art models (eg, GPT-3/4), embracing the initiative of open-sourcing models/systems, and adopting a human-centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models.</p>\n </section>\n \n <section>\n \n <div>\n \n <div>\n \n <h3>Practitioner notes</h3>\n <p>What is currently known about this topic\n\n </p><ul>\n \n <li>Generating and analysing text-based content are time-consuming and laborious tasks.</li>\n \n <li>Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks.</li>\n \n <li>Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring.</li>\n </ul>\n <p>What this paper adds\n\n </p><ul>\n \n <li>A comprehensive list of different educational tasks that could potentially benefit from LLMs-based innovations through automation.</li>\n \n <li>A structured assessment of the practicality and ethicality of existing LLMs-based innovations from seven important aspects using established frameworks.</li>\n \n <li>Three recommendations that could potentially support future studies to develop LLMs-based innovations that are practical and ethical to implement in authentic educational contexts.</li>\n </ul>\n <p>Implications for practice and/or policy\n\n </p><ul>\n \n <li>Updating existing innovations with state-of-the-art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks.</li>\n \n <li>The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved.</li>\n \n <li>Adopting a human-centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.</li>\n </ul>\n </div>\n </div>\n </section>\n </div>","PeriodicalId":48315,"journal":{"name":"British Journal of Educational Technology","volume":"55 1","pages":"90-112"},"PeriodicalIF":8.1000,"publicationDate":"2023-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bjet.13370","citationCount":"0","resultStr":"{\"title\":\"Practical and ethical challenges of large language models in education: A systematic scoping review\",\"authors\":\"Lixiang Yan, Lele Sha, Linxuan Zhao, Yuheng Li, Roberto Martinez-Maldonado, Guanliang Chen, Xinyu Li, Yueqiao Jin, Dragan Gašević\",\"doi\":\"10.1111/bjet.13370\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <p>Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs-based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer-reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state-of-the-art models (eg, GPT-3/4), embracing the initiative of open-sourcing models/systems, and adopting a human-centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models.</p>\\n </section>\\n \\n <section>\\n \\n <div>\\n \\n <div>\\n \\n <h3>Practitioner notes</h3>\\n <p>What is currently known about this topic\\n\\n </p><ul>\\n \\n <li>Generating and analysing text-based content are time-consuming and laborious tasks.</li>\\n \\n <li>Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks.</li>\\n \\n <li>Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring.</li>\\n </ul>\\n <p>What this paper adds\\n\\n </p><ul>\\n \\n <li>A comprehensive list of different educational tasks that could potentially benefit from LLMs-based innovations through automation.</li>\\n \\n <li>A structured assessment of the practicality and ethicality of existing LLMs-based innovations from seven important aspects using established frameworks.</li>\\n \\n <li>Three recommendations that could potentially support future studies to develop LLMs-based innovations that are practical and ethical to implement in authentic educational contexts.</li>\\n </ul>\\n <p>Implications for practice and/or policy\\n\\n </p><ul>\\n \\n <li>Updating existing innovations with state-of-the-art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks.</li>\\n \\n <li>The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved.</li>\\n \\n <li>Adopting a human-centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.</li>\\n </ul>\\n </div>\\n </div>\\n </section>\\n </div>\",\"PeriodicalId\":48315,\"journal\":{\"name\":\"British Journal of Educational Technology\",\"volume\":\"55 1\",\"pages\":\"90-112\"},\"PeriodicalIF\":8.1000,\"publicationDate\":\"2023-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bjet.13370\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"British Journal of Educational Technology\",\"FirstCategoryId\":\"95\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/bjet.13370\",\"RegionNum\":1,\"RegionCategory\":\"教育学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Educational Technology","FirstCategoryId":"95","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/bjet.13370","RegionNum":1,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}

引用次数: 0

摘要

利用大型语言模型（LLM）的教育技术创新已经显示出自动化生成和分析文本内容这一费力过程的潜力。虽然已经开发了各种创新来自动化一系列教育任务（例如，问题生成、反馈提供和论文评分），但人们对这些创新的实用性和道德性表示担忧。这些担忧可能会阻碍未来的研究和在真实的教育背景下采用基于LLM的创新。为了解决这一问题，我们对2017年以来发表的118篇同行评审论文进行了系统的范围界定审查，以确定使用LLM自动化和支持教育任务的研究现状。研究结果揭示了LLM在自动化教育任务中的53个用例，分为九大类：分析/标记、检测、评分、教学支持、预测、知识表示、反馈、内容生成和推荐。此外，我们还发现了一些实际和道德挑战，包括技术准备程度低、缺乏可复制性和透明度以及隐私和福利考虑不足。研究结果总结为未来研究的三项建议，包括用最先进的模型（如GPT-3/4）更新现有创新，采用开源模型/系统的举措，以及在整个开发过程中采用以人为本的方法。随着人工智能与教育的交叉点不断发展，这项研究的发现可以作为研究人员的重要参考点，使他们能够利用优势，从局限中学习，并发现ChatGPT和其他生成人工智能模型所带来的潜在研究机会。目前对该主题的了解生成和分析基于文本的内容是一项耗时且费力的任务。大型语言模型能够有效地分析前所未有的文本内容，并完成复杂的自然语言处理和生成任务。大型语言模型越来越多地被用于开发旨在自动生成和分析文本内容的教育技术，如自动生成问题和作文评分。本文添加了一份不同教育任务的综合列表，这些任务可能通过自动化从基于LLM的创新中受益。使用既定框架，从七个重要方面对现有基于LLM的创新的实用性和道德性进行结构化评估。三项建议可能支持未来的研究，以开发基于LLM的创新，这些创新在真实的教育环境中实施是实用和合乎道德的。对实践和/或政策的影响用最先进的模型更新现有创新可能会进一步减少使现有模型适应不同教育任务所需的人工工作量。旨在利用大型语言模型开发教育技术的实证研究的报告标准需要改进。在整个发展过程中采用以人为本的方法有助于解决大型语言模式在教育中的实践和道德挑战。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Practical and ethical challenges of large language models in education: A systematic scoping review

查看原文本刊更多论文

Practical and ethical challenges of large language models in education: A systematic scoping review

Educational technology innovations leveraging large language models (LLMs) have shown the potential to automate the laborious process of generating and analysing textual content. While various innovations have been developed to automate a range of educational tasks (eg, question generation, feedback provision, and essay grading), there are concerns regarding the practicality and ethicality of these innovations. Such concerns may hinder future research and the adoption of LLMs-based innovations in authentic educational contexts. To address this, we conducted a systematic scoping review of 118 peer-reviewed papers published since 2017 to pinpoint the current state of research on using LLMs to automate and support educational tasks. The findings revealed 53 use cases for LLMs in automating education tasks, categorised into nine main categories: profiling/labelling, detection, grading, teaching support, prediction, knowledge representation, feedback, content generation, and recommendation. Additionally, we also identified several practical and ethical challenges, including low technological readiness, lack of replicability and transparency and insufficient privacy and beneficence considerations. The findings were summarised into three recommendations for future studies, including updating existing innovations with state-of-the-art models (eg, GPT-3/4), embracing the initiative of open-sourcing models/systems, and adopting a human-centred approach throughout the developmental process. As the intersection of AI and education is continuously evolving, the findings of this study can serve as an essential reference point for researchers, allowing them to leverage the strengths, learn from the limitations, and uncover potential research opportunities enabled by ChatGPT and other generative AI models.

Practitioner notes

What is currently known about this topic

Generating and analysing text-based content are time-consuming and laborious tasks.
Large language models are capable of efficiently analysing an unprecedented amount of textual content and completing complex natural language processing and generation tasks.
Large language models have been increasingly used to develop educational technologies that aim to automate the generation and analysis of textual content, such as automated question generation and essay scoring.

What this paper adds

A comprehensive list of different educational tasks that could potentially benefit from LLMs-based innovations through automation.
A structured assessment of the practicality and ethicality of existing LLMs-based innovations from seven important aspects using established frameworks.
Three recommendations that could potentially support future studies to develop LLMs-based innovations that are practical and ethical to implement in authentic educational contexts.

Implications for practice and/or policy

Updating existing innovations with state-of-the-art models may further reduce the amount of manual effort required for adapting existing models to different educational tasks.
The reporting standards of empirical research that aims to develop educational technologies using large language models need to be improved.
Adopting a human-centred approach throughout the developmental process could contribute to resolving the practical and ethical challenges of large language models in education.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

British Journal of Educational Technology EDUCATION & EDUCATIONAL RESEARCH-

CiteScore

15.60

自引率

4.50%

发文量

111

期刊介绍： BJET is a primary source for academics and professionals in the fields of digital educational and training technology throughout the world. The Journal is published by Wiley on behalf of The British Educational Research Association (BERA). It publishes theoretical perspectives, methodological developments and high quality empirical research that demonstrate whether and how applications of instructional/educational technology systems, networks, tools and resources lead to improvements in formal and non-formal education at all levels, from early years through to higher, technical and vocational education, professional development and corporate training.