{"title":"音乐教师对人工智能(AI)和人类教案的标注准确性和质量评级","authors":"Patrick K Cooper","doi":"10.1177/02557614241249163","DOIUrl":null,"url":null,"abstract":"This study explored the potential of artificial intelligence (ChatGPT) to generate lesson plans for music classes that were indistinguishable from music lesson plans created by humans, with current music teachers as assessors. Fifty-six assessors made a total of 410 ratings across eight lesson plans, assigning a quality score to each lesson plan and labeling if they believed each lesson plan was created by a human or generated by AI. Despite the human-made lesson plans being rated higher in quality as a group ( p < .01, d = 0.44), assessors were unable to accurately label if a lesson plan was created by a human or generated by AI (55% accurate overall). Labeling accuracy was positively predicted by quality scores on human-made lesson plans and previous personal use of AI, while accuracy was negatively predicted by quality scores on AI-generated lesson plans and perception of how useful AI will be in the future. Open-ended responses from 42 teachers suggested assessors used three factors when making evaluations: specific details, evidence of classroom knowledge, and wording. Implications provide suggestions for how music teachers can use prompt engineering with a GPT model to create a virtual assistant or Intelligent Tutor System (ITS) for their classroom.","PeriodicalId":46623,"journal":{"name":"International Journal of Music Education","volume":"50 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Music teachers’ labeling accuracy and quality ratings of lesson plans by artificial intelligence (AI) and humans\",\"authors\":\"Patrick K Cooper\",\"doi\":\"10.1177/02557614241249163\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study explored the potential of artificial intelligence (ChatGPT) to generate lesson plans for music classes that were indistinguishable from music lesson plans created by humans, with current music teachers as assessors. Fifty-six assessors made a total of 410 ratings across eight lesson plans, assigning a quality score to each lesson plan and labeling if they believed each lesson plan was created by a human or generated by AI. Despite the human-made lesson plans being rated higher in quality as a group ( p < .01, d = 0.44), assessors were unable to accurately label if a lesson plan was created by a human or generated by AI (55% accurate overall). Labeling accuracy was positively predicted by quality scores on human-made lesson plans and previous personal use of AI, while accuracy was negatively predicted by quality scores on AI-generated lesson plans and perception of how useful AI will be in the future. Open-ended responses from 42 teachers suggested assessors used three factors when making evaluations: specific details, evidence of classroom knowledge, and wording. Implications provide suggestions for how music teachers can use prompt engineering with a GPT model to create a virtual assistant or Intelligent Tutor System (ITS) for their classroom.\",\"PeriodicalId\":46623,\"journal\":{\"name\":\"International Journal of Music Education\",\"volume\":\"50 1\",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Music Education\",\"FirstCategoryId\":\"95\",\"ListUrlMain\":\"https://doi.org/10.1177/02557614241249163\",\"RegionNum\":3,\"RegionCategory\":\"教育学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"EDUCATION & EDUCATIONAL RESEARCH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Music Education","FirstCategoryId":"95","ListUrlMain":"https://doi.org/10.1177/02557614241249163","RegionNum":3,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"EDUCATION & EDUCATIONAL RESEARCH","Score":null,"Total":0}
Music teachers’ labeling accuracy and quality ratings of lesson plans by artificial intelligence (AI) and humans
This study explored the potential of artificial intelligence (ChatGPT) to generate lesson plans for music classes that were indistinguishable from music lesson plans created by humans, with current music teachers as assessors. Fifty-six assessors made a total of 410 ratings across eight lesson plans, assigning a quality score to each lesson plan and labeling if they believed each lesson plan was created by a human or generated by AI. Despite the human-made lesson plans being rated higher in quality as a group ( p < .01, d = 0.44), assessors were unable to accurately label if a lesson plan was created by a human or generated by AI (55% accurate overall). Labeling accuracy was positively predicted by quality scores on human-made lesson plans and previous personal use of AI, while accuracy was negatively predicted by quality scores on AI-generated lesson plans and perception of how useful AI will be in the future. Open-ended responses from 42 teachers suggested assessors used three factors when making evaluations: specific details, evidence of classroom knowledge, and wording. Implications provide suggestions for how music teachers can use prompt engineering with a GPT model to create a virtual assistant or Intelligent Tutor System (ITS) for their classroom.
期刊介绍:
The International Journal of Music Education (IJME) is a peer-reviewed journal published by the International Society for Music Education (ISME) four times a year. Manuscripts published are scholarly works, representing empirical research in a variety of modalities. They enhance knowledge regarding the teaching and learning of music with a special interest toward an international constituency. Manuscripts report results of quantitative or qualitative research studies, summarize bodies or research, present theories, models, or philosophical positions, etc. Papers show relevance to advancing the practice of music teaching and learning at all age levels with issues of direct concern to the classroom or studio, in school and out, private and group instruction. All manuscripts should contain evidence of a scholarly approach and be situated within the current literature. Implications for learning and teaching of music should be clearly stated, relevant, contemporary, and of interest to an international readership.