Evaluating the Success of AI Tools in Supporting Student Performance in Mathematical Kangaroo Competition

IF 2.2 | Zone 3 (Engineering Technology) | JCR Q3 | Computer Science, Interdisciplinary Applications
Marina Svičević, Aleksandar Milenković, Nemanja Vučićević, Marko Stanković
Journal: Computer Applications in Engineering Education, vol. 33, no. 4
Published: 2025-07-09
DOI: 10.1002/cae.70063
URL: https://onlinelibrary.wiley.com/doi/10.1002/cae.70063
Citations: 0

Abstract

This study explores the potential of generative artificial intelligence (AI) tools in supporting students preparing for mathematical competitions, focusing on the Mathematical Kangaroo competition in the context of the Serbian-speaking region. The research analyzed tools such as ChatGPT-free, ChatGPT-paid, AI Math Solver, Math Mentor, and o1-preview, assessing their accuracy and efficiency in solving tasks of varying difficulty levels and domains (algebra, geometry, logic, and numbers), as well as different formats (text and image-based). Testing included tasks in both Serbian and English, allowing for the evaluation of language barriers in tool performance. The results indicate that tools perform better with text-based task formats, with o1-preview standing out for its exceptionally high accuracy in this format. All tools achieve the highest precision in numbers and algebra, while results are significantly lower in geometry and logic, highlighting challenges in processing visual information and logical reasoning. The conclusions of this study emphasize the importance of generative AI in improving mathematics education but highlight the need for further development of tools that can better handle visual tasks, support local languages, and be more specialized in solving mathematical problems in general.

Source journal: Computer Applications in Engineering Education (Engineering Technology - Engineering: Multidisciplinary)
CiteScore: 7.20
Self-citation rate: 10.30%
Articles published: 100
Review time: 6-12 weeks
Journal description: Computer Applications in Engineering Education provides a forum for publishing peer-reviewed timely information on the innovative uses of computers, Internet, and software tools in engineering education. Besides new courses and software tools, the CAE journal covers areas that support the integration of technology-based modules in the engineering curriculum and promotes discussion of the assessment and dissemination issues associated with these new implementation methods.