Marina Svičević, Aleksandar Milenković, Nemanja Vučićević, Marko Stanković
{"title":"Evaluating the Success of AI Tools in Supporting Student Performance in Mathematical Kangaroo Competition","authors":"Marina Svičević, Aleksandar Milenković, Nemanja Vučićević, Marko Stanković","doi":"10.1002/cae.70063","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This study explores the potential of generative artificial intelligence (AI) tools in supporting students preparing for mathematical competitions, focusing on the Mathematical Kangaroo competition in the context of the Serbian-speaking region. The research analyzed tools such as ChatGPT-free, ChatGPT-paid, AI Math Solver, Math Mentor, and o1-preview, assessing their accuracy and efficiency in solving tasks of varying difficulty levels and domains (algebra, geometry, logic, and numbers), as well as different formats (text and image-based). Testing included tasks in both Serbian and English, allowing for the evaluation of language barriers in tool performance. The results indicate that tools perform better with text-based task formats, with o1-preview standing out for its exceptionally high accuracy in this format. All tools achieve the highest precision in numbers and algebra, while results are significantly lower in geometry and logic, highlighting challenges in processing visual information and logical reasoning. The conclusions of this study emphasize the importance of generative AI in improving mathematics education but highlight the need for further development of tools that can better handle visual tasks, support local languages, and be more specialized in solving mathematical problems in general.</p>\n </div>","PeriodicalId":50643,"journal":{"name":"Computer Applications in Engineering Education","volume":"33 4","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Applications in Engineering Education","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cae.70063","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This study explores the potential of generative artificial intelligence (AI) tools in supporting students preparing for mathematical competitions, focusing on the Mathematical Kangaroo competition in the context of the Serbian-speaking region. The research analyzed tools such as ChatGPT-free, ChatGPT-paid, AI Math Solver, Math Mentor, and o1-preview, assessing their accuracy and efficiency in solving tasks of varying difficulty levels and domains (algebra, geometry, logic, and numbers), as well as different formats (text and image-based). Testing included tasks in both Serbian and English, allowing for the evaluation of language barriers in tool performance. The results indicate that tools perform better with text-based task formats, with o1-preview standing out for its exceptionally high accuracy in this format. All tools achieve the highest precision in numbers and algebra, while results are significantly lower in geometry and logic, highlighting challenges in processing visual information and logical reasoning. The conclusions of this study emphasize the importance of generative AI in improving mathematics education but highlight the need for further development of tools that can better handle visual tasks, support local languages, and be more specialized in solving mathematical problems in general.
期刊介绍:
Computer Applications in Engineering Education provides a forum for publishing peer-reviewed timely information on the innovative uses of computers, Internet, and software tools in engineering education. Besides new courses and software tools, the CAE journal covers areas that support the integration of technology-based modules in the engineering curriculum and promotes discussion of the assessment and dissemination issues associated with these new implementation methods.