Solving Math Word Problems concerning Systems of Equations with GPT-3

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence Pub Date : 2023-06-26 DOI:10.1609/aaai.v37i13.26896

M. Zong, Bhaskar Krishnamachari

{"title":"Solving Math Word Problems concerning Systems of Equations with GPT-3","authors":"M. Zong, Bhaskar Krishnamachari","doi":"10.1609/aaai.v37i13.26896","DOIUrl":null,"url":null,"abstract":"Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One challenging set of tasks for school students is learning to solve math word problems. We explore how recent advances in natural language processing, specifically the rise of powerful transformer based models, can be applied to help math learners with such problems. Concretely, we evaluate the use of GPT-3, a 1.75B parameter transformer model recently released by OpenAI, for three related challenges pertaining to math word problems corresponding to systems of two linear equations. The three challenges are classifying word problems, extracting equations from word problems, and generating word problems. For the first challenge, we define a set of problem classes and find that GPT-3 has generally very high accuracy in classifying word problems (80%-100%), for all but one of these classes. For the second challenge, we find the accuracy for extracting equations improves with number of examples provided to the model, ranging from an accuracy of 31% for zero-shot learning to about 69% using 3-shot learning, which is further improved to a high value of 80% with fine-tuning. For the third challenge, we find that GPT-3 is able to generate problems with accuracy ranging from 33% to 93%, depending on the problem type.","PeriodicalId":74506,"journal":{"name":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","volume":"2 1","pages":"15972-15979"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/aaai.v37i13.26896","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 17

Abstract

Researchers have been interested in developing AI tools to help students learn various mathematical subjects. One challenging set of tasks for school students is learning to solve math word problems. We explore how recent advances in natural language processing, specifically the rise of powerful transformer based models, can be applied to help math learners with such problems. Concretely, we evaluate the use of GPT-3, a 1.75B parameter transformer model recently released by OpenAI, for three related challenges pertaining to math word problems corresponding to systems of two linear equations. The three challenges are classifying word problems, extracting equations from word problems, and generating word problems. For the first challenge, we define a set of problem classes and find that GPT-3 has generally very high accuracy in classifying word problems (80%-100%), for all but one of these classes. For the second challenge, we find the accuracy for extracting equations improves with number of examples provided to the model, ranging from an accuracy of 31% for zero-shot learning to about 69% using 3-shot learning, which is further improved to a high value of 80% with fine-tuning. For the third challenge, we find that GPT-3 is able to generate problems with accuracy ranging from 33% to 93%, depending on the problem type.

查看原文本刊更多论文

用GPT-3求解方程组数学应用题

研究人员一直对开发人工智能工具来帮助学生学习各种数学科目感兴趣。对于学生来说，一组具有挑战性的任务是学习解决数学单词问题。我们探讨了自然语言处理的最新进展，特别是基于强大的变压器模型的兴起，如何应用于帮助数学学习者解决这些问题。具体而言，我们评估了GPT-3 (OpenAI最近发布的1.75B参数转换器模型)在三个相关挑战中的使用，这些挑战与两个线性方程组对应的数学单词问题有关。这三个挑战是对单词问题进行分类，从单词问题中提取方程，以及生成单词问题。对于第一个挑战，我们定义了一组问题类别，并发现GPT-3在分类单词问题方面通常具有非常高的准确率(80%-100%)，除了这些类别中的一个。对于第二个挑战，我们发现提取方程的准确性随着提供给模型的样本数量的增加而提高，从零次学习的31%的准确率到使用3次学习的约69%的准确率，通过微调进一步提高到80%的高值。对于第三个挑战，我们发现GPT-3能够生成准确率在33%到93%之间的问题，具体取决于问题类型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

自引率

0.00%

发文量