归纳定理证明者的数学基准

Logic Programming and Automated Reasoning Pub Date : 2023-04-06 DOI:10.48550/arXiv.2304.02986

Thibault Gauthier, C. Brown, Mikoláš Janota, J. Urban

{"title":"归纳定理证明者的数学基准","authors":"Thibault Gauthier, C. Brown, Mikoláš Janota, J. Urban","doi":"10.48550/arXiv.2304.02986","DOIUrl":null,"url":null,"abstract":"We present a benchmark of 29687 problems derived from the On-Line Encyclopedia of Integer Sequences (OEIS). Each problem expresses the equivalence of two syntactically different programs generating the same OEIS sequence. Such programs were conjectured by a learning-guided synthesis system using a language with looping operators. The operators implement recursion, and thus many of the proofs require induction on natural numbers. The benchmark contains problems of varying difficulty from a wide area of mathematical domains. We believe that these characteristics will make it an effective judge for the progress of inductive theorem provers in this domain for years to come.","PeriodicalId":207621,"journal":{"name":"Logic Programming and Automated Reasoning","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Mathematical Benchmark for Inductive Theorem Provers\",\"authors\":\"Thibault Gauthier, C. Brown, Mikoláš Janota, J. Urban\",\"doi\":\"10.48550/arXiv.2304.02986\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present a benchmark of 29687 problems derived from the On-Line Encyclopedia of Integer Sequences (OEIS). Each problem expresses the equivalence of two syntactically different programs generating the same OEIS sequence. Such programs were conjectured by a learning-guided synthesis system using a language with looping operators. The operators implement recursion, and thus many of the proofs require induction on natural numbers. The benchmark contains problems of varying difficulty from a wide area of mathematical domains. We believe that these characteristics will make it an effective judge for the progress of inductive theorem provers in this domain for years to come.\",\"PeriodicalId\":207621,\"journal\":{\"name\":\"Logic Programming and Automated Reasoning\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Logic Programming and Automated Reasoning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2304.02986\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Logic Programming and Automated Reasoning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2304.02986","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

我们提出了一个基于整数序列在线百科全书(OEIS)的29687个问题的基准。每个问题都表示生成相同OEIS序列的两个语法不同的程序的等价性。这样的程序是通过使用带有循环操作符的语言的学习引导综合系统推测出来的。运算符实现递归，因此许多证明需要对自然数进行归纳法。该基准包含来自广泛数学领域的不同难度的问题。我们相信，这些特征将使它成为未来几年归纳定理证明者在这一领域进展的有效判断。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Mathematical Benchmark for Inductive Theorem Provers

We present a benchmark of 29687 problems derived from the On-Line Encyclopedia of Integer Sequences (OEIS). Each problem expresses the equivalence of two syntactically different programs generating the same OEIS sequence. Such programs were conjectured by a learning-guided synthesis system using a language with looping operators. The operators implement recursion, and thus many of the proofs require induction on natural numbers. The benchmark contains problems of varying difficulty from a wide area of mathematical domains. We believe that these characteristics will make it an effective judge for the progress of inductive theorem provers in this domain for years to come.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Logic Programming and Automated Reasoning

自引率

0.00%

发文量