常识推理的自合理化模型研究

IF 0.9 Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Stats Pub Date : 2023-08-29 DOI:10.3390/stats6030056

Fanny Rancourt, Paula Vondrlik, Diego Maupomé, Marie-Jean Meurs

{"title":"常识推理的自合理化模型研究","authors":"Fanny Rancourt, Paula Vondrlik, Diego Maupomé, Marie-Jean Meurs","doi":"10.3390/stats6030056","DOIUrl":null,"url":null,"abstract":"The rise of explainable natural language processing spurred a bulk of work on datasets augmented with human explanations, as well as technical approaches to leverage them. Notably, generative large language models offer new possibilities, as they can output a prediction as well as an explanation in natural language. This work investigates the capabilities of fine-tuned text-to-text transfer Transformer (T5) models for commonsense reasoning and explanation generation. Our experiments suggest that while self-rationalizing models achieve interesting results, a significant gap remains: classifiers consistently outperformed self-rationalizing models, and a substantial fraction of model-generated explanations are not valid. Furthermore, training with expressive free-text explanations substantially altered the inner representation of the model, suggesting that they supplied additional information and may bridge the knowledge gap. Our code is publicly available, and the experiments were run on open-access datasets, hence allowing full reproducibility.","PeriodicalId":93142,"journal":{"name":"Stats","volume":" ","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Investigating Self-Rationalizing Models for Commonsense Reasoning\",\"authors\":\"Fanny Rancourt, Paula Vondrlik, Diego Maupomé, Marie-Jean Meurs\",\"doi\":\"10.3390/stats6030056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rise of explainable natural language processing spurred a bulk of work on datasets augmented with human explanations, as well as technical approaches to leverage them. Notably, generative large language models offer new possibilities, as they can output a prediction as well as an explanation in natural language. This work investigates the capabilities of fine-tuned text-to-text transfer Transformer (T5) models for commonsense reasoning and explanation generation. Our experiments suggest that while self-rationalizing models achieve interesting results, a significant gap remains: classifiers consistently outperformed self-rationalizing models, and a substantial fraction of model-generated explanations are not valid. Furthermore, training with expressive free-text explanations substantially altered the inner representation of the model, suggesting that they supplied additional information and may bridge the knowledge gap. Our code is publicly available, and the experiments were run on open-access datasets, hence allowing full reproducibility.\",\"PeriodicalId\":93142,\"journal\":{\"name\":\"Stats\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2023-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stats\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/stats6030056\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stats","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/stats6030056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

可解释的自然语言处理的兴起促使人们在数据集上进行了大量的工作，这些数据集增加了人类的解释，以及利用这些解释的技术方法。值得注意的是，生成性大型语言模型提供了新的可能性，因为它们可以用自然语言输出预测和解释。这项工作研究了微调的文本到文本转换转换器（T5）模型用于常识推理和解释生成的能力。我们的实验表明，虽然自合理化模型取得了令人感兴趣的结果，但仍存在一个显著的差距：分类器始终优于自合理化模式，并且相当一部分模型生成的解释是无效的。此外，用富有表现力的自由文本解释进行训练大大改变了模型的内部表示，这表明它们提供了额外的信息，并可能弥合知识差距。我们的代码是公开的，实验是在开放访问的数据集上运行的，因此允许完全的再现性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Investigating Self-Rationalizing Models for Commonsense Reasoning

The rise of explainable natural language processing spurred a bulk of work on datasets augmented with human explanations, as well as technical approaches to leverage them. Notably, generative large language models offer new possibilities, as they can output a prediction as well as an explanation in natural language. This work investigates the capabilities of fine-tuned text-to-text transfer Transformer (T5) models for commonsense reasoning and explanation generation. Our experiments suggest that while self-rationalizing models achieve interesting results, a significant gap remains: classifiers consistently outperformed self-rationalizing models, and a substantial fraction of model-generated explanations are not valid. Furthermore, training with expressive free-text explanations substantially altered the inner representation of the model, suggesting that they supplied additional information and may bridge the knowledge gap. Our code is publicly available, and the experiments were run on open-access datasets, hence allowing full reproducibility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Stats

CiteScore

0.60

自引率

0.00%

发文量

审稿时长

7 weeks