{"title":"Evaluating Reader Comprehension of Plan-Based Stories Containing Failed Actions","authors":"Rushit Sanghrajka, R. Young","doi":"10.1609/aiide.v18i1.21962","DOIUrl":null,"url":null,"abstract":"A growing number of algorithms for story planning include the ability to create stories with failed actions -- in particular failed actions that occur because of the mistaken beliefs of the characters attempting them. To date, most of these systems have been evaluated analytically, primarily by comparing their expressive range to prior story generation systems. Empirical evaluation of these systems has been preliminary. In this paper, we outline a general comprehension-based approach to the evaluation of plan-based story generation. We describe how we specialize it for use evaluating story plans containing failed actions, and we describe the design and results of an experiment using this approach to evaluate plot lines produced by HeadSpace, a system that models the beliefs of characters and uses that model to generate plot lines containing actions that are attempted but that fail.","PeriodicalId":92576,"journal":{"name":"Proceedings. AAAI Artificial Intelligence and Interactive Digital Entertainment Conference","volume":"93 1","pages":"179-188"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. AAAI Artificial Intelligence and Interactive Digital Entertainment Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1609/aiide.v18i1.21962","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A growing number of algorithms for story planning include the ability to create stories with failed actions -- in particular failed actions that occur because of the mistaken beliefs of the characters attempting them. To date, most of these systems have been evaluated analytically, primarily by comparing their expressive range to prior story generation systems. Empirical evaluation of these systems has been preliminary. In this paper, we outline a general comprehension-based approach to the evaluation of plan-based story generation. We describe how we specialize it for use evaluating story plans containing failed actions, and we describe the design and results of an experiment using this approach to evaluate plot lines produced by HeadSpace, a system that models the beliefs of characters and uses that model to generate plot lines containing actions that are attempted but that fail.