Colin M. Carmon, Andrew J. Hampton, Brent Morgan, Zhiqiang Cai, Lijia Wang, A. Graesser
Proceedings of the Sixth (2019) ACM Conference on Learning @ Scale, June 24, 2019. DOI: 10.1145/3330430.3333649
Semantic Matching Evaluation of User Responses to Electronics Questions in AutoTutor
Relatedness between user input and an ideal response is a salient feature required for the proper functioning of an Intelligent Tutoring System (ITS) that uses natural language processing. Improper assessment of text input causes maladaptation in ITSs, while meta-assessment of user responses can improve instructional efficacy and user satisfaction. Therefore, this paper evaluates the quality of semantic matching between user input and the expected response in AutoTutor, an ITS that holds a conversation with the user in natural language. AutoTutor's dialogue is driven by the AutoTutor Conversation Engine (ACE), which uses a combination of Latent Semantic Analysis (LSA) and Regular Expressions (RegEx) to assess user input. We assessed ACE via responses from 219 Amazon Mechanical Turk users, who answered 118 electronics questions broken into 5202 response pairings (n = 5202). These analyses explore the relationship between RegEx and LSA, agreement between the two human judges, and agreement between the human judges and ACE. Additionally, we calculated precision and recall. As expected, regular expressions and LSA had a moderate, positive relationship, and the agreement between ACE and the human judges was fair, but slightly lower than the agreement between the human judges themselves.
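The abstract describes ACE as combining RegEx pattern matching with LSA-based semantic similarity to judge whether a user's answer matches an expected response. As a rough illustration only, the sketch below combines a regular-expression check with a bag-of-words cosine similarity; the cosine over raw word counts is a simple stand-in for the LSA vector space ACE actually uses, and the OR-combination rule, the example expectation text, and the 0.5 threshold are hypothetical, not taken from the paper.

```python
import math
import re
from collections import Counter

def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity over bag-of-words vectors.

    A simple stand-in for the LSA vector space used by ACE; real LSA
    compares documents in a reduced singular-value space instead.
    """
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0

def assess_response(user_input: str, expectation: str, pattern: str,
                    threshold: float = 0.5) -> bool:
    """Hypothetical combination rule: accept the answer if the RegEx for
    the expectation matches OR the semantic similarity clears a threshold."""
    regex_hit = re.search(pattern, user_input, re.IGNORECASE) is not None
    return regex_hit or cosine_similarity(user_input, expectation) >= threshold

# Illustrative electronics expectation (invented for this sketch).
expectation = "current flows from the positive terminal to the negative terminal"
pattern = r"positive\s+terminal.*negative\s+terminal"

print(assess_response(
    "Current flows from the positive terminal to the negative terminal.",
    expectation, pattern))  # → True
print(assess_response("I don't know", expectation, pattern))  # → False
```

In this toy form, the regex catches answers that use the required phrasing, while the similarity score gives partial credit to paraphrases that share vocabulary with the expectation, mirroring the complementary roles the abstract attributes to RegEx and LSA.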