人工智能驱动的聊天机器人

International Journal of Data Mining Techniques and Applications Pub Date : 2022-06-06 DOI:10.20894/ijdmta.102.011.001.006

Aishwarya Ms, Shobha Rani B R

{"title":"人工智能驱动的聊天机器人","authors":"Aishwarya Ms, Shobha Rani B R","doi":"10.20894/ijdmta.102.011.001.006","DOIUrl":null,"url":null,"abstract":"Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data. As the scope of these systems increase to cover more scenarios and domains, manual annotation to improve the accuracy of these components becomes prohibitively costly and time- consuming. In this paper, a group of Amazon researchers propose a system that leverages user-system interaction feedback signals to automate learning without any manual annotation. Users here tend to modify a previous query in hopes of fixing an error in the previous turn to get the right results. These reformulations, which are often preceded by defective experiences caused by errors in ASR, NLU, ER or the application. In some cases, users may not properly formulate their requests (e.g. providing partial title of a song), but gleaning across a wider pool of users and sessions reveals the underlying recurrent patterns. The proposed self-learning system automatically detects the errors, generate reformulations and deploys fixes to the runtime system to correct different types of errors occurring in different components of the system. The results show that the approach is highly scalable, and able to learn reformulations that reduce Alexa-user errors by pooling anonymized data across millions of customers.","PeriodicalId":414709,"journal":{"name":"International Journal of Data Mining Techniques and Applications","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Artificial Intelligence Driven Chatbot\",\"authors\":\"Aishwarya Ms, Shobha Rani B R\",\"doi\":\"10.20894/ijdmta.102.011.001.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data. As the scope of these systems increase to cover more scenarios and domains, manual annotation to improve the accuracy of these components becomes prohibitively costly and time- consuming. In this paper, a group of Amazon researchers propose a system that leverages user-system interaction feedback signals to automate learning without any manual annotation. Users here tend to modify a previous query in hopes of fixing an error in the previous turn to get the right results. These reformulations, which are often preceded by defective experiences caused by errors in ASR, NLU, ER or the application. In some cases, users may not properly formulate their requests (e.g. providing partial title of a song), but gleaning across a wider pool of users and sessions reveals the underlying recurrent patterns. The proposed self-learning system automatically detects the errors, generate reformulations and deploys fixes to the runtime system to correct different types of errors occurring in different components of the system. The results show that the approach is highly scalable, and able to learn reformulations that reduce Alexa-user errors by pooling anonymized data across millions of customers.\",\"PeriodicalId\":414709,\"journal\":{\"name\":\"International Journal of Data Mining Techniques and Applications\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Data Mining Techniques and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20894/ijdmta.102.011.001.006\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining Techniques and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20894/ijdmta.102.011.001.006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

今天，大多数大规模对话AI代理(例如Alexa, Siri或Google Assistant)都是使用手动注释的数据来训练系统的不同组件。通常，通过手动转录和注释数据来提高这些组件中ML模型的准确性。随着这些系统的范围扩大到涵盖更多的场景和领域，提高这些组件的准确性的手动注释变得非常昂贵和耗时。在本文中，一组Amazon研究人员提出了一个系统，该系统利用用户-系统交互反馈信号来自动学习，而无需任何手动注释。这里的用户倾向于修改前一个查询，希望修复前一个查询中的错误以获得正确的结果。这些重新配方，通常在ASR, NLU, ER或应用错误引起的缺陷经验之前。在某些情况下，用户可能不会正确地表达他们的请求(例如，提供歌曲的部分标题)，但通过更广泛的用户池和会话进行收集，可以揭示潜在的循环模式。建议的自学习系统自动检测错误，生成重新表述，并将修复程序部署到运行时系统，以纠正系统不同组件中发生的不同类型的错误。结果表明，该方法具有高度可扩展性，并且能够通过汇集数百万客户的匿名数据来学习减少alexa用户错误的重新配方。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Artificial Intelligence Driven Chatbot

Today, most large-scale conversational AI agents (e.g. Alexa, Siri, or Google Assistant) are built using manually annotated data to train the different components of the system. Typically, the accuracy of the ML models in these components are improved by manually transcribing and annotating data. As the scope of these systems increase to cover more scenarios and domains, manual annotation to improve the accuracy of these components becomes prohibitively costly and time- consuming. In this paper, a group of Amazon researchers propose a system that leverages user-system interaction feedback signals to automate learning without any manual annotation. Users here tend to modify a previous query in hopes of fixing an error in the previous turn to get the right results. These reformulations, which are often preceded by defective experiences caused by errors in ASR, NLU, ER or the application. In some cases, users may not properly formulate their requests (e.g. providing partial title of a song), but gleaning across a wider pool of users and sessions reveals the underlying recurrent patterns. The proposed self-learning system automatically detects the errors, generate reformulations and deploys fixes to the runtime system to correct different types of errors occurring in different components of the system. The results show that the approach is highly scalable, and able to learn reformulations that reduce Alexa-user errors by pooling anonymized data across millions of customers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Data Mining Techniques and Applications

自引率

0.00%

发文量