对话系统质量保证研究综述

2022 IEEE International Conference On Artificial Intelligence Testing (AITest) Pub Date : 2022-08-01 DOI:10.1109/AITest55621.2022.00021

Xiaomin Li, Chuanqi Tao, Jerry Gao, Hongjing Guo

{"title":"对话系统质量保证研究综述","authors":"Xiaomin Li, Chuanqi Tao, Jerry Gao, Hongjing Guo","doi":"10.1109/AITest55621.2022.00021","DOIUrl":null,"url":null,"abstract":"With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.","PeriodicalId":427386,"journal":{"name":"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Review of Quality Assurance Research of Dialogue Systems\",\"authors\":\"Xiaomin Li, Chuanqi Tao, Jerry Gao, Hongjing Guo\",\"doi\":\"10.1109/AITest55621.2022.00021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.\",\"PeriodicalId\":427386,\"journal\":{\"name\":\"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AITest55621.2022.00021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AITest55621.2022.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着机器学习和大数据技术的发展，对话系统已经被应用到许多领域，包括航空航天、银行等对答案精度要求很高的场景。这促使人们对对话系统的质量核查和保证进行了大量的研究。测试和评估作为保证软件质量的两种手段，在目前的研究工作中很少得到全面的总结。首先，根据不同的分类标准对对话系统进行分类。其次，从测试和对话评估两方面回顾了现有的对话系统质量保证工作，包括测试方法、测试工具、评估指标和对话质量属性。并针对目前工作中存在的不足，探讨了存在的问题和需求，为今后的研究提供参考。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Review of Quality Assurance Research of Dialogue Systems

With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference On Artificial Intelligence Testing (AITest)

自引率

0.00%

发文量