{"title":"对话系统质量保证研究综述","authors":"Xiaomin Li, Chuanqi Tao, Jerry Gao, Hongjing Guo","doi":"10.1109/AITest55621.2022.00021","DOIUrl":null,"url":null,"abstract":"With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.","PeriodicalId":427386,"journal":{"name":"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Review of Quality Assurance Research of Dialogue Systems\",\"authors\":\"Xiaomin Li, Chuanqi Tao, Jerry Gao, Hongjing Guo\",\"doi\":\"10.1109/AITest55621.2022.00021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.\",\"PeriodicalId\":427386,\"journal\":{\"name\":\"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AITest55621.2022.00021\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference On Artificial Intelligence Testing (AITest)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AITest55621.2022.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Review of Quality Assurance Research of Dialogue Systems
With the development of machine learning and big data technology, dialogue systems have been applied to many fields, including aerospace, banking and other scenarios that require high accuracy of answer. This has prompted a great deal of research on quality verification and assurance of dialogue systems. As two means to ensure the quality of software, testing and evaluation are rarely comprehensively summarized in current research work. Firstly, the dialogue systems are classified according to different classification standards. Secondly, this paper reviews the existing quality assurance work of dialogue systems from testing and dialogue evaluation, including testing methods, testing tools, evaluation metrics and dialogue quality attributes. Moreover, the issues and needs are discussed aiming at the deficiency in the current work, which can provide references for future research.