{"title":"通过多任务提高DST模型的少镜头性能,更好地为语言障碍人士服务","authors":"Mingyang Sun, QiXiang Gao, Yutao Mou, Guanting Dong, Ruifang Liu, Wenbin Guo","doi":"10.1109/ICASSPW59220.2023.10193387","DOIUrl":null,"url":null,"abstract":"Artificial intelligence-based virtual assistants make people’s daily life more convenient. However, the utterances of language-impaired people are limited and different in characteristics and domains from that of ordinary people. So it is difficult for language-impaired people to benefit from standard data-driven artificial intelligence algorithms. In this paper, we propose a multi-task training method for the dialogue state tracking (DST) task in dialogue systems that make up virtual assistants, improving the performance of T5 on the few-shot cross-domain DST task. Specifically, we consider two ways of handling DST task: predicting the dialogue state from the beginning or updating the dialogue state every turn, and accordingly design the main task and auxiliary task for the model. Experiments show that our method outperforms most previous works on the MultiWOZ 2.0 and 2.1 datasets for the few-shot cross-domain DST task. For the artificial-crafted language-impaired dataset, our method can effectively improve the few-shot cross-domain performance of the model. Additionally, we analyzed the possible reason why this multitasking approach works well.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Improving Few-Shot Performance of DST Model Through Multitask to Better Serve Language-Impaired People\",\"authors\":\"Mingyang Sun, QiXiang Gao, Yutao Mou, Guanting Dong, Ruifang Liu, Wenbin Guo\",\"doi\":\"10.1109/ICASSPW59220.2023.10193387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Artificial intelligence-based virtual assistants make people’s daily life more convenient. However, the utterances of language-impaired people are limited and different in characteristics and domains from that of ordinary people. So it is difficult for language-impaired people to benefit from standard data-driven artificial intelligence algorithms. In this paper, we propose a multi-task training method for the dialogue state tracking (DST) task in dialogue systems that make up virtual assistants, improving the performance of T5 on the few-shot cross-domain DST task. Specifically, we consider two ways of handling DST task: predicting the dialogue state from the beginning or updating the dialogue state every turn, and accordingly design the main task and auxiliary task for the model. Experiments show that our method outperforms most previous works on the MultiWOZ 2.0 and 2.1 datasets for the few-shot cross-domain DST task. For the artificial-crafted language-impaired dataset, our method can effectively improve the few-shot cross-domain performance of the model. 
Additionally, we analyzed the possible reason why this multitasking approach works well.\",\"PeriodicalId\":158726,\"journal\":{\"name\":\"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)\",\"volume\":\"65 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSPW59220.2023.10193387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSPW59220.2023.10193387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Artificial intelligence-based virtual assistants make people's daily lives more convenient. However, the utterances of language-impaired people are limited, and their characteristics and domains differ from those of ordinary people, so it is difficult for language-impaired people to benefit from standard data-driven artificial intelligence algorithms. In this paper, we propose a multi-task training method for the dialogue state tracking (DST) task in the dialogue systems that underlie virtual assistants, improving the performance of T5 on the few-shot cross-domain DST task. Specifically, we consider two ways of handling the DST task, predicting the dialogue state from the beginning of the dialogue or updating the dialogue state turn by turn, and accordingly design a main task and an auxiliary task for the model. Experiments show that our method outperforms most previous work on the MultiWOZ 2.0 and 2.1 datasets for the few-shot cross-domain DST task. On an artificially crafted language-impaired dataset, our method also effectively improves the few-shot cross-domain performance of the model. Additionally, we analyze the possible reasons why this multi-task approach works well.
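The two formulations named in the abstract lend themselves to a text-to-text setup. Below is a minimal sketch, assuming a standard Hugging Face T5 pipeline, of how the main task (predicting the state from the whole dialogue history) and the auxiliary task (updating the previous state from the latest turn) might be posed as prompts. The prompt prefixes, slot names, and example dialogue are illustrative assumptions, not taken from the paper.

```python
# Sketch of the two DST formulations as text-to-text prompts for T5.
# This is not the authors' released code; prompt formats are hypothetical.
from transformers import T5ForConditionalGeneration, T5Tokenizer

# A fine-tuned checkpoint would be needed for meaningful predictions;
# the stock "t5-small" weights only make the example runnable.
tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

dialogue_history = (
    "user: I need a cheap hotel in the north. "
    "system: Okay, for how many nights? "
    "user: Three nights, starting Friday."
)

# Main task: predict the full dialogue state from the whole history.
predict_input = f"track dialogue state: {dialogue_history}"

# Auxiliary task: update the previous state using only the latest turn.
previous_state = "hotel-price=cheap; hotel-area=north"
latest_turn = "user: Three nights, starting Friday."
update_input = (
    f"update dialogue state: previous state: {previous_state} "
    f"turn: {latest_turn}"
)

for source in (predict_input, update_input):
    input_ids = tokenizer(source, return_tensors="pt").input_ids
    output_ids = model.generate(input_ids, max_new_tokens=64)
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Because both prompt styles would share one set of T5 parameters during training, signal from the turn-by-turn update task can plausibly transfer to the full-state prediction task, which appears to be the point of the multi-task design.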