面向土耳其语自然语言理解的联合意图检测和槽填充

IF 1.5 4区计算机科学 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Turkish Journal of Electrical Engineering and Computer Sciences Pub Date : 2023-09-29 DOI:10.55730/1300-0632.4021

OSMAN BÜYÜK

{"title":"面向土耳其语自然语言理解的联合意图检测和槽填充","authors":"OSMAN BÜYÜK","doi":"10.55730/1300-0632.4021","DOIUrl":null,"url":null,"abstract":"Intent detection and slot filling are two crucial subtasks of a text-based goal-oriented dialogue system. In a goal-oriented dialogue system, users interact with the system to complete a goal (or to fulfill their intent) and provide the necessary information (slot values) to achieve that goal. Therefore, a user?s text input includes information about the user?s intent and contains required slot values. Recently, joint models that simultaneously detect the intent and extract the slots are proposed to benefit from the interaction between the two tasks. The proposed methods are usually tested using benchmark data sets in English such as ATIS and SNIPS. Intent detection and slot filling problems are much less studied for the Turkish language mainly due to the lack of publicly available Turkish data sets. In this paper, we translate ATIS in English to Turkish and report intent detection and slot filling accuracies of several different joint models for the translated data set. We publicly share the Turkish ATIS data set to accelerate the research on the tasks. In our experiments, the best performance is obtained with the state-of-the-art bidirectional encoder representations from a transformers (BERT) based model. The BERT model is trained using a combination of intent detection and slot filling losses to jointly optimize a single model for both tasks. We achieved 96.54% intent detection accuracy and 91.56% slot filling F1 for the Turkish language. These accuracies significantly improve (7% absolute in slot filling F1) previously reported results for the same tasks in Turkish. On the other hand, we observe that the accuracy in Turkish is still slightly lower compared to the accuracy in English counterparts. This observation indicates that there is still room for improvement in the results for Turkish.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"6 1","pages":"0"},"PeriodicalIF":1.5000,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Joint intent detection and slot filling for Turkish natural language understanding\",\"authors\":\"OSMAN BÜYÜK\",\"doi\":\"10.55730/1300-0632.4021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Intent detection and slot filling are two crucial subtasks of a text-based goal-oriented dialogue system. In a goal-oriented dialogue system, users interact with the system to complete a goal (or to fulfill their intent) and provide the necessary information (slot values) to achieve that goal. Therefore, a user?s text input includes information about the user?s intent and contains required slot values. Recently, joint models that simultaneously detect the intent and extract the slots are proposed to benefit from the interaction between the two tasks. The proposed methods are usually tested using benchmark data sets in English such as ATIS and SNIPS. Intent detection and slot filling problems are much less studied for the Turkish language mainly due to the lack of publicly available Turkish data sets. In this paper, we translate ATIS in English to Turkish and report intent detection and slot filling accuracies of several different joint models for the translated data set. We publicly share the Turkish ATIS data set to accelerate the research on the tasks. In our experiments, the best performance is obtained with the state-of-the-art bidirectional encoder representations from a transformers (BERT) based model. The BERT model is trained using a combination of intent detection and slot filling losses to jointly optimize a single model for both tasks. We achieved 96.54% intent detection accuracy and 91.56% slot filling F1 for the Turkish language. These accuracies significantly improve (7% absolute in slot filling F1) previously reported results for the same tasks in Turkish. On the other hand, we observe that the accuracy in Turkish is still slightly lower compared to the accuracy in English counterparts. This observation indicates that there is still room for improvement in the results for Turkish.\",\"PeriodicalId\":49410,\"journal\":{\"name\":\"Turkish Journal of Electrical Engineering and Computer Sciences\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2023-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Turkish Journal of Electrical Engineering and Computer Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.55730/1300-0632.4021\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Electrical Engineering and Computer Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55730/1300-0632.4021","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

意图检测和槽填充是基于文本的目标导向对话系统的两个关键子任务。在面向目标的对话系统中，用户与系统交互以完成一个目标(或实现他们的意图)，并提供实现该目标所需的信息(槽值)。因此，用户?文本输入包括关于用户的信息?S意图并包含所需的槽值。近年来，为了充分利用这两个任务之间的相互作用，提出了同时检测意图和提取槽的联合模型。所提出的方法通常使用英语的基准数据集(如ATIS和SNIPS)进行测试。由于缺乏公开可用的土耳其语数据集，对土耳其语的意图检测和槽填充问题的研究要少得多。在本文中，我们将英语的ATIS翻译成土耳其语，并报告了翻译数据集的几种不同联合模型的意图检测和槽填充精度。我们公开分享土耳其ATIS数据集，以加快对任务的研究。在我们的实验中，使用基于变压器(BERT)模型的最先进的双向编码器表示获得了最佳性能。BERT模型使用意图检测和槽填充损失的组合来训练，以联合优化两个任务的单个模型。我们对土耳其语的意图检测准确率达到96.54%，槽位填充F1达到91.56%。这些准确性显著提高(7%的绝对槽填充F1)先前报道的结果在土耳其语相同的任务。另一方面，我们观察到土耳其语的准确性仍然略低于英语的准确性。这一观察结果表明，土耳其的结果仍有改进的余地。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Joint intent detection and slot filling for Turkish natural language understanding

Intent detection and slot filling are two crucial subtasks of a text-based goal-oriented dialogue system. In a goal-oriented dialogue system, users interact with the system to complete a goal (or to fulfill their intent) and provide the necessary information (slot values) to achieve that goal. Therefore, a user?s text input includes information about the user?s intent and contains required slot values. Recently, joint models that simultaneously detect the intent and extract the slots are proposed to benefit from the interaction between the two tasks. The proposed methods are usually tested using benchmark data sets in English such as ATIS and SNIPS. Intent detection and slot filling problems are much less studied for the Turkish language mainly due to the lack of publicly available Turkish data sets. In this paper, we translate ATIS in English to Turkish and report intent detection and slot filling accuracies of several different joint models for the translated data set. We publicly share the Turkish ATIS data set to accelerate the research on the tasks. In our experiments, the best performance is obtained with the state-of-the-art bidirectional encoder representations from a transformers (BERT) based model. The BERT model is trained using a combination of intent detection and slot filling losses to jointly optimize a single model for both tasks. We achieved 96.54% intent detection accuracy and 91.56% slot filling F1 for the Turkish language. These accuracies significantly improve (7% absolute in slot filling F1) previously reported results for the same tasks in Turkish. On the other hand, we observe that the accuracy in Turkish is still slightly lower compared to the accuracy in English counterparts. This observation indicates that there is still room for improvement in the results for Turkish.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Turkish Journal of Electrical Engineering and Computer Sciences COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-ENGINEERING, ELECTRICAL & ELECTRONIC

CiteScore

2.90

自引率

9.10%

发文量

审稿时长

6.9 months

期刊介绍： The Turkish Journal of Electrical Engineering & Computer Sciences is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK) Accepts English-language manuscripts in the areas of power and energy, environmental sustainability and energy efficiency, electronics, industry applications, control systems, information and systems, applied electromagnetics, communications, signal and image processing, tomographic image reconstruction, face recognition, biometrics, speech processing, video processing and analysis, object recognition, classification, feature extraction, parallel and distributed computing, cognitive systems, interaction, robotics, digital libraries and content, personalized healthcare, ICT for mobility, sensors, and artificial intelligence. Contribution is open to researchers of all nationalities.