{"title":"Joint intent detection and slot filling for Turkish natural language understanding","authors":"OSMAN BÜYÜK","doi":"10.55730/1300-0632.4021","DOIUrl":null,"url":null,"abstract":"Intent detection and slot filling are two crucial subtasks of a text-based goal-oriented dialogue system. In a goal-oriented dialogue system, users interact with the system to complete a goal (or to fulfill their intent) and provide the necessary information (slot values) to achieve that goal. Therefore, a user?s text input includes information about the user?s intent and contains required slot values. Recently, joint models that simultaneously detect the intent and extract the slots are proposed to benefit from the interaction between the two tasks. The proposed methods are usually tested using benchmark data sets in English such as ATIS and SNIPS. Intent detection and slot filling problems are much less studied for the Turkish language mainly due to the lack of publicly available Turkish data sets. In this paper, we translate ATIS in English to Turkish and report intent detection and slot filling accuracies of several different joint models for the translated data set. We publicly share the Turkish ATIS data set to accelerate the research on the tasks. In our experiments, the best performance is obtained with the state-of-the-art bidirectional encoder representations from a transformers (BERT) based model. The BERT model is trained using a combination of intent detection and slot filling losses to jointly optimize a single model for both tasks. We achieved 96.54% intent detection accuracy and 91.56% slot filling F1 for the Turkish language. These accuracies significantly improve (7% absolute in slot filling F1) previously reported results for the same tasks in Turkish. On the other hand, we observe that the accuracy in Turkish is still slightly lower compared to the accuracy in English counterparts. This observation indicates that there is still room for improvement in the results for Turkish.","PeriodicalId":49410,"journal":{"name":"Turkish Journal of Electrical Engineering and Computer Sciences","volume":"6 1","pages":"0"},"PeriodicalIF":1.2000,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Electrical Engineering and Computer Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.55730/1300-0632.4021","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Intent detection and slot filling are two crucial subtasks of a text-based goal-oriented dialogue system. In a goal-oriented dialogue system, users interact with the system to complete a goal (or to fulfill their intent) and provide the necessary information (slot values) to achieve that goal. Therefore, a user?s text input includes information about the user?s intent and contains required slot values. Recently, joint models that simultaneously detect the intent and extract the slots are proposed to benefit from the interaction between the two tasks. The proposed methods are usually tested using benchmark data sets in English such as ATIS and SNIPS. Intent detection and slot filling problems are much less studied for the Turkish language mainly due to the lack of publicly available Turkish data sets. In this paper, we translate ATIS in English to Turkish and report intent detection and slot filling accuracies of several different joint models for the translated data set. We publicly share the Turkish ATIS data set to accelerate the research on the tasks. In our experiments, the best performance is obtained with the state-of-the-art bidirectional encoder representations from a transformers (BERT) based model. The BERT model is trained using a combination of intent detection and slot filling losses to jointly optimize a single model for both tasks. We achieved 96.54% intent detection accuracy and 91.56% slot filling F1 for the Turkish language. These accuracies significantly improve (7% absolute in slot filling F1) previously reported results for the same tasks in Turkish. On the other hand, we observe that the accuracy in Turkish is still slightly lower compared to the accuracy in English counterparts. This observation indicates that there is still room for improvement in the results for Turkish.
期刊介绍:
The Turkish Journal of Electrical Engineering & Computer Sciences is published electronically 6 times a year by the Scientific and Technological Research Council of Turkey (TÜBİTAK)
Accepts English-language manuscripts in the areas of power and energy, environmental sustainability and energy efficiency, electronics, industry applications, control systems, information and systems, applied electromagnetics, communications, signal and image processing, tomographic image reconstruction, face recognition, biometrics, speech processing, video processing and analysis, object recognition, classification, feature extraction, parallel and distributed computing, cognitive systems, interaction, robotics, digital libraries and content, personalized healthcare, ICT for mobility, sensors, and artificial intelligence.
Contribution is open to researchers of all nationalities.