G. Sunilkumar, S. S, Steven Frederick Gilbert, C. S.
{"title":"A Concurrent Intelligent Natural Language Understanding Model for an Automated Inquiry System","authors":"G. Sunilkumar, S. S, Steven Frederick Gilbert, C. S.","doi":"10.1109/AIC55036.2022.9848883","DOIUrl":null,"url":null,"abstract":"The work is intended to tackle a vital field that lies at the intersection of speech processing and natural language processing: Spoken Language Understanding (SLU). Its idea is to understand the essence of machine-directed human speech in order to facilitate its further processing and take on board its cognitive impact. The proposed system is CIDIS -Concurrent Intelligent Model for Dialogue Act Classification, Intent Detection and Slot Filling, that uses a deep concurrent multi-task paradigm to perform the three fundamental tasks of the SLU domain: Dialogue Act Classification, Intent Detection and Slot Filling. Since the model is orchestrated in a multi-task fashion, every task interacts with the other to have a global understanding of the input query. It follows an intelligent encoding strategy involving concatenation of the query’s BERT and CharCNN embedding to handle all possible edge cases and ambiguities involved in human speech queries. This intelligent encoding is passed through a Stacked BiLSTM architecture followed by task-specific attention layers. The three supplementary outputs are in turn fed to the final module that generates the expected query response in real-time based on the dialogue act, intent and slot. The developed models are evaluated against standard benchmark datasets like ATIS, TRAINS and FRAMES and the achieved state-of-the-art performances are eventually tabulated.","PeriodicalId":433590,"journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","volume":"169 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The work is intended to tackle a vital field that lies at the intersection of speech processing and natural language processing: Spoken Language Understanding (SLU). Its idea is to understand the essence of machine-directed human speech in order to facilitate its further processing and take on board its cognitive impact. The proposed system is CIDIS -Concurrent Intelligent Model for Dialogue Act Classification, Intent Detection and Slot Filling, that uses a deep concurrent multi-task paradigm to perform the three fundamental tasks of the SLU domain: Dialogue Act Classification, Intent Detection and Slot Filling. Since the model is orchestrated in a multi-task fashion, every task interacts with the other to have a global understanding of the input query. It follows an intelligent encoding strategy involving concatenation of the query’s BERT and CharCNN embedding to handle all possible edge cases and ambiguities involved in human speech queries. This intelligent encoding is passed through a Stacked BiLSTM architecture followed by task-specific attention layers. The three supplementary outputs are in turn fed to the final module that generates the expected query response in real-time based on the dialogue act, intent and slot. The developed models are evaluated against standard benchmark datasets like ATIS, TRAINS and FRAMES and the achieved state-of-the-art performances are eventually tabulated.
这项工作旨在解决语音处理和自然语言处理的交叉领域:口语理解(SLU)。它的想法是理解机器指导的人类语言的本质,以便促进其进一步处理并考虑其认知影响。本文提出的系统是CIDIS (concurrent Intelligent Model for Dialogue Act Classification, Intent Detection and Slot Filling),该系统使用深度并发多任务范式来完成SLU领域的三个基本任务:对话行为分类、意图检测和Slot填充。由于模型是以多任务方式编排的,因此每个任务都与其他任务交互,以获得对输入查询的全局理解。它遵循一种智能编码策略,包括查询的BERT和CharCNN嵌入的连接,以处理人类语音查询中涉及的所有可能的边缘情况和歧义。这种智能编码通过堆叠的BiLSTM架构传递,然后是特定于任务的注意层。三个补充输出依次馈送到最终模块,该模块根据对话行为、意图和插槽实时生成预期的查询响应。开发的模型将根据ATIS、TRAINS和FRAMES等标准基准数据集进行评估,并最终将达到的最先进性能制成表格。