在语音操作系统中使用EuroWordNet

LDV Forum Pub Date : 2004-07-01 DOI:10.21248/jlcl.19.2004.52
I. Thabet, Bernd Ludwig, Frank-Peter Schweinberger, K. Bücher, Günther Görz
{"title":"在语音操作系统中使用EuroWordNet","authors":"I. Thabet, Bernd Ludwig, Frank-Peter Schweinberger, K. Bücher, Günther Görz","doi":"10.21248/jlcl.19.2004.52","DOIUrl":null,"url":null,"abstract":"In natural language processing, incremental semantic composition is one of the most prominent issues. In the past, numerous approaches have been developed for assigning meaning to noun and verb phrases and their complements and modifiers. However, their inferential power is often too weak to be applied to practical applications, or the expressiveness of the representation language is so complex, that it leads to intractable inference procedures. As an answer to these problems, we have developed an approach that relies on Description Logics (DL) for handling semantic construction. First, we will discuss this appraoch and show how a semantic knowledge base can be setup dependant on EUROWORDNET 1 (EWN) as a linguistic ontology. Subsequently we will outline our experience with and demands on EWN. 1 What is EMBASSI and its Objective? EMBASSI (”Elektronische Multimediale Bedienund Service-Assistenz”) has been http://www.illc.uva.nl/EuroWordNet/ a German joint project sponsored by the German Fed. Ministery of Research2. Our contribution to this project consists mainly of three components: the dialogue manager, formal ontologies for several multilingual application domains, and the language generation component to communicate system utterances to the user. The long-term goal of our research is to design and implement a generic dialogue system for rational (spoken) dialogues that helps a user to achieve certain goals in terms of operations of a technical application system – e.g. an information system, a system for controlling devices, or any other kind of problem solving systems. One of its design criteria is the ability to recognize users’ intentions in order to establish corresponding subgoals and control their processing. Furthermore, it should enable mixed-initiative, flexible and cooperative conversations, provide a high level of robustness as well as scalability at the linguistic and application dimensions, and easy portability to new domains. In addition, it should be possible to integrate multilingual linguistic interaction with multimodal It aims to provide easy access for everybody to complex technical systems (A/V home theatre, car devices, and public terminals), encouraging multimodal as well as multilingual user input. forms of input and output such as graphical user interfaces, and – by means of appropriate devices – the recognition of deictic actions. 2 DL Models of Applications Applications are characterized by a DL terminology which models the concepts used for making propositions about application situations. Basically, EMBASSI’s knowledge base is composed of two parts: the EWN ontology, which encodes the linguistic meaning of words determined on an empirical basis, and the STANDARD UPPER ONTOLOGY (SUMO) (NP01), which is used as a generic base model for concepts of the application domain (see (Lu02)). 3 Semantic Construction This section discusses the issue of semantic construction during analyzing natural language input. We are using an incremental approach to the composition of semantic representations. The backbone of our approach is -DRT (Fis96). The parser builds Discourse Representation Structures (DRSes) (KaR93) incrementally and maps them onto ABoxes3 (see (BLG02)). The main question here is how the mapping of domain independent in terms of EWN to application specific language usage in terms of a domain model is done. In the discourse domain, referents usually refer to instances in the application domain. Such pairs of a discourse referent and a corresponding inA general characteristic of DL-Systems is that the knowledge base is made up of two components: the intensional one, called TBox, and the extensional one, called ABox. TBox is a general schema characterizing the classes of individuals to be represented, their general properties and mutual relationships, while ABox is a partial instantiation of this schema, containing assertions relating either individuals to classes, or individuals to each other. So given a concept language L, an ABox-statement in L has one of the forms (DL96): C(a) Concept Membership Assertion R(a, b) Role Membership Assertion where C is an L-concept, R is an L-role, and a, b are individuals. stance are represented by means of a special role called has-lex. For instance, in the definition AvEvent has-lex Program1 it is claimed that an AvEvent4 is related to a discourse referent of Program1. Consequently, all words that are assigned Program1 as a meaning in EWN, designate an instance of AvEvent in the application domain. The DRS:","PeriodicalId":346957,"journal":{"name":"LDV Forum","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2004-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Using EuroWordNet within the Speech Operated System EMBASSI\",\"authors\":\"I. Thabet, Bernd Ludwig, Frank-Peter Schweinberger, K. Bücher, Günther Görz\",\"doi\":\"10.21248/jlcl.19.2004.52\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In natural language processing, incremental semantic composition is one of the most prominent issues. In the past, numerous approaches have been developed for assigning meaning to noun and verb phrases and their complements and modifiers. However, their inferential power is often too weak to be applied to practical applications, or the expressiveness of the representation language is so complex, that it leads to intractable inference procedures. As an answer to these problems, we have developed an approach that relies on Description Logics (DL) for handling semantic construction. First, we will discuss this appraoch and show how a semantic knowledge base can be setup dependant on EUROWORDNET 1 (EWN) as a linguistic ontology. Subsequently we will outline our experience with and demands on EWN. 1 What is EMBASSI and its Objective? EMBASSI (”Elektronische Multimediale Bedienund Service-Assistenz”) has been http://www.illc.uva.nl/EuroWordNet/ a German joint project sponsored by the German Fed. Ministery of Research2. Our contribution to this project consists mainly of three components: the dialogue manager, formal ontologies for several multilingual application domains, and the language generation component to communicate system utterances to the user. The long-term goal of our research is to design and implement a generic dialogue system for rational (spoken) dialogues that helps a user to achieve certain goals in terms of operations of a technical application system – e.g. an information system, a system for controlling devices, or any other kind of problem solving systems. One of its design criteria is the ability to recognize users’ intentions in order to establish corresponding subgoals and control their processing. Furthermore, it should enable mixed-initiative, flexible and cooperative conversations, provide a high level of robustness as well as scalability at the linguistic and application dimensions, and easy portability to new domains. In addition, it should be possible to integrate multilingual linguistic interaction with multimodal It aims to provide easy access for everybody to complex technical systems (A/V home theatre, car devices, and public terminals), encouraging multimodal as well as multilingual user input. forms of input and output such as graphical user interfaces, and – by means of appropriate devices – the recognition of deictic actions. 2 DL Models of Applications Applications are characterized by a DL terminology which models the concepts used for making propositions about application situations. Basically, EMBASSI’s knowledge base is composed of two parts: the EWN ontology, which encodes the linguistic meaning of words determined on an empirical basis, and the STANDARD UPPER ONTOLOGY (SUMO) (NP01), which is used as a generic base model for concepts of the application domain (see (Lu02)). 3 Semantic Construction This section discusses the issue of semantic construction during analyzing natural language input. We are using an incremental approach to the composition of semantic representations. The backbone of our approach is -DRT (Fis96). The parser builds Discourse Representation Structures (DRSes) (KaR93) incrementally and maps them onto ABoxes3 (see (BLG02)). The main question here is how the mapping of domain independent in terms of EWN to application specific language usage in terms of a domain model is done. In the discourse domain, referents usually refer to instances in the application domain. Such pairs of a discourse referent and a corresponding inA general characteristic of DL-Systems is that the knowledge base is made up of two components: the intensional one, called TBox, and the extensional one, called ABox. TBox is a general schema characterizing the classes of individuals to be represented, their general properties and mutual relationships, while ABox is a partial instantiation of this schema, containing assertions relating either individuals to classes, or individuals to each other. So given a concept language L, an ABox-statement in L has one of the forms (DL96): C(a) Concept Membership Assertion R(a, b) Role Membership Assertion where C is an L-concept, R is an L-role, and a, b are individuals. stance are represented by means of a special role called has-lex. For instance, in the definition AvEvent has-lex Program1 it is claimed that an AvEvent4 is related to a discourse referent of Program1. Consequently, all words that are assigned Program1 as a meaning in EWN, designate an instance of AvEvent in the application domain. The DRS:\",\"PeriodicalId\":346957,\"journal\":{\"name\":\"LDV Forum\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"LDV Forum\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21248/jlcl.19.2004.52\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"LDV Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21248/jlcl.19.2004.52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

在自然语言处理中,增量语义组合是一个非常突出的问题。在过去,已经发展了许多方法来确定名词和动词短语及其补语和修饰语的意义。然而,它们的推理能力往往太弱而无法应用于实际应用,或者表示语言的表达性太复杂,导致推理过程难以处理。作为这些问题的答案,我们开发了一种依赖于描述逻辑(DL)来处理语义构造的方法。首先,我们将讨论这种方法,并展示如何建立一个依赖于EUROWORDNET 1 (EWN)作为语言本体的语义知识库。随后,我们将概述我们在EWN方面的经验和需求。1什么是大使馆及其目标?使馆(“电子多媒体Bedienund服务辅助”)是http://www.illc.uva.nl/EuroWordNet/由德国联邦研究部赞助的一个德国联合项目。我们对这个项目的贡献主要包括三个组件:对话管理器、用于多个多语言应用领域的正式本体,以及用于向用户传达系统话语的语言生成组件。我们研究的长期目标是设计和实现一个通用的对话系统,用于理性(口语)对话,帮助用户在技术应用系统(例如信息系统、控制设备的系统或任何其他类型的问题解决系统)的操作方面实现某些目标。其设计标准之一是识别用户意图的能力,以便建立相应的子目标并控制其处理。此外,它应该支持混合主动、灵活和协作的对话,在语言和应用程序维度上提供高水平的健壮性以及可伸缩性,并且易于移植到新领域。此外,应该有可能将多语言的语言交互与多模式结合起来。它旨在为每个人提供方便的访问复杂的技术系统(A/V家庭影院、汽车设备和公共终端),鼓励多模式和多语言的用户输入。输入和输出的形式,如图形用户界面,以及-通过适当的设备-指示动作的识别。应用程序的深度学习模型应用程序的特征是一个深度学习术语,它对用于提出有关应用程序情况的命题的概念进行建模。基本上,使馆的知识库由两部分组成:EWN本体,它对经验基础上确定的单词的语言含义进行编码,以及标准上层本体(SUMO) (NP01),它被用作应用领域概念的通用基础模型(见图2)。本节讨论在分析自然语言输入时的语义结构问题。我们正在使用一种增量方法来组合语义表示。我们方法的主干是-DRT (Fis96)。解析器增量地构建话语表示结构(drs) (KaR93)并将它们映射到ABoxes3(参见BLG02)。这里的主要问题是如何将EWN方面的领域独立映射到领域模型方面的应用程序特定语言使用。在话语域中,指称通常指应用程序域中的实例。这种话语指涉和相应的dl系统的一般特征是,知识库由两个部分组成:一个是内涵部分,称为TBox,另一个是外延部分,称为ABox。TBox是一个通用模式,描述了要表示的个体的类、它们的一般属性和相互关系,而ABox是该模式的部分实例化,包含与个体与类或个体与彼此相关的断言。因此,给定一个概念语言L, L中的abox语句具有一种形式(DL96): C(a)概念隶属断言R(a, b)角色隶属断言,其中C是一个L概念,R是一个L角色,a, b是个体。姿态通过一个称为has-lex的特殊角色来表示。例如,在定义AvEvent has-lex Program1中,声称AvEvent4与Program1的话语引用相关。因此,所有在EWN中被指定为Program1的单词,在应用程序域中指定AvEvent的实例。DRS:
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using EuroWordNet within the Speech Operated System EMBASSI
In natural language processing, incremental semantic composition is one of the most prominent issues. In the past, numerous approaches have been developed for assigning meaning to noun and verb phrases and their complements and modifiers. However, their inferential power is often too weak to be applied to practical applications, or the expressiveness of the representation language is so complex, that it leads to intractable inference procedures. As an answer to these problems, we have developed an approach that relies on Description Logics (DL) for handling semantic construction. First, we will discuss this appraoch and show how a semantic knowledge base can be setup dependant on EUROWORDNET 1 (EWN) as a linguistic ontology. Subsequently we will outline our experience with and demands on EWN. 1 What is EMBASSI and its Objective? EMBASSI (”Elektronische Multimediale Bedienund Service-Assistenz”) has been http://www.illc.uva.nl/EuroWordNet/ a German joint project sponsored by the German Fed. Ministery of Research2. Our contribution to this project consists mainly of three components: the dialogue manager, formal ontologies for several multilingual application domains, and the language generation component to communicate system utterances to the user. The long-term goal of our research is to design and implement a generic dialogue system for rational (spoken) dialogues that helps a user to achieve certain goals in terms of operations of a technical application system – e.g. an information system, a system for controlling devices, or any other kind of problem solving systems. One of its design criteria is the ability to recognize users’ intentions in order to establish corresponding subgoals and control their processing. Furthermore, it should enable mixed-initiative, flexible and cooperative conversations, provide a high level of robustness as well as scalability at the linguistic and application dimensions, and easy portability to new domains. In addition, it should be possible to integrate multilingual linguistic interaction with multimodal It aims to provide easy access for everybody to complex technical systems (A/V home theatre, car devices, and public terminals), encouraging multimodal as well as multilingual user input. forms of input and output such as graphical user interfaces, and – by means of appropriate devices – the recognition of deictic actions. 2 DL Models of Applications Applications are characterized by a DL terminology which models the concepts used for making propositions about application situations. Basically, EMBASSI’s knowledge base is composed of two parts: the EWN ontology, which encodes the linguistic meaning of words determined on an empirical basis, and the STANDARD UPPER ONTOLOGY (SUMO) (NP01), which is used as a generic base model for concepts of the application domain (see (Lu02)). 3 Semantic Construction This section discusses the issue of semantic construction during analyzing natural language input. We are using an incremental approach to the composition of semantic representations. The backbone of our approach is -DRT (Fis96). The parser builds Discourse Representation Structures (DRSes) (KaR93) incrementally and maps them onto ABoxes3 (see (BLG02)). The main question here is how the mapping of domain independent in terms of EWN to application specific language usage in terms of a domain model is done. In the discourse domain, referents usually refer to instances in the application domain. Such pairs of a discourse referent and a corresponding inA general characteristic of DL-Systems is that the knowledge base is made up of two components: the intensional one, called TBox, and the extensional one, called ABox. TBox is a general schema characterizing the classes of individuals to be represented, their general properties and mutual relationships, while ABox is a partial instantiation of this schema, containing assertions relating either individuals to classes, or individuals to each other. So given a concept language L, an ABox-statement in L has one of the forms (DL96): C(a) Concept Membership Assertion R(a, b) Role Membership Assertion where C is an L-concept, R is an L-role, and a, b are individuals. stance are represented by means of a special role called has-lex. For instance, in the definition AvEvent has-lex Program1 it is claimed that an AvEvent4 is related to a discourse referent of Program1. Consequently, all words that are assigned Program1 as a meaning in EWN, designate an instance of AvEvent in the application domain. The DRS:
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信