LDV Forum最新文献_第2页

Domain ontologies and wordnets in OWL: Modelling options OWL中的领域本体和词网:建模选项

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.92

H. Lüngen, Angelika Storrer

{"title":"Domain ontologies and wordnets in OWL: Modelling options","authors":"H. Lüngen, Angelika Storrer","doi":"10.21248/jlcl.22.2007.92","DOIUrl":"https://doi.org/10.21248/jlcl.22.2007.92","url":null,"abstract":"Word nets are lexical reference systems that follow the design principles of the Princeton WordNet project (Fellbaum 1998, henceforth referred to as PWN1). Domain ontologies (or domain-specific ontologies, e.g. GOLD2 or the GENE Ontology3) represent knowledge about a specific domain in a format that supports automated reasoning about the objects in that domain and the relations between them (cf. Erdmann 2001, 78). Word nets have been used in various applications of text processing, e.g. discourse parsing, lexical and thematic chaining, cohesion analyses, automatic segmentation and linking, anaphora resolution, and information extraction. When these applications process documents dealing with a specific domain, one needs to combine knowlegde about the domain-specific vocabulary represented in domain ontologies with lexical repositories representing general vocabulary (like PWN). In this context, it is useful to represent and interrelate the entities and relations in both types of resources using a common representation language. In our research group “Text-technological Information Modelling4” we chose OWL as a common format for this purpose. Since our projects are mainly concerned with German documents, we developed an OWL model that relates the German wordnet GermaNet (henceforth referred to as GN)5 with domain-specific ontologies in an approach that was inspired by the Plug-In model proposed in Magnini/Speranza (2002). Our approach is decribed in Kunze et al. (to appear); it was evaluated using representative subsets of GN and of the domain ontology TermNet6 (henceforth referred to as TN) as data and Protégé","PeriodicalId":346957,"journal":{"name":"LDV Forum","volume":"200 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116352209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Analysis of E-Discussions Using Classifier Induced Semantic Spaces 基于分类器诱导语义空间的电子讨论分析

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.87

Edda Leopold, J. Kindermann, G. Paass

引用次数: 0

Towards a Logical Description of Trees in Annotation Graphs 注释图中树的逻辑描述

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.96

J. Michaelis, Uwe Mönnich

引用次数: 7

Structural Classifiers of Text Types: Towards a Novel Model of Text Representation 文本类型的结构分类器:迈向一种新的文本表示模型

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.95

Alexander Mehler, Peter Geibel, O. Pustylnikov

{"title":"Structural Classifiers of Text Types: Towards a Novel Model of Text Representation","authors":"Alexander Mehler, Peter Geibel, O. Pustylnikov","doi":"10.21248/jlcl.22.2007.95","DOIUrl":"https://doi.org/10.21248/jlcl.22.2007.95","url":null,"abstract":"Texts can be distinguished in terms of their content, function, structure or layout (Brinker, 1992; Bateman et al., 2001; Joachims, 2002; Power et al., 2003). These reference points do not open necessarily orthogonal perspectives on text classification. As part of explorative data analysis, text classification aims at automatically dividing sets of textual objects into classes of maximum internal homogeneity and external heterogeneity. This paper deals with classifying texts into text types whose instances serve more or less homogeneous functions. Other than mainstream approaches, which rely on the vector space model (Sebastiani, 2002) or some of its descendants (Baeza-Yates and Ribeiro-Neto, 1999) and, thus, on content-related lexical features, we solely refer to structural dierentiae. That is, we explore patterns of text structure as determinants of class membership. Our starting point are tree-like text representations which induce feature vectors and tree kernels. These kernels are utilized in supervised learning based on cross-validation as a method of model selection (Hastie et al., 2001) by example of a corpus of press communication. For a subset of categories we show that classification can be performed very well by structural dierentia only.","PeriodicalId":346957,"journal":{"name":"LDV Forum","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128065404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 32

Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection 篇章关系分类中人工与自动标记数据:实例和特征选择的影响

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.86

C. Sporleder

{"title":"Manually vs. Automatically Labelled Data in Discourse Relation Classification: Effects of Example and Feature Selection","authors":"C. Sporleder","doi":"10.21248/jlcl.22.2007.86","DOIUrl":"https://doi.org/10.21248/jlcl.22.2007.86","url":null,"abstract":"Wordnets are lexical reference systems that follow the design principles of the Princeton WordNet project (Fellbaum, ). Domain ontologies (or domain-specific ontologies such as GOLD, or the GENE Ontology) represent knowledge about a specific domain in a format that supports automated reasoning about the objects in that domain and the relations between them (Erdmann, ). In this paper, we will discuss how the Web Ontology Language OWL can be used to represent and interrelate the entities and relations in both types of resources. Our special focus will be on the question, whether synsets should be modelled as individuals (we use individual and instance as synonyms and will refer to this option as instance model) or as classes (we will refer to this option as class model). We will present three OWL models, each of which offers different solutions to this question. These models were developed in the context of the research group “Text-technological Modelling of Information” as a collaboration of the projects SemDok and HyTex. Since these projects are mainly concerned with German documents and with corpora that contain documents of a special technical or scientific domain, we used subsets of the German wordnet GermaNet (Kunze and Lemnitzer, ), henceforth referred to as GN, and the German domain ontology TermNet (Beiswenger et al., ), henceforth referred to as TN, to develop and evaluate the three models. To relate the general vocabulary of GN with the domain specific terms in TN, we developed an approach that was inspired by the plug-in model proposed by Magnini and Speranza (). In this approach, which has been developed in cooperation with the GermaNet research group (see Kunze et al. () for details), we adapted the OWL model for the English Princeton WordNet suggested by van Assem et al. () to GN, i.e. we modelled German synsets as instances of word-class-specific synset classes. For the reasons explained in section , we wanted to experiment with alternative models that implement the class model. In section  we will present three alternative OWL representations for GN and TN and discuss their benefits and drawbacks.","PeriodicalId":346957,"journal":{"name":"LDV Forum","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129781474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Chatbots in der praktischen Fachlexikographie / Terminologie 技术用语中的吊索词

LDV Forum Pub Date : 2007-07-01 DOI: 10.21248/jlcl.22.2007.89

Franziskus Geeb

引用次数: 2

UniTerm - Formats and Terminology Exchange 格式和术语交换

LDV Forum Pub Date : 2006-07-01 DOI: 10.21248/jlcl.21.2006.79

W. Zenk

引用次数: 1

Lexicon Exchange in MT - The Long Way to Standardization MT中的词汇交换-标准化之路漫漫

LDV Forum Pub Date : 2006-07-01 DOI: 10.21248/jlcl.21.2006.80

Stefanie Geldbach