Eliciting specialized frames from corpora using argument-structure extraction techniques
Beatriz Sánchez Cárdenas, Carlos Ramisch
Terminology, published 2019-07-24
DOI: 10.1075/TERM.00026.SAN (https://doi.org/10.1075/TERM.00026.SAN)
Frame Semantics provides a powerful cross-lingual model to describe the conceptual structure underlying
specialized language. Building specialized frames is challenging because of the complex nature of predicate-argument structures,
and because of the domain-specific uses of general-language predicates. Our semi-automatic method elicits semantic frames from
specialized corpora. It aims to discover lexical patterns that reveal the structure of specialized frames and to populate them
with corpus-based data. Firstly, we automatically extracted verb-noun triples from corpora using bootstrapping to identify
noun-verb-noun phraseological patterns. Secondly, we annotated each noun-verb-noun triple with the lexical domain of the verbs and
the semantic class and role of the noun filling each argument slot. We then used these annotations and patterns to classify
similar triples. Thus, the structure and the types of lexical units that belong to each specialized frame were inferred.
An analysis of specialized corpora of environmental science texts in English and Spanish illustrates our methodology.
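The two steps described in the abstract (extracting noun-verb-noun triples, then annotating each triple with a verb domain and noun classes and roles) can be illustrated with a minimal Python sketch. This is not the authors' implementation: it replaces bootstrapping with a naive nearest-noun heuristic over pre-tagged tokens, and the example sentences, the `VERB_DOMAIN` and `NOUN_CLASS` lexicons, and all label names are hypothetical.

```python
def extract_nvn_triples(tagged_sentence):
    """Return (noun, verb, noun) triples from a list of (word, tag) pairs.

    Naive heuristic (an assumption, not the paper's method): for each verb,
    pair the nearest noun to its left with the nearest noun to its right.
    """
    triples = []
    for i, (word, tag) in enumerate(tagged_sentence):
        if tag != "VERB":
            continue
        left = next((w for w, t in reversed(tagged_sentence[:i]) if t == "NOUN"), None)
        right = next((w for w, t in tagged_sentence[i + 1:] if t == "NOUN"), None)
        if left is not None and right is not None:
            triples.append((left, word, right))
    return triples


# Hypothetical hand-built lexicons standing in for manual annotation.
VERB_DOMAIN = {"destroys": "CHANGE", "damage": "CHANGE"}
NOUN_CLASS = {
    "erosion": ("NATURAL_PROCESS", "AGENT"),
    "storms": ("NATURAL_PROCESS", "AGENT"),
    "coastline": ("LANDFORM", "PATIENT"),
    "habitats": ("LANDFORM", "PATIENT"),
}


def annotate(triple):
    """Attach the verb's lexical domain and each noun's (class, role) pair."""
    arg1, verb, arg2 = triple
    unknown = ("UNKNOWN", "UNKNOWN")
    return {
        "triple": triple,
        "verb_domain": VERB_DOMAIN.get(verb, "UNKNOWN"),
        "arg1": NOUN_CLASS.get(arg1, unknown),
        "arg2": NOUN_CLASS.get(arg2, unknown),
    }


# Hypothetical, hand-tagged example sentences.
sentences = [
    [("erosion", "NOUN"), ("destroys", "VERB"), ("the", "DET"), ("coastline", "NOUN")],
    [("storms", "NOUN"), ("damage", "VERB"), ("coastal", "ADJ"), ("habitats", "NOUN")],
]

annotated = [annotate(t) for s in sentences for t in extract_nvn_triples(s)]
for record in annotated:
    print(record)
```

Triples that share the same verb domain and the same argument classes and roles, as the two examples here do, would be candidates for the same specialized frame. In practice a dependency parser, rather than token adjacency, would be a more reliable way to identify the arguments of each verb.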
About the journal:
Terminology is an independent journal with a cross-cultural and cross-disciplinary scope. It focusses on the discussion of (systematic) solutions not only of language problems encountered in translation, but also, for example, of (monolingual) problems of ambiguity, reference and developments in multidisciplinary communication. Particular attention will be given to new and developing subject areas such as knowledge representation and transfer, information technology tools, expert systems and terminological databases. Terminology encompasses terminology both in general (theory and practice) and in specialized fields (LSP), such as physics.