{"title":"Inheritance and Complementation: a Case Study of Easy Adjectives and Related Nouns","authors":"D. Flickinger, J. Nerbonne","doi":"10.22028/D291-24826","DOIUrl":"https://doi.org/10.22028/D291-24826","url":null,"abstract":"Mechanisms for representing lexically the bulk of syntactic and semantic information for a language have been under active development, as is evident in the recent studies contained in this volume. Our study serves to highlight some of the most useful tools available for structured lexical representation, in particular (multiple) inheritance, default specification, and lexical rules. It then illustrates the value of these mechanisms in illuminating one corner of the lexicon involving an unusual kind of complementation among a group of adjectives exemplified by easy. The virtues of the structured lexicon are its succinctness and its tendency to highlight significant clusters of linguistic properties. From its succinctness follow two practical advantages, namely its ease of maintenance and modification. In order to suggest how important these may be practically, we extend the analysis of adjectival complementation in several directions. These further illustrate how the use of inheritance in lexical representation permits exact and explicit characterizations of phenomena in the language under study. We demonstrate how the use of the mechanisms employed in the analysis of easy enables us to give a unified account of related phenomena featuring nouns such as pleasure, and even the adverbs (adjectival specifiers) too and enough. Along the way we motivate some elaborations of the HPSG (head-driven phrase structure grammar) framework in which we couch our analysis, and offer several avenues for further study of this part of the English lexicon.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127600487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Practical Approach to Multiple Default Inheritance for Unification-Based Lexicons","authors":"G. Russell, Afzal Ballim, John A. Carroll, Susan Warwick-Armstrong","doi":"10.1017/CBO9780511663642.008","DOIUrl":"https://doi.org/10.1017/CBO9780511663642.008","url":null,"abstract":"This paper describes a unification-based lexicon system for NLP applications that incorporates mechanisms for multiple default inheritance. Such systems are intractable in the general case---the approach adopted here places a number of restrictions on the inheritance hierarchy in order to remove some of the sources of complexity while retaining more desirable properties. Implications of the design choices are discussed, comparisons are drawn with related work in computational linguistics and AI, and illustrative examples from the lexicons of German and English are given.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121465701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
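The restricted multiple-default-inheritance scheme this record describes can be suggested with a minimal Python sketch. This illustrates only the general idea, not the paper's formalism: the class and feature names are hypothetical, and a fixed leftmost-parent priority stands in for the specific restrictions the authors place on the hierarchy.

```python
class LexClass:
    """A node in a multiple-default-inheritance lexicon hierarchy.
    Locally specified features override anything inherited; inherited
    defaults are resolved by searching parents left to right, so the
    leftmost parent has priority.  A fixed priority ordering like this
    is one simple way to keep multiple default inheritance tractable."""

    def __init__(self, name, parents=(), **features):
        self.name = name
        self.parents = list(parents)
        self.features = features

    def get(self, feature):
        if feature in self.features:      # local specification wins
            return self.features[feature]
        for parent in self.parents:       # leftmost parent first
            value = parent.get(feature)
            if value is not None:
                return value
        return None

# A toy German fragment: VERB supplies default inflectional endings,
# a second (orthographic) hierarchy contributes spelling features, and
# an irregular verb overrides one inherited default locally.
verb = LexClass("VERB", pers3sg="-t", infinitive="-en")
umlaut_orth = LexClass("UMLAUT-ORTH", umlaut=True)
geben = LexClass("geben", parents=[verb, umlaut_orth], pers3sg="gibt")
```

Here `geben` takes its third-person-singular form from its own entry, its infinitive ending from VERB, and its orthographic feature from the second hierarchy, so each property is stated exactly once.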
{"title":"Incremental Processing and the Hierarchical Lexicon","authors":"E. V. D. Linden","doi":"10.5555/142235.142245","DOIUrl":"https://doi.org/10.5555/142235.142245","url":null,"abstract":"Hierarchical lexicon structures are not only of great importance for the nonredundant representation of lexical information, they may also contribute to the efficiency of the actual processing of natural language. Two parsing techniques that render the parsing process efficient are presented. Windowing is a technique for incrementally accessing the hierarchical lexicon. Lexical preferencing implements preferences within the parsing process as a natural consequence of the hierarchical structure of the lexicon. Within a proof-theoretic approach to Categorial Grammar it is possible to implement these techniques in a formal and principled way. Special attention is paid to idiomatic expressions.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130221394","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Program for Aligning Sentences in Bilingual Corpora","authors":"W. Gale, Kenneth Ward Church","doi":"10.3115/981344.981367","DOIUrl":"https://doi.org/10.3115/981344.981367","url":null,"abstract":"Researchers in both machine translation (e.g., Brown et al. 1990) and bilingual lexicography (e.g., Klavans and Tzoukermann 1990) have recently become interested in studying bilingual corpora, bodies of text such as the Canadian Hansards (parliamentary proceedings), which are available in multiple languages (such as French and English). One useful step is to align the sentences, that is, to identify correspondences between sentences in one language and sentences in the other language. This paper will describe a method and a program (align) for aligning sentences based on a simple statistical model of character lengths. The program uses the fact that longer sentences in one language tend to be translated into longer sentences in the other language, and that shorter sentences tend to be translated into shorter sentences. A probabilistic score is assigned to each proposed correspondence of sentences, based on the scaled difference of lengths of the two sentences (in characters) and the variance of this difference. This probabilistic score is used in a dynamic programming framework to find the maximum likelihood alignment of sentences. It is remarkable that such a simple approach works as well as it does. An evaluation was performed based on a trilingual corpus of economic reports issued by the Union Bank of Switzerland (UBS) in English, French, and German. The method correctly aligned all but 4% of the sentences. Moreover, it is possible to extract a large subcorpus that has a much smaller error rate. By selecting the best-scoring 80% of the alignments, the error rate is reduced from 4% to 0.7%. There were more errors on the English-French subcorpus than on the English-German subcorpus, showing that error rates will depend on the corpus considered; however, both were small enough to hope that the method will be useful for many language pairs. To further research on bilingual corpora, a much larger sample of Canadian Hansards (approximately 90 million words, half in English and half in French) has been aligned with the align program and will be available through the Data Collection Initiative of the Association for Computational Linguistics (ACL/DCI). In addition, in order to facilitate replication of the align program, an appendix is provided with detailed C code of the more difficult core of the align program.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123602899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
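The alignment recipe this abstract lays out (a length-difference score converted to a cost, then dynamic programming over a small set of alignment moves) lends itself to a compact sketch. In the Python version below, c = 1 and s² = 6.8 echo the estimates reported for the length model, but the move penalties are illustrative guesses, not the published priors, and this is not the authors' align program.

```python
import math

# Length-difference model: C is the expected target/source character-length
# ratio and S2 the variance of the scaled difference.  C = 1 and s^2 = 6.8
# echo the reported estimates; treat them as illustrative defaults here.
C, S2 = 1.0, 6.8

def norm_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

def length_cost(l1, l2):
    """Cost (a negative log tail probability) of pairing character spans
    of lengths l1 and l2, based on their scaled difference."""
    mean = (l1 + l2 / C) / 2
    if mean == 0:
        return 0.0
    delta = (l2 - l1 * C) / math.sqrt(mean * S2)
    p = 2 * (1 - norm_cdf(abs(delta)))
    return -math.log(p) if p > 0 else 25.0

# Allowed alignment moves (source sentences, target sentences), each with
# a fixed prior cost; the penalty values here are illustrative guesses.
MOVES = {(1, 1): 0.0, (1, 0): 4.0, (0, 1): 4.0, (2, 1): 2.0, (1, 2): 2.0}

def align(src, tgt):
    """Maximum-likelihood sentence alignment by dynamic programming."""
    n, m = len(src), len(tgt)
    INF = float("inf")
    cost = [[INF] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == INF:
                continue
            for (di, dj), pen in MOVES.items():
                ii, jj = i + di, j + dj
                if ii > n or jj > m:
                    continue
                l1 = sum(len(s) for s in src[i:ii])
                l2 = sum(len(s) for s in tgt[j:jj])
                c = cost[i][j] + pen + length_cost(l1, l2)
                if c < cost[ii][jj]:
                    cost[ii][jj], back[ii][jj] = c, (i, j)
    # Trace back the best path as (source-span, target-span) pairs.
    path, i, j = [], n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        path.append((src[pi:i], tgt[pj:j]))
        i, j = pi, pj
    return list(reversed(path))

pairs = align(["Hello there.", "How are you?"],
              ["Bonjour.", "Comment allez-vous ?"])
```

On such a toy input the 1-1 alignments dominate, because insertions, deletions, and merges pay a fixed prior cost on top of the length score.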
{"title":"A Statistical Approach to Machine Translation","authors":"P. Brown, J. Cocke, S. D. Pietra, V. D. Pietra, F. Jelinek, J. Lafferty, R. Mercer, P. Roossin","doi":"10.7551/mitpress/5779.003.0039","DOIUrl":"https://doi.org/10.7551/mitpress/5779.003.0039","url":null,"abstract":"In this paper, we present a statistical approach to machine translation. We describe the application of our approach to translation from French to English and give preliminary results.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128533176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tense, Quantifiers, and Contexts","authors":"E. Hinrichs","doi":"10.5555/55056.55057","DOIUrl":"https://doi.org/10.5555/55056.55057","url":null,"abstract":"This paper describes a compositional semantics for temporal expressions as part of the meaning representation language (MRL) of the JANUS system, a natural language understanding and generation system under joint development by BBN Labs and ISI. The analysis is based on a higher-order intensional logic described in detail in Hinrichs (1987a). Temporal expressions of English are translated into this language as quantifiers over times that bind temporal indices on predicates. The semantic evaluation of time-dependent predicates is defined relative to a set of discourse contexts, which, following Reichenbach (1947), include the parameters of speech time and reference time. The resulting context-dependent and multi-indexed interpretation of temporal expressions solves a set of well-known problems that arise when traditional systems of tense logic are applied to natural language semantics. Based on the principle of rule-to-rule translation, the compositional nature of the analysis provides a straightforward and well-defined interface between the parsing component and the semantic-interpretation component of JANUS.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1988-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134488996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Commonsense Metaphysics and Lexical Semantics","authors":"Jerry R. Hobbs, W. Bruce Croft, Todd R. Davies, Douglas Edwards, K. Laws","doi":"10.3115/981131.981163","DOIUrl":"https://doi.org/10.3115/981131.981163","url":null,"abstract":"In the TACITUS project for using commonsense knowledge in the understanding of texts about mechanical devices and their failures, we have been developing various commonsense theories that are needed to mediate between the way we talk about the behavior of such devices and causal models of their operation. Of central importance in this effort is the axiomatization of what might be called \"commonsense metaphysics\". This includes a number of areas that figure in virtually every domain of discourse, such as scalar notions, granularity, time, space, material, physical objects, causality, functionality, force, and shape. Our approach to lexical semantics is then to construct core theories of each of these areas, and then to define, or at least characterize, a large number of lexical items in terms provided by the core theories. In the TACITUS system, processes for solving pragmatics problems posed by a text will use the knowledge base consisting of these theories in conjunction with the logical forms of the sentences in the text to produce an interpretation. In this paper we do not stress these interpretation processes; this is another, important aspect of the TACITUS project, and it will be described in subsequent papers.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1986-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128643311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"PHRED: A Generator for Natural Language Interfaces","authors":"P. Jacobs","doi":"10.1007/978-1-4612-3846-1_9","DOIUrl":"https://doi.org/10.1007/978-1-4612-3846-1_9","url":null,"abstract":"","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1985-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126433087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Strong Generative Capacity, Weak Generative Capacity, and Modern Linguistic Theories","authors":"R. Berwick","doi":"10.5555/970170.970175","DOIUrl":"https://doi.org/10.5555/970170.970175","url":null,"abstract":"What makes a language a natural language? A longstanding tradition in generative grammar holds that a language is natural just in case it is learnable under a constellation of auxiliary assumptions about input evidence available to children. Yet another approach seeks some key mathematical property that distinguishes the natural languages from all possible symbol-systems. With some exceptions (for example, Chomsky's demonstration that a complete characterization of our grammatical knowledge lies beyond the power of finite-state languages), the mathematical approach has not provided clear-cut results. For example, for a variety of reasons we cannot say that the predicate \"is context-free\" characterizes all and only the natural languages. Still another use of mathematical analysis in linguistics has been to diagnose a proposed grammatical formalism as too powerful (allowing too many grammars or languages) rather than as too weak. Such a diagnosis was supposed by some to follow from Peters and Ritchie's demonstration that the theory of transformational grammar as described in Chomsky's Aspects of the Theory of Syntax could specify grammars to generate any recursively enumerable set. For some this demonstration marked a watershed in the formal analysis of transformational grammar. One general reaction (not prompted by the Peters and Ritchie result alone) was to turn to other theories of grammar designed to explicitly avoid the problems of a theory that could specify an arbitrary Turing machine computation. The proposals for generalized phrase structure grammar (GPSG) and lexical-functional grammar (LFG) have explicitly emphasized this point. GPSG aims for grammars that generate context-free languages (though there is some recent wavering on this point; see Pullum 1984); LFG, for languages that are at worst context-sensitive. Whatever the merits of the arguments for this restriction in terms of weak generative capacity (and they are far from obvious, as discussed at length in Berwick and Weinberg (1983)), one point remains: the switch was prompted by criticism of the nearly two-decades-old Aspects theory. Much has changed in transformational grammar in twenty years. Modern transformational grammars no longer contain swarms of individual rules such as Passive, Raising, or Dative. The modern government-binding (GB) theory does not reconstruct a \"deep structure\", does not contain powerful deletion rules, and has introduced a whole host of new constraints. Given these sweeping changes, it would seem appropriate, then, to re-examine the Peters and Ritchie result, and compare the power of the newer GB-style theories to these other current linguistic theories. That is the aim of this paper. The basic points to be made are these:","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"302 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1984-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124316034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Correction of Ill-Formed Input Using History-Based Expectation with Applications to Speech Understanding","authors":"Pamela E. Fink, A. Biermann","doi":"10.21236/ada162222","DOIUrl":"https://doi.org/10.21236/ada162222","url":null,"abstract":"A method for error correction of ill-formed input is described that acquires dialogue patterns in typical usage and uses these patterns to predict new inputs. Error correction is done by strongly biasing parsing toward expected meanings unless clear evidence from the input shows the current sentence is not expected. A dialogue acquisition and tracking algorithm is presented along with a description of its implementation in a voice-interactive system. A series of tests is described that shows the power of the error correction methodology when stereotypic dialogue occurs.","PeriodicalId":360119,"journal":{"name":"Comput. Linguistics","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125082802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}