Proceedings of the 19th international conference on Computational linguistics -最新文献

An Agent-based Approach to Chinese Named Entity Recognition 基于agent的中文命名实体识别方法

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072308

Shiren Ye, Tat-Seng Chua, Jimin Liu

引用次数: 24

Morphological Analysis of the Spontaneous Speech Corpus 自发语音语料库的形态分析

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1071884.1071903

Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada, S. Sekine, H. Isahara

引用次数: 15

A Robust Cross-Style Bilingual Sentences Alignment Model 鲁棒跨风格双语句子对齐模型

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072237

T. Kueng, Keh-Yih Su

{"title":"A Robust Cross-Style Bilingual Sentences Alignment Model","authors":"T. Kueng, Keh-Yih Su","doi":"10.3115/1072228.1072237","DOIUrl":"https://doi.org/10.3115/1072228.1072237","url":null,"abstract":"Most current sentence alignment approaches adopt sentence length and cognate as the alignment features; and they are mostly trained and tested in the documents with the same style. Since the length distribution, alignment-type distribution (used by length-based approaches) and cognate frequency vary significantly across texts with different styles, the length-based approaches fail to achieve similar performance when tested in corpora of different styles. The experiments show that the performance in F-measure could drop from 98.2% to 85.6% when a length-based approach is trained by a technical manual and then tested on a general magazine.Since a large percentage of content words in the source text would be translated into the corresponding translation duals to preserve the meaning in the target text, transfer lexicons are usually regarded as more reliable cues for aligning sentences when the alignment task is performed by human. To enhance the robustness, a robust statistical model based on both transfer lexicons and sentence lengths are proposed in this paper. After integrating the transfer lexicons into the model, a 60% F-measure error reduction (from 14.4% to 5.8%) is observed.","PeriodicalId":437823,"journal":{"name":"Proceedings of the 19th international conference on Computational linguistics -","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132139760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Meta-evaluation of Summaries in a Cross-lingual Environment using Content-based Metrics 使用基于内容的度量在跨语言环境中对摘要进行元评价

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072301

Horacio Saggion, Dragomir R. Radev, Simone Teufel, Wai Lam

引用次数: 64

Structure Alignment Using Bilingual Chunking 使用双语分块法进行结构对齐

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072238

Wei Wang, M. Zhou, Jin-Xia Huang, C. Huang

引用次数: 22

Looking for Candidate Translational Equivalents in Specialized, Comparable Corpora 在专业可比语料库中寻找候选翻译对等物

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1071884.1071904

Yun-Chuang Chiao, Pierre Zweigenbaum

引用次数: 146

Natural Language and Inference in a Computer Game 计算机游戏中的自然语言和推理

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072341

Malte Gabsdil, Alexander Koller, Kristina Striegnitz

引用次数: 12

Learning Verb Argument Structure from Minimally Annotated Corpora 从最小标注语料库中学习动词论点结构

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072268

Anoop Sarkar, Woottiporn Tripasai

引用次数: 18

Syntactic Features for High Precision Word Sense Disambiguation 高精度词义消歧的句法特征

Proceedings of the 19th international conference on Computational linguistics - Pub Date : 2002-08-24 DOI: 10.3115/1072228.1072340

David Martínez, Eneko Agirre, Lluís Màrquez i Villodre

引用次数: 38