Finite-State Methods and Natural Language Processing最新文献

Transition-Based Coding and Formal Language Theory for Ordered Digraphs 有序有向图的转换编码与形式语言理论

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-23 DOI: 10.18653/v1/W19-3115

Anssi Yli-Jyrä

引用次数: 3

Regular transductions with MCFG input syntax 使用MCFG输入语法的常规转导

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-23 DOI: 10.18653/v1/W19-3109

M. Nederhof, H. Vogler

引用次数: 1

Finite State Transducer Calculus for Whole Word Morphology 全词形态学的有限状态换能器演算

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3107

Maciej Janicki

引用次数: 0

Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil 使用元形态规则开发形态分析器:以泰米尔语为例

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3111

Kengatharaiyer Sarveswaran, G. Dias, Miriam Butt

{"title":"Using Meta-Morph Rules to develop Morphological Analysers: A case study concerning Tamil","authors":"Kengatharaiyer Sarveswaran, G. Dias, Miriam Butt","doi":"10.18653/v1/W19-3111","DOIUrl":"https://doi.org/10.18653/v1/W19-3111","url":null,"abstract":"This paper describes a new and larger coverage Finite-State Morphological Analyser (FSM) and Generator for the Dravidian language Tamil. The FSM has been developed in the context of computational grammar engineering, adhering to the standards of the ParGram effort. Tamil is a morphologically rich language and the interaction between linguistic analysis and formal implementation is complex, resulting in a challenging task. In order to allow the development of the FSM to focus more on the linguistic analysis and less on the formal details, we have developed a system of meta-morph(ology) rules along with a script which translates these rules into FSM processable representations. The introduction of meta-morph rules makes it possible for computationally naive linguists to interact with the system and to expand it in future work. We found that the meta-morph rules help to express linguistic generalisations and reduce the manual effort of writing lexical classes for morphological analysis. Our Tamil FSM currently handles mainly the inflectional morphology of 3,300 verb roots and their 260 forms. Further, it also has a lexicon of approximately 100,000 nouns along with a guesser to handle out-of-vocabulary items. Although the Tamil FSM was primarily developed to be part of a computational grammar, it can also be used as a web or stand-alone application for other NLP tasks, as per general ParGram practice.","PeriodicalId":286427,"journal":{"name":"Finite-State Methods and Natural Language Processing","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129481776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Latin script keyboards for South Asian languages with finite-state normalization 具有有限状态规范化的南亚语言的拉丁字母键盘

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3114

Lawrence Wolf-Sonkin, Vlad Schogol, Brian Roark, M. Riley

引用次数: 6

Distilling weighted finite automata from arbitrary probabilistic models 从任意概率模型中提取加权有限自动机

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3112

A. Suresh, Brian Roark, M. Riley, Vlad Schogol

引用次数: 6

Bottom-Up Unranked Tree-to-Graph Transducers for Translation into Semantic Graphs 自底向上的无排序树到图转换器翻译成语义图

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3104

Johanna Björklund, Shay B. Cohen, F. Drewes, G. Satta

引用次数: 1

A Syntactically Expressive Morphological Analyzer for Turkish 土耳其语句法表达形态分析器

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3110

Adnan Ozturel, Tolga Kayadelen, Isin Demirsahin

引用次数: 9

On the Compression of Lexicon Transducers 关于词典换能器的压缩

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3105

Marco Cognetta, Cyril Allauzen, M. Riley

引用次数: 0

Weighted parsing for grammar-based language models 基于语法的语言模型的加权解析

Finite-State Methods and Natural Language Processing Pub Date : 2019-09-01 DOI: 10.18653/v1/W19-3108

Richard Mörbitz, H. Vogler

引用次数: 3