Richness of the Base and Probabilistic Unsupervised Learning in Optimality Theory

Special Interest Group on Computational Morphology and Phonology Workshop. Pub Date: 2006-06-08. DOI: 10.3115/1622165.1622172

G. Jarosz

Citations: 24

Abstract

This paper proposes an unsupervised learning algorithm for Optimality Theoretic grammars, which learns a complete constraint ranking and a lexicon given only unstructured surface forms and morphological relations. The learning algorithm, which is based on the Expectation-Maximization algorithm, gradually maximizes the likelihood of the observed forms by adjusting the parameters of a probabilistic constraint grammar and a probabilistic lexicon. The paper presents the algorithm's results on three constructed language systems with different types of hidden structure: voicing neutralization, stress, and abstract vowels. In all cases the algorithm learns the correct constraint ranking and lexicon. The paper argues that the algorithm's ability to identify correct, restrictive grammars is due in part to its explicit reliance on the Optimality Theoretic notion of Richness of the Base.
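The likelihood-maximization idea described in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: the data, the two-ranking grammar, and the names `DATA`, `URS`, `RANKINGS`, `surface`, and `em` are all hypothetical. It assumes a voicing-neutralization pattern in which a "devoice" ranking turns word-final d into t, treats the grammar as a probability distribution over the two rankings and the lexicon as a distribution over candidate underlying forms per morpheme, and starts from a uniform lexicon in which both voicing values are available for every morpheme.

```python
from collections import defaultdict

# Hypothetical observations: (morpheme, context, surface form).
DATA = [
    ("A", "final", "bet"), ("A", "suffixed", "bede"),
    ("B", "final", "mat"), ("B", "suffixed", "mate"),
]

# Candidate underlying forms: both voicing values are allowed for each morpheme.
URS = {"A": ["bet", "bed"], "B": ["mat", "mad"]}

# Two toy rankings: "devoice" (markedness over faithfulness) and "faithful".
RANKINGS = ("devoice", "faithful")


def surface(ranking, ur, context):
    """Map an underlying form to its surface form under a ranking."""
    if context == "suffixed":
        return ur + "e"          # stem-final consonant is not word-final
    if ranking == "devoice" and ur.endswith("d"):
        return ur[:-1] + "t"     # word-final devoicing
    return ur


def em(iterations=50):
    """Run EM over ranking and lexicon probabilities on the toy data."""
    p_rank = {r: 1.0 / len(RANKINGS) for r in RANKINGS}
    p_lex = {m: {u: 1.0 / len(us) for u in us} for m, us in URS.items()}
    for _ in range(iterations):
        rank_counts = defaultdict(float)
        lex_counts = {m: defaultdict(float) for m in URS}
        # E-step: posterior over (ranking, underlying form) for each observation.
        for morph, context, observed in DATA:
            joint = {
                (r, u): p_rank[r] * p_lex[morph][u]
                for r in RANKINGS
                for u in URS[morph]
                if surface(r, u, context) == observed
            }
            total = sum(joint.values())  # nonzero for this toy data
            for (r, u), weight in joint.items():
                rank_counts[r] += weight / total
                lex_counts[morph][u] += weight / total
        # M-step: renormalize the expected counts.
        p_rank = {r: rank_counts[r] / len(DATA) for r in RANKINGS}
        new_lex = {}
        for m in URS:
            m_total = sum(lex_counts[m].values())
            new_lex[m] = {u: lex_counts[m][u] / m_total for u in URS[m]}
        p_lex = new_lex
    return p_rank, p_lex


if __name__ == "__main__":
    ranks, lexicon = em()
    print(ranks)
    print(lexicon)
```

In this sketch the alternation in morpheme A (final t, suffixed d) is what pushes probability toward the devoicing ranking and toward a voiced underlying form for A, while morpheme B settles on a voiceless one.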
