Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology: Latest Papers

Orthographic vs. Semantic Representations for Unsupervised Morphological Paradigm Clustering

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.10

E. M. Perkoff, Josh Daniels, Alexis Palmer

Abstract: This paper presents two different systems for unsupervised clustering of morphological paradigms, in the context of the SIGMORPHON 2021 Shared Task 2. The goal of this task is to correctly cluster words in a given language by their inflectional paradigm, without any previous knowledge of the language and without supervision from labeled data of any sort. The words in a single morphological paradigm are different inflectional variants of an underlying lemma, meaning that the words share a common core meaning. They also - usually - show a high degree of orthographical similarity. Following these intuitions, we investigate KMeans clustering using two different types of word representations: one focusing on orthographical similarity and the other focusing on semantic similarity. Additionally, we discuss the merits of randomly initialized centroids versus pre-defined centroids for clustering. Pre-defined centroids are identified based on either a standard longest common substring algorithm or a connected graph method built off of longest common substring. For all development languages, the character-based embeddings perform similarly to the baseline, and the semantic embeddings perform well below the baseline. Analysis of the systems’ errors suggests that clustering based on orthographic representations is suitable for a wide range of morphological mechanisms, particularly as part of a larger system.
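
The abstract mentions a "standard longest common substring algorithm" as one way of identifying pre-defined centroids. As a rough illustration only, the Python sketch below computes the longest common substring of two word forms with dynamic programming. The function name and the example words are assumptions made for this illustration and are not taken from the paper; how the authors turn shared substrings into centroids is not described in this listing.

```python
def longest_common_substring(a: str, b: str) -> str:
    """Return the longest contiguous substring shared by a and b.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    best_len = 0  # length of the best match found so far
    best_end = 0  # index in `a` just past the end of the best match
    # prev[j] = length of the common suffix of a[:i-1] and b[:j]
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the match that ended at the previous characters.
                curr[j] = prev[j - 1] + 1
                if curr[j] > best_len:
                    best_len, best_end = curr[j], i
        prev = curr
    return a[best_end - best_len:best_end]


if __name__ == "__main__":
    # Two inflectional variants that share a stem-like substring.
    print(longest_common_substring("walking", "walked"))
```

For two forms of the same lemma, the shared substring often approximates the stem, which is the intuition behind using it to seed clusters. Ties are resolved in favour of the earliest match in the first argument.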

Citations: 1

What transfers in morphological inflection? Experiments with analogical models

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.18

M. Elsner

Citations: 2

Results of the Second SIGMORPHON Shared Task on Multilingual Grapheme-to-Phoneme Conversion

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.13

Lucas F. E. Ashby, Travis M. Bartley, S. Clematide, L. Del Signore, Cameron Gibson, K. Gorman, Yeonju Lee-Sikka, Peter Makarov, Aidan Malanoski, Sean Miller, Omar Ortiz, R. Raff, A. Sengupta, Bora Seo, Y. Spektor, Winnie Yan

Citations: 10

Recognizing Reduplicated Forms: Finite-State Buffered Machines

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.20

Yang Wang

Citations: 3

Avengers, Ensemble! Benefits of ensembling in grapheme-to-phoneme prediction

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.16

Vagrant Gautam, Wang Yau Li, Zafarullah Mahmood, Frederic Mailhot, Shreekantha Nadig, Riqiang Wang, Nathan Zhang

Citations: 2

Towards Detection and Remediation of Phonemic Confusion

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.1

F. Roewer-Després, A. Yeung, Ilan Kogan

Citations: 0

Incorporating tone in the calculation of phonotactic probability

Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology Pub Date: 2021 DOI: 10.18653/v1/2021.sigmorphon-1.4

James P. Kirby

Citations: 1