{"title":"An algorithm for learning phonological classes from distributional similarity","authors":"Connor Mayer","doi":"10.1017/S0952675720000056","DOIUrl":null,"url":null,"abstract":"An important question in phonology is to what degree the learner uses distributional information rather than substantive properties of speech sounds when learning phonological structure. This paper presents an algorithm that learns phonological classes from only distributional information: the contexts in which sounds occur. The input is a segmental corpus, and the output is a set of phonological classes. The algorithm is first tested on an artificial language, with both overlapping and nested classes reflected in the distribution, and retrieves the expected classes, performing well as distributional noise is added. It is then tested on four natural languages. It distinguishes between consonants and vowels in all cases, and finds more detailed, language-specific structure. These results improve on past approaches, and are encouraging, given the paucity of the input. More refined models may provide additional insight into which phonological classes are apparent from the distributions of sounds in natural languages.","PeriodicalId":46804,"journal":{"name":"Phonology","volume":"37 1","pages":"91 - 131"},"PeriodicalIF":0.7000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1017/S0952675720000056","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phonology","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1017/S0952675720000056","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 8
Abstract
An important question in phonology is to what degree the learner uses distributional information rather than substantive properties of speech sounds when learning phonological structure. This paper presents an algorithm that learns phonological classes from only distributional information: the contexts in which sounds occur. The input is a segmental corpus, and the output is a set of phonological classes. The algorithm is first tested on an artificial language, with both overlapping and nested classes reflected in the distribution, and retrieves the expected classes, performing well as distributional noise is added. It is then tested on four natural languages. It distinguishes between consonants and vowels in all cases, and finds more detailed, language-specific structure. These results improve on past approaches, and are encouraging, given the paucity of the input. More refined models may provide additional insight into which phonological classes are apparent from the distributions of sounds in natural languages.
期刊介绍:
Phonology, published three times a year, is the only journal devoted exclusively to the discipline, and provides a unique forum for the productive interchange of ideas among phonologists and those working in related disciplines. Preference is given to papers which make a substantial theoretical contribution, irrespective of the particular theoretical framework employed, but the submission of papers presenting new empirical data of general theoretical interest is also encouraged. The journal carries research articles, as well as book reviews and shorter pieces on topics of current controversy within phonology.