{"title":"Using Rank-Frequency and Type-Token Statistics to Compare Morphological Typology in the Celtic Languages","authors":"Andrew Wilson, Rosie Harvey","doi":"10.1080/09296174.2018.1560122","DOIUrl":null,"url":null,"abstract":"ABSTRACT Previous work has used Greenberg’s synthetism index to compare three of the Celtic languages – Irish, Welsh, and Breton – but not the other three languages, namely Scottish Gaelic, Manx, and Cornish. This paper extends this earlier work by comparing all six Celtic languages, including two periods of Irish (Early Modern and Present Day). The analysis is based on a random sample of 210 parallel psalm texts (30 for each language). However, Greenberg’s synthetism index is problematic because there are no operational standards for counting morphemes within words. We therefore apply a newer typological indicator (B7), which is based solely on lexical rank-frequency statistics. We also explore whether type-token counts alone can provide similar information. The B7 indicator shows that both varieties of Irish, together with Welsh and Cornish, tend more towards synthetism, whereas Manx tends more towards analytism. Breton and Scottish Gaelic do not show a clear tendency in either direction. Rankings using type-token statistics vary considerably and do not tell the same story.","PeriodicalId":45514,"journal":{"name":"Journal of Quantitative Linguistics","volume":null,"pages":null},"PeriodicalIF":0.7000,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/09296174.2018.1560122","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Quantitative Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1080/09296174.2018.1560122","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 2
Abstract
ABSTRACT Previous work has used Greenberg’s synthetism index to compare three of the Celtic languages – Irish, Welsh, and Breton – but not the other three languages, namely Scottish Gaelic, Manx, and Cornish. This paper extends this earlier work by comparing all six Celtic languages, including two periods of Irish (Early Modern and Present Day). The analysis is based on a random sample of 210 parallel psalm texts (30 for each language). However, Greenberg’s synthetism index is problematic because there are no operational standards for counting morphemes within words. We therefore apply a newer typological indicator (B7), which is based solely on lexical rank-frequency statistics. We also explore whether type-token counts alone can provide similar information. The B7 indicator shows that both varieties of Irish, together with Welsh and Cornish, tend more towards synthetism, whereas Manx tends more towards analytism. Breton and Scottish Gaelic do not show a clear tendency in either direction. Rankings using type-token statistics vary considerably and do not tell the same story.
期刊介绍:
The Journal of Quantitative Linguistics is an international forum for the publication and discussion of research on the quantitative characteristics of language and text in an exact mathematical form. This approach, which is of growing interest, opens up important and exciting theoretical perspectives, as well as solutions for a wide range of practical problems such as machine learning or statistical parsing, by introducing into linguistics the methods and models of advanced scientific disciplines such as the natural sciences, economics, and psychology.