{"title":"Towards robust complexity indices in linguistic typology","authors":"Y. Oh, F. Pellegrino","doi":"10.1075/sl.22034.oh","DOIUrl":"https://doi.org/10.1075/sl.22034.oh","url":null,"abstract":"\u0000There is high hope that corpus-based approaches to language complexity will contribute to explaining linguistic diversity. Several complexity indices have consequently been proposed to compare different aspects among languages, especially in phonology and morphology. However, their robustness against changes in corpus size and content hasn’t been systematically assessed, thus impeding comparability between studies. Here, we systematically test the robustness of four complexity indices estimated from raw texts and either routinely utilized in crosslinguistic studies (Type-Token Ratio and word-level Entropy) or more recently proposed (Word Information Density and Lexical Diversity). Our results on 47 languages strongly suggest that traditional indices are more prone to fluctuation than the newer ones. Additionally, we confirm with Word Information Density the existence of a cross-linguistic trade-off between word-internal and across-word distributions of information. Finally, we implement a proof of concept suggesting that modern deep-learning language models can improve the comparability across languages with non-parallel datasets.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"17 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81842582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"From grammaticalization to Diachronic Construction Grammar","authors":"Spike Gildea, Jóhanna Barðdal","doi":"10.1075/sl.20079.gil","DOIUrl":"https://doi.org/10.1075/sl.20079.gil","url":null,"abstract":"\u0000The term grammaticalization originally denoted a particular outcome of language change (lexis > morphology), then got expanded to practically all studies involving language change, the processes that create such changes, and a theory modeling these. These expansions have been challenged in the literature as conceptually flawed. A usage-based analysis of the evolution of the concept culminates in the use of the term grammaticalization as a “flag” of a particular approach to linguistics. However, the theoretical premises of grammaticalization studies are entirely compatible with the premises of Diachronic Construction Grammar (DCxG). All studies within the “expanded” concept of grammaticalization can be explicitly modeled within DCxG, which provides formalism of sufficient detail to map the gradual nature of language change in cases of grammaticalization and beyond. Consequently, the most vigorous attacks on grammaticalization lose power when grammaticalization is seen as part of a larger, more complete theory of language and language change.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90285484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The history of the polyfunctional 𗗙 jij1 in Tangut","authors":"Shuya Zhang","doi":"10.1075/sl.21085.zha","DOIUrl":"https://doi.org/10.1075/sl.21085.zha","url":null,"abstract":"\u0000This paper focuses on the history of a polyfunctional case marker 𗗙 jij1\u0000 in Tangut, an extinct Rgyalrongic language (Sino-Tibetan). This versatile case morpheme is a typological rarity of maximum syncretism among several abstract case functions, including differential object marking, the genitive, and the oblique (which overlaps with the dative). For one thing, accusatives originating from datives or spatial sources are rarely found with additional genitive functions; for another, reported instances of accusative/genitive syncretism seldom include other functions. The principal hypothesis of this paper is that the Tangut 𗗙 jij1\u0000 may be the result of multiple grammaticalization processes stemming from a proto-locative source. These processes can be subsumed under two pathways, one leading from an allative to an accusative, with an intermediate oblique stage, and the other from a locative to a genitive. Although both of these development pathways are frequently attested, the Tangut 𗗙 jij1\u0000 remains a typological rarity due to their superposition.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77548550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiple construction types for nominal expressions in Australian languages","authors":"Dana Louagie","doi":"10.1075/sl.21008.lou","DOIUrl":"https://doi.org/10.1075/sl.21008.lou","url":null,"abstract":"\u0000 This paper explores the rich diversity in structural possibilities that are available for (simple) nominal\u0000 expressions in Australian languages. First, I identify a number of construction types found across a 50 language sample, which may\u0000 be recognised by using a restricted set of parameters. I show that an important factor is whether a given parameter (such as word\u0000 order) is generalised or displayed only by some word classes. Second, I develop a four-way typology based on how the construction\u0000 types cluster in individual languages. Two types are described to some extent in the literature, but I provide a more detailed\u0000 characterisation: one mostly has flexible, non-phrasal expressions and the other mostly rigid noun phrases. The other two types\u0000 have not been described as such. One mostly has flexible phrases, which combine internal flexibility with phrasal case marking,\u0000 and the other has distinct construction types depending on the type of modifier.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"26 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78564660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The linguistics of odour in Semaq Beri and Semelai, two Austroasiatic languages of the Malay Peninsula","authors":"N. Kruspe, A. Majid","doi":"10.1075/sl.22004.kru","DOIUrl":"https://doi.org/10.1075/sl.22004.kru","url":null,"abstract":"\u0000There is a long history presuming smell is not expressible in language, but numerous studies in recent years challenge this presupposition. Large smell lexica have been reported around the world thereby showing high lexical codability in this domain. Psycholinguistic studies likewise find smell can be described with relatively high agreement, demonstrating high efficient codability. Often the two go hand-in-hand: languages with high lexical codability also display high efficient codability. This study compares two Austroasiatic (Aslian) languages – Semaq Beri and Semelai – previously shown to diverge in their efficient codability for smell: Semaq Beri showed relatively high efficient codability, whereas Semelai did not. Despite this, we demonstrate that both languages have high lexical codability, i.e., large lexica of basic smell terms. This seems to be a feature of the Aslian language family, suggesting a long-standing preoccupation with odours. More generally, the dissociation between lexical and efficient codability suggests a more nuanced approach towards linguistic expressibility is necessary.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"28 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85076889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A hitherto unnoticed type of verb-framed construction in Lithuanian and the typology of event\u0000 conflation","authors":"Axel Holvoet","doi":"10.1075/sl.21082.hol","DOIUrl":"https://doi.org/10.1075/sl.21082.hol","url":null,"abstract":"The article starts out from a Lithuanian construction denoting achievement of an excessive value of some parameter\u0000 of an incremental event. It is verb-framed, that is, the main-clause verb denotes motion along a path towards a normative value of\u0000 the parameter involved. Its implications for our understanding of the Talmyan typology of event conflation are discussed. Event\u0000 conflation in the domains of spatial motion and change of state has received most attention in the literature; differences in\u0000 saliency and elaboration, and the importance of constraints on the lexicalization of path (result) and manner, have been pointed\u0000 out. The Lithuanian verb-framed constructions discussed here are, however, similar to those in the domain of ‘temporal contouring’\u0000 in that they involve an incremental path superimposed on the path-to-result lexicalized in the embedded predicate. This extension\u0000 beyond the core domains of spatial motion and change of state represents another dimension of the Talmyan typology, largely\u0000 overlooked until now.","PeriodicalId":46377,"journal":{"name":"Studies in Language","volume":"3 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81352284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}