Phylogeny of the Turkic Languages Inferred from Basic Vocabulary: Limitations of the Lexicostatistical Methods in an Intensive Contact Situation

Journal impact factor: 2.1; category: Language & Linguistics
Ilya M Egorov, Anna V Dybo, Alexei S Kassian
Journal of Language Evolution, published 2022-07-06
DOI: 10.1093/jole/lzac006 (https://doi.org/10.1093/jole/lzac006)

Abstract

This article provides an attempt to revise the phylogenetic structure of the Turkic family using a computational lexicostatistical approach. The methodological framework of the present research is characterized by the following features: (1) wordlists with strictly controlled semantics; (2) step-by-step reconstruction using Swadesh wordlists for proto-languages; (3) three stages of post-processing of the input data (analysis of root cognacy, elimination of derivational drift, and optimization of homoplasy); (4) application of several computational algorithms (Starling neighbor-joining, Bayesian MCMC, and maximum parsimony). The analysis provided confirms the status of Chuvash as the first outlier and suggests a subsequent multifurcation of Proto-Nuclear-Turkic into eight branches. The Siberian Turkic group is a purely areal unity, that is, Yakut-Dolgan, Tofa-Tuvinian, Khakas-Mrassu, Sarygh Yugur and Altai do not form a clade. Altai is grouped together with the Kipchak languages as a separate taxon; it does not show a particularly close relationship with Kirghiz, which belongs to another Kipchak subgroup. Karluk is a low-level taxon inside the Kipchak clade.
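
As a rough illustration of the lexicostatistical idea behind the approach described above, the sketch below computes pairwise lexical distances from cognate-coded wordlists. It is not the authors' procedure: the language labels, concepts, and cognate-class IDs are hypothetical placeholders, and the distance (one minus the share of comparable concepts with matching cognate classes) is just one simple choice. A matrix of this kind is the usual input to distance-based tree methods such as neighbor-joining.

```python
from itertools import combinations

# Hypothetical toy data: concept -> cognate-class ID for each placeholder language.
# None marks a missing entry. These are NOT data from the paper.
WORDLISTS = {
    "lang_A": {"hand": 1, "water": 1, "stone": 1, "fire": 1},
    "lang_B": {"hand": 1, "water": 1, "stone": 2, "fire": 1},
    "lang_C": {"hand": 2, "water": 1, "stone": 2, "fire": None},
}


def lexical_distance(a, b):
    """Return 1 - (share of comparable concepts with the same cognate class)."""
    # Only concepts attested in both languages are comparable.
    shared = [c for c in a if c in b and a[c] is not None and b[c] is not None]
    if not shared:
        raise ValueError("no comparable concepts")
    matches = sum(1 for c in shared if a[c] == b[c])
    return 1.0 - matches / len(shared)


def distance_matrix(wordlists):
    """Pairwise distances keyed by alphabetically ordered (language, language) tuples."""
    return {
        (x, y): lexical_distance(wordlists[x], wordlists[y])
        for x, y in combinations(sorted(wordlists), 2)
    }


if __name__ == "__main__":
    for pair, d in distance_matrix(WORDLISTS).items():
        print(pair, round(d, 3))
```

Skipping concepts with missing entries, rather than counting them as mismatches, is an assumption made here for simplicity; a real study would need an explicit policy for gaps, synonyms, and borrowings.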