Workshop on Computational Approaches to Historical Language Change: Latest Papers

A New Framework for Fast Automated Phonological Reconstruction Using Trimmed Alignments and Sound Correspondence Patterns
Workshop on Computational Approaches to Historical Language Change Pub Date : 2022-04-10 DOI: 10.48550/arXiv.2204.04619
Johann-Mattis List, Robert Forkel, N. Hill
Abstract: Computational approaches in historical linguistics have been increasingly applied during the past decade and many new methods that implement parts of the traditional comparative method have been proposed. Despite these increased efforts, there are not many easy-to-use and fast approaches for the task of phonological reconstruction. Here we present a new framework that combines state-of-the-art techniques for automated sequence comparison with novel techniques for phonetic alignment analysis and sound correspondence pattern detection to allow for the supervised reconstruction of word forms in ancestral languages. We test the method on a new dataset covering six groups from three different language families. The results show that our method yields promising results while at the same time being not only fast but also easy to apply and expand.
Citations: 9
Time-Aware Ancient Chinese Text Translation and Inference
Workshop on Computational Approaches to Historical Language Change Pub Date : 2021-07-07 DOI: 10.18653/v1/2021.lchange-1.1
Ernie Chang, Yow-Ting Shiue, Hui-Syuan Yeh, Vera Demberg
Abstract: In this paper, we aim to address the challenges surrounding the translation of ancient Chinese text: (1) The linguistic gap due to the difference in eras results in translations that are poor in quality, and (2) most translations are missing the contextual information that is often very crucial to understanding the text. To this end, we improve upon past translation techniques by proposing the following: We reframe the task as a multi-label prediction task where the model predicts both the translation and its particular era. We observe that this helps to bridge the linguistic gap as chronological context is also used as auxiliary information. We validate our framework on a parallel corpus annotated with chronology information and show experimentally its efficacy in producing quality translation outputs. We release both the code and the data for future research.
Citations: 5
Three-part diachronic semantic change dataset for Russian
Workshop on Computational Approaches to Historical Language Change Pub Date : 2021-06-15 DOI: 10.18653/v1/2021.lchange-1.2
Andrey Kutuzov, Lidia Pivovarova
Abstract: We present a manually annotated lexical semantic change dataset for Russian: RuShiftEval. Its novelty is ensured by a single set of target words annotated for their diachronic semantic shifts across three time periods, while the previous work either used only two time periods, or different sets of target words. The paper describes the composition and annotation procedure for the dataset. In addition, it is shown how the ternary nature of RuShiftEval allows to trace specific diachronic trajectories: ‘changed at a particular time period and stable afterwards’ or ‘was changing throughout all time periods’. Based on the analysis of the submissions to the recent shared task on semantic change detection for Russian, we argue that correctly identifying such trajectories can be an interesting sub-task itself.
Citations: 8
A diachronic evaluation of gender asymmetry in euphemism
Workshop on Computational Approaches to Historical Language Change Pub Date : 2021-06-03 DOI: 10.18653/v1/2021.lchange-1.5
A. Kapron-King, Yang Xu
Abstract: The use of euphemisms is a known driver of language change. It has been proposed that women use euphemisms more than men. Although there have been several studies investigating gender differences in language, the claim about euphemism usage has not been tested comprehensively through time. If women do use euphemisms more, this could mean that women also lead the formation of new euphemisms and language change over time. Using four large diachronic text corpora of English, we evaluate the claim that women use euphemisms more than men through a quantitative analysis. We assembled a list of 106 euphemism-taboo pairs to analyze their relative use through time by each gender in the corpora. Contrary to the existing belief, our results show that women do not use euphemisms with a higher proportion than men. We repeated the analysis using different subsets of the euphemism-taboo pairs list and found that our result was robust. Our study indicates that in a broad range of settings involving both speech and writing, and with varying degrees of formality, women do not use or form euphemisms more than men.
Citations: 5
Bhāṣācitra: Visualising the dialect geography of South Asia
Workshop on Computational Approaches to Historical Language Change Pub Date : 2021-05-28 DOI: 10.18653/v1/2021.lchange-1.7
Aryaman Arora, Adam Farris, R. Gopalakrishnan, Samopriya Basu
Abstract: We present Bhāṣācitra, a dialect mapping system for South Asia built on a database of linguistic studies of languages of the region annotated for topic and location data. We analyse language coverage and look towards applications to typology by visualising example datasets. The application is not only meant to be useful for feature mapping, but also serves as a new kind of interactive bibliography for linguists of South Asian languages.
Citations: 4
Lexicon of Changes: Towards the Evaluation of Diachronic Semantic Shift in Chinese
Workshop on Computational Approaches to Historical Language Change Pub Date : 2022 DOI: 10.18653/v1/2022.lchange-1.11
Jing Chen, Emmanuele Chersoni, Chu-Ren Huang
Abstract: Recent research has brought a wind of using computational approaches to the classic topic of semantic change, aiming to tackle one of the most challenging issues in the evolution of human language. While several methods for detecting semantic change have been proposed, such studies are limited to a few languages, where evaluation datasets are available. This paper presents the first dataset for evaluating Chinese semantic change in contexts preceding and following the Reform and Opening-up, covering a 50-year period in Modern Chinese. Following the DURel framework, we collected 6,000 human judgments for the dataset. We also reported the performance of alignment-based word embedding models on this evaluation dataset, achieving high and significant correlation scores.
Citations: 3
The GLAUx corpus: methodological issues in designing a long-term, diverse, multi-layered corpus of Ancient Greek
Workshop on Computational Approaches to Historical Language Change Pub Date : 2021 DOI: 10.18653/v1/2021.lchange-1.6
Alek Keersmaekers
Abstract: This paper describes the GLAUx project (“the Greek Language Automated”), an ongoing effort to develop a large long-term diachronic corpus of Greek, covering sixteen centuries of literary and non-literary material annotated with NLP methods. After providing an overview of related corpus projects and discussing the general architecture of the corpus, it zooms in on a number of larger methodological issues in the design of historical corpora. These include the encoding of textual variants, handling extralinguistic variation and annotating linguistic ambiguity. Finally, the long- and short-term perspectives of this project are discussed.
Citations: 4
[LSCDiscovery shared task] CoToHiLi at LSCDiscovery: the Role of Linguistic Features in Predicting Semantic Change
Workshop on Computational Approaches to Historical Language Change Pub Date : 2022 DOI: 10.18653/v1/2022.lchange-1.20
Ana Sabina Uban, Alina Maria Cristea, Anca Dinu, Liviu P. Dinu, Simona Georgescu, Laurentiu Zoicas
Abstract: This paper presents the contributions of the CoToHiLi team for the LSCDiscovery shared task on semantic change in the Spanish language. We participated in both tasks (graded discovery and binary change, including sense gain and sense loss) and proposed models based on word embedding distances combined with hand-crafted linguistic features, including polysemy, number of neological synonyms, and relation to cognates in English. We find that models that include linguistically informed features combined using weights assigned manually by experts lead to promising results.
Citations: 2
[LSCDiscovery shared task] HSE at LSCDiscovery in Spanish: Clustering and Profiling for Lexical Semantic Change Discovery
Workshop on Computational Approaches to Historical Language Change Pub Date : 2022 DOI: 10.18653/v1/2022.lchange-1.21
Kseniia Kashleva, Alexander Shein, Elizaveta Tukhtina, Svetlana Vydrina
Abstract: This paper describes the methods used for lexical semantic change discovery in Spanish. We tried the method based on BERT embeddings with clustering, the method based on grammatical profiles and the grammatical profiles method enhanced with permutation tests. BERT embeddings with clustering turned out to show the best results for both graded and binary semantic change detection outperforming the baseline. Our best submission for graded discovery was the 3rd best result, while for binary detection it was the 2nd place (precision) and the 7th place (both F1-score and recall). Our highest precision for binary detection was 0.75 and it was achieved due to improving grammatical profiling with permutation tests.
Citations: 3
What is Done is Done: an Incremental Approach to Semantic Shift Detection
Workshop on Computational Approaches to Historical Language Change Pub Date : 2022 DOI: 10.18653/v1/2022.lchange-1.4
Francesco Periti, A. Ferrara, S. Montanelli, M. Ruskov
Abstract: Contextual word embedding techniques for semantic shift detection are receiving more and more attention. In this paper, we present What is Done is Done (WiDiD), an incremental approach to semantic shift detection based on incremental clustering techniques and contextual embedding methods to capture the changes over the meanings of a target word along a diachronic corpus. In WiDiD, the word contexts observed in the past are consolidated as a set of clusters that constitute the “memory” of the word meanings observed so far. Such a memory is exploited as a basis for subsequent word observations, so that the meanings observed in the present are stratified over the past ones.
Citations: 1