Silvia Stopponi, N. Pedrazzini, Saskia Peels-Matthey, Barbara McGillivray, Malvina Nissim
{"title":"Natural Language Processing for Ancient Greek","authors":"Silvia Stopponi, N. Pedrazzini, Saskia Peels-Matthey, Barbara McGillivray, Malvina Nissim","doi":"10.1075/dia.23013.sto","DOIUrl":null,"url":null,"abstract":"\n Computational methods have produced meaningful and usable results to study word semantics, including semantic\n change. These methods, belonging to the field of Natural Language Processing, have recently been applied to ancient languages; in\n particular, language modelling has been applied to Ancient Greek, the language on which we focus. In this contribution we explain\n how vector representations can be computed from word co-occurrences in a corpus and can be used to locate words in a semantic space,\n and what kind of semantic information can be extracted from language models. We compare three different kinds of language models\n that can be used to study Ancient Greek semantics: a count-based model, a word embedding model and a syntactic embedding model;\n and we show examples of how the quality of their representations can be assessed. We highlight the advantages and potential of\n these methods, especially for the study of semantic change, together with their limitations.","PeriodicalId":44637,"journal":{"name":"Diachronica","volume":null,"pages":null},"PeriodicalIF":0.6000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diachronica","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/dia.23013.sto","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
Computational methods have produced meaningful and usable results to study word semantics, including semantic
change. These methods, belonging to the field of Natural Language Processing, have recently been applied to ancient languages; in
particular, language modelling has been applied to Ancient Greek, the language on which we focus. In this contribution we explain
how vector representations can be computed from word co-occurrences in a corpus and can be used to locate words in a semantic space,
and what kind of semantic information can be extracted from language models. We compare three different kinds of language models
that can be used to study Ancient Greek semantics: a count-based model, a word embedding model and a syntactic embedding model;
and we show examples of how the quality of their representations can be assessed. We highlight the advantages and potential of
these methods, especially for the study of semantic change, together with their limitations.
期刊介绍:
Diachronica provides a forum for the presentation and discussion of information concerning all aspects of language change in any and all languages of the globe. Contributions which combine theoretical interest and philological acumen are especially welcome. Diachronica appears three times per year, publishing articles, review articles, book reviews, and a miscellanea section including notes, reports and discussions.