{"title":"Insights into Old English lexicography: lemmatisation of gĀn and its prefix-formations using a corpus-based database","authors":"Laura García Fernández","doi":"10.1093/ijl/ecab014","DOIUrl":null,"url":null,"abstract":"\n This article contributes to Old English lexicography by providing a list of lemmas and inflectional forms for the Old English derived verbs (prefixed verbs) of gān ‘to go’. Entries for these lemmas, if listed by Old English dictionaries, are often incomplete, but, more importantly, they are not based on a lemmatised corpus. This is particularly problematic in the case of languages like Old English that are rife with morphological variation. The methodology followed in this study comprises searches on a lexical database and manual revision of the hits. The searches are launched on the lemmatiser Norna, and the hits are checked with available lexicographical sources, secondary sources and annotated corpora. Finally, ambiguous cases are examined in their context. The final list of lemmas and inflectional forms amounts to 104 inflectional forms which are attributed to 14 different lemmas, including one lemma and up to 61 inflectional forms never before listed by dictionaries. Special attention is paid to the contrast between what is attested in the Old English corpora and what is available from the sources. In addition to providing insights into the inventory of lemmas and inflectional forms for the derivatives (prefixed verbs) of the verb gān as attested in The Dictionary of Old English Corpus, which was not available until now from lexicographical sources, this article contributes recommendations for the linguistic analysis of Old English using corpus-based lexical databases.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2021-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Lexicography","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1093/ijl/ecab014","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 0
Abstract
This article contributes to Old English lexicography by providing a list of lemmas and inflectional forms for the Old English derived verbs (prefixed verbs) of gān ‘to go’. Entries for these lemmas, if listed by Old English dictionaries, are often incomplete, but, more importantly, they are not based on a lemmatised corpus. This is particularly problematic in the case of languages like Old English that are rife with morphological variation. The methodology followed in this study comprises searches on a lexical database and manual revision of the hits. The searches are launched on the lemmatiser Norna, and the hits are checked with available lexicographical sources, secondary sources and annotated corpora. Finally, ambiguous cases are examined in their context. The final list of lemmas and inflectional forms amounts to 104 inflectional forms which are attributed to 14 different lemmas, including one lemma and up to 61 inflectional forms never before listed by dictionaries. Special attention is paid to the contrast between what is attested in the Old English corpora and what is available from the sources. In addition to providing insights into the inventory of lemmas and inflectional forms for the derivatives (prefixed verbs) of the verb gān as attested in The Dictionary of Old English Corpus, which was not available until now from lexicographical sources, this article contributes recommendations for the linguistic analysis of Old English using corpus-based lexical databases.
本文提供了gān“to go”的古英语派生动词(前缀动词)的引理和屈折形式列表,为古英语词典编纂做出了贡献。这些引理的词条,如果被古英语词典列出,通常是不完整的,但更重要的是,它们不是基于引理语料库。这在像古英语这样充斥着形态变异的语言中尤其有问题。本研究采用的方法包括在词汇数据库中搜索和手动修改点击量。搜索在lemmatiser Norna上启动,并通过可用的词典来源、辅助来源和注释语料库来检查点击量。最后,在它们的上下文中检查模糊的案例。引理和屈折形式的最终列表共有104种屈折形式,归属于14种不同的引理,包括一个引理和多达61种以前从未被词典列出的屈折形式。人们特别注意古英语语料库中所证实的内容与来源中所能获得的内容之间的对比。除了深入了解《古英语语料库词典》(the Dictionary of Old English Corpus)中所证实的动词gān的派生词(前缀动词)的引理和屈折形式,本文还为使用基于语料库的词汇数据库对古英语进行语言学分析提供了建议。
期刊介绍:
The International Journal of Lexicography was launched in 1988. Interdisciplinary as well as international, it is concerned with all aspects of lexicography, including issues of design, compilation and use, and with dictionaries of all languages, though the chief focus is on dictionaries of the major European languages - monolingual and bilingual, synchronic and diachronic, pedagogical and encyclopedic. The Journal recognizes the vital role of lexicographical theory and research, and of developments in related fields such as computational linguistics, and welcomes contributions in these areas.