{"title":"Two Ways of Representing Specialist Knowledge: Analysing the Botanical Lexicon in Diccionario de la Lengua Española and Diccionario del Español de México","authors":"Jesús Camacho-Niño","doi":"10.1093/ijl/ecad014","DOIUrl":"https://doi.org/10.1093/ijl/ecad014","url":null,"abstract":"\u0000 This article explores the lexicographic codification of botanical knowledge in two general dictionaries: Diccionario de la lengua española (2014 [2021]), produced by the Real Academia Española in Spain, and Diccionario del español de México (2019), published by El Colegio de México. It begins with a historical overview of the inclusion of botanical terminology in general dictionaries by the Real Academia Española and by other authors. Then, it analyses the degree to which the botanical terms included in each dictionary meet the inclusion criteria established. Finally, it focuses on diatechnical labelling and the differences between the two dictionaries in terms of the disciplines to which terms are assigned. This allows us to draw conclusions regarding the representation of this field of knowledge in the two dictionaries and the lexicographic techniques used to produce them.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48898426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Categorizing obsolete, archaic, and classic words in an Indonesian dictionary","authors":"Dewi Puspita, K. Yusuf","doi":"10.1558/lexi.24757","DOIUrl":"https://doi.org/10.1558/lexi.24757","url":null,"abstract":"The global era has led to fairly rapid changes in language. Many words have become obsolete. There are also many words whose meanings have become irrelevant nowadays. Unfortunately, in Indonesian dictionaries, especially in the Kamus Besar Bahasa Indonesia (KBBI; Comprehensive dictionary of Indonesian), there is no label for obsolete words. There is only an “archaic” label to mark all outdated words and a “classic” label to mark classical words. Another labeling problem in the KBBI is that there are no clear guidelines or criteria to determine when a word is considered archaic, obsolete, or classic. The absence of clear criteria causes some entries that have been labeled “archaic” in the KBBI to seem obsolete, and sometimes words labeled as classic get confused with archaic words. The aim of this article is to investigate ways of categorizing archaic, obsolete, and classic words in the KBBI. This research was conducted by comparing several forms and entry criteria labeled “archaic,” “obsolete,” and “classic” in several dictionaries, in particular dictionaries of foreign languages whose lexicographic traditions are well established. Each dictionary has its own criteria for classifying a word as archaic, obsolete, or classic, and we can learn from them. The findings suggest that checking the corpus data set is the easiest way to categorize words according to their labels.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"16 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75430982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"use of verb valency patterns in the Indonesian monolingual learner’s dictionary","authors":"Dora Amalía","doi":"10.1558/lexi.24995","DOIUrl":"https://doi.org/10.1558/lexi.24995","url":null,"abstract":"The verb is one of the most perplexing features for learners of BIPA (Bahasa Indonesia bagi Penutur Asing “Indonesian for foreign speakers”). Indonesian verbs are particularly rich in affixes that correspond to their numerous senses. The learners find it challenging to use verbs with the appropriate affixes in a sentence structure. To function effectively as a learning tool, dictionaries must provide morphological and grammatical information. Frame semantics theory is used in this research to determine a verb’s meaning based on its semantic context and frame. The verb is described by identifying the grammatical constructions in which it participates, and by characterizing all of the obligatory and optional types of companions. By doing this, we can obtain the verb valency pattern to add to a dictionary’s morphological and grammatical information. This study aims to create entry models of affixed verbs with the valency pattern. The six transitive verbs for discussion comprise a variety of affixes with various senses: mempersembahkan “to present/dedicate”; membersihkan “to clean”; mencintai “to love”; memperlancar “to expedite”; memperbaiki “to fix”; and memberlakukan “to apply”. These entries serve as models for the BIPA learner’s dictionary.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"48 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81701567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digitalizing a local language dictionary","authors":"Winda Luthfita, Selly Rizki Yanita","doi":"10.1558/lexi.25076","DOIUrl":"https://doi.org/10.1558/lexi.25076","url":null,"abstract":"The National Agency for Language Development and Cultivation (henceforth, Badan Bahasa) has published many dictionaries as a government agency under the Ministry of Education and Culture of Indonesia. More than 100 dictionaries have been published since 1977. Some dictionaries have been revised by adding new entries and senses. In alignment with technological developments, Badan Bahasa has started an integration project that aims to provide an online application for their language products. In 2015, it started Program Pengayaan Kosakata (Word proposal application program), and this was followed by the launch of the online version of the Kamus Besar Bahasa Indonesia (KBBI; Comprehensive dictionary of Indonesian), Tesaurus Tematis Bahasa Indonesia (Thematic thesaurus of Indonesian), and Ensiklopedia Sastra Indonesia (Encyclopedia of Indonesian literature) in 2016. In 2020, Badan Bahasa started the development of Aplikasi Pangkalan Data Kamus, also called Aplikasi Kompilasi Kamus (AKK; Dictionary compilation application). This online application accommodates at least three kinds of dictionary – a local language dictionary, specialized dictionary, and bilingual dictionary – published by Badan Bahasa. The process has continued by developing a digitalization project targeting the digitalization of print versions of specialized dictionaries, Indonesian-local language dictionaries, and local language-Indonesian dictionaries. This article aims to discuss some challenges regarding the digitalization of dictionaries arising from the print versions, the dictionary structure, and the dictionary interface, and puts forward some solutions to deal with the issues. The research method uses qualitative methods for observing dictionary files and examining microstructure issues throughout the whole process. The results of this study are expected to support the digitalization process and dictionary development in Indonesia.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"7 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87690298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"use of lexical bundles in an online comprehensive dictionary of Indonesian (KBBI Daring)","authors":"Adi Budiwiyanto","doi":"10.1558/lexi.25177","DOIUrl":"https://doi.org/10.1558/lexi.25177","url":null,"abstract":"Research on lexical bundles in the last few decades has focused mostly on written registers, especially academic writing. In this study, I investigate the use of lexical bundles in a different genre – dictionaries. As a lexical bundle is formulaic language specific to a particular register, I hypothesize that particular lexical bundles are used in dictionaries. The research focus of this study is the extent to which lexical bundles are used in the online version of the Kamus Besar Bahasa Indonesia (KBBI Daring; Comprehensive dictionary of Indonesian online), especially in the lemma, definition, and example sections. In addition, the study examines the characteristics of lexical bundles in dictionaries. The approach used is corpus based. As reference bundles, a total of 517 lexical bundles were extracted from the IndonesianWeb Corpus (available from SketchEngine). The bundles were then analyzed for their use in KBBI Daring in terms of their frequency, structure, and function. The results showed that the use of lexical bundles in KBBI Daring was mostly found in the definition section. The bundles found were generally in the form of phrases rather than clauses. In terms of structure, the lexical bundles were dominated by incomplete structures. The bundles, either in the definition or example sections, were mostly in the form of yang-clause fragments, such as yang digunakan untuk “that is used for,” yang terdiri atas “that consists of,” yang terbuat dari “that is made of/from,” yang berasal dari “that comes from,” and yang berhubungan dengan “that relates to.” In terms of function, presenting content is the dominant function in the KBBI, while organizing text is the least prevalent function. In a nutshell, each section in the dictionary, especially in the KBBI, has its own character.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"35 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75706202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Avoiding Recursion in the Representation of Subsenses and Subentries in Dictionaries","authors":"M. Mechura","doi":"10.1093/ijl/ecad012","DOIUrl":"https://doi.org/10.1093/ijl/ecad012","url":null,"abstract":"\u0000 Recursion, and recursion-like design patterns, are used in the entry schemas of dictionaries to model subsenses and subentries. Recursion occurs when elements of a given type, such as sense, are allowed to contain elements of the same or similar type, such as sense or subsense. This article argues that recursion unnecessarily increases the computational complexity of entries, making dictionaries less easily processable by machines. The article will show how entry schemas can be simplified by re-engineering subsenses and subentries as relations (like in a relational database) such that we only have flat lists of senses and entries, while the is-subsense-of and is-subentry-of relations are encoded using pairs of unique identifiers. This design pattern losslessly records the same information as recursion (including – importantly – the listing order of items inside an entry) but decreases the complexity of the entry structure and makes dictionary entries more easily machine-processable.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43354780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sebastian Przybyszewski, Iwona Kosek, Monika Czerepowicka
{"title":"Not Only Meaning… Verbel: The Electronic Dictionary of Paradigms of Polish Verbal Multiword Expressions","authors":"Sebastian Przybyszewski, Iwona Kosek, Monika Czerepowicka","doi":"10.1093/ijl/ecad008","DOIUrl":"https://doi.org/10.1093/ijl/ecad008","url":null,"abstract":"\u0000 This article presents Verbel: The Electronic Dictionary of Paradigms of Polish Verbal Multiword Expressions (MWEs) and discusses theoretical problems connected with compiling such a dictionary for inflectionally complex languages such as Polish. The dictionary includes over 5,000 Polish verbal MWEs and explicitly presents their forms and constraints in inflection. It also provides grammatical, semantic, pragmatic and prescriptive commentaries. The first part of the article covers the theoretical and methodological basis used in the compilation of the dictionary. Generally, a verbal MWE is inflected according to the paradigm of the verb which is its main component. However, MWEs may have some specific inflectional constraints connected with other factors (e.g. semantic, pragmatic), which result in different paradigms for verbal MWEs and for the verbs that are their main components. In the second part, the conception and content of the dictionary are discussed. Finally, the natural language processing tools that underlie the work on the dictionary are described.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49103709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wheat or Chaff? A Compound Selection Model Based on Look-Up Data","authors":"Mikkel Ekeland Paulsen","doi":"10.1093/ijl/ecad013","DOIUrl":"https://doi.org/10.1093/ijl/ecad013","url":null,"abstract":"Abstract Which compounds should be included in general-purpose dictionaries is often an open question that is answered with a case-by-case consideration of all compounds above a certain corpus frequency threshold. Another way to determine which compounds should be listed, is to examine which compounds, or rather which compound properties, are in demand by the users. This study uses look-up data from the two officially sanctioned, general-purpose dictionaries of Norwegian (Bokmålsordboka and Nynorskordboka) to derive an explicit compound selection model that performs with comparable sensitivity and specificity as the traditional procedure. These findings demonstrate that it is indeed possible to arrive at a fully operational and explicit compound selection model that meets the needs of users. With such a tool at their disposal, lexicographers would be able to separate the wheat from the chaff in the boundless field that is the compound lexicon of North Germanic Languages.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135693008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hausa Dictionary for Everyday Use: Hausa - English/ English - Hausa. Ƙamusun Hausa na yau da kullum: Hausa - Inglilishi/ Ingilishi - Hausa. Paul Newman and Roxana Ma Newman","authors":"C. Schmaling","doi":"10.1093/ijl/ecad010","DOIUrl":"https://doi.org/10.1093/ijl/ecad010","url":null,"abstract":"","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45152399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Definition, Presentation and Automatic Generation of Contextual Data in Lexicography","authors":"M. J. Domínguez, R. Gouws","doi":"10.1093/ijl/ecac020","DOIUrl":"https://doi.org/10.1093/ijl/ecac020","url":null,"abstract":"\u0000 This paper deals with several aspects of context in lexicography. Section 1 briefly mentions some different approaches to the concept context in various fields. Section 2 puts the focus on different uses and perceptions of the concept context in lexicography, contrasting it with related concepts, such as cotext, contextualization and contextual information. A more comprehensive discussion also covers different aspects of the occurrence of the concept context in dictionary research, with specific reference to central aspects of the so-called inner and outer context. Various portals, dictionaries and dictionary entries will illustrate the above-mentioned approaches. Section 3 approaches the subject from a user perspective. Section 4 addresses the question How can contextual data be extracted or generated? To answer this question, some methods and tools for (automatic) acquisition and analysis of contextual data, – in particular of the local contextual data in terms of Faber and León-Araúz (2016) – are introduced. Examples of these are lexical databases or semantic networks, like WordNet, and corpora, like Sketch Engine, or predictive methods, like Word2vec and similar ones. Some advantages and disadvantages of specific data acquisition tools used for the analysis of local contextual data are indicated. This section also contributes to a more detailed discussion of the automatic generation of the so-called local syntactic-semantic context or word environment, specifically of the building of syntactic-semantic argument patterns and their examples.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41887432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}