{"title":"use of verb valency patterns in the Indonesian monolingual learner’s dictionary","authors":"Dora Amalía","doi":"10.1558/lexi.24995","DOIUrl":"https://doi.org/10.1558/lexi.24995","url":null,"abstract":"The verb is one of the most perplexing features for learners of BIPA (Bahasa Indonesia bagi Penutur Asing “Indonesian for foreign speakers”). Indonesian verbs are particularly rich in affixes that correspond to their numerous senses. The learners find it challenging to use verbs with the appropriate affixes in a sentence structure. To function effectively as a learning tool, dictionaries must provide morphological and grammatical information. Frame semantics theory is used in this research to determine a verb’s meaning based on its semantic context and frame. The verb is described by identifying the grammatical constructions in which it participates, and by characterizing all of the obligatory and optional types of companions. By doing this, we can obtain the verb valency pattern to add to a dictionary’s morphological and grammatical information. This study aims to create entry models of affixed verbs with the valency pattern. The six transitive verbs for discussion comprise a variety of affixes with various senses: mempersembahkan “to present/dedicate”; membersihkan “to clean”; mencintai “to love”; memperlancar “to expedite”; memperbaiki “to fix”; and memberlakukan “to apply”. These entries serve as models for the BIPA learner’s dictionary.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"48 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81701567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Digitalizing a local language dictionary","authors":"Winda Luthfita, Selly Rizki Yanita","doi":"10.1558/lexi.25076","DOIUrl":"https://doi.org/10.1558/lexi.25076","url":null,"abstract":"The National Agency for Language Development and Cultivation (henceforth, Badan Bahasa) has published many dictionaries as a government agency under the Ministry of Education and Culture of Indonesia. More than 100 dictionaries have been published since 1977. Some dictionaries have been revised by adding new entries and senses. In alignment with technological developments, Badan Bahasa has started an integration project that aims to provide an online application for their language products. In 2015, it started Program Pengayaan Kosakata (Word proposal application program), and this was followed by the launch of the online version of the Kamus Besar Bahasa Indonesia (KBBI; Comprehensive dictionary of Indonesian), Tesaurus Tematis Bahasa Indonesia (Thematic thesaurus of Indonesian), and Ensiklopedia Sastra Indonesia (Encyclopedia of Indonesian literature) in 2016. In 2020, Badan Bahasa started the development of Aplikasi Pangkalan Data Kamus, also called Aplikasi Kompilasi Kamus (AKK; Dictionary compilation application). This online application accommodates at least three kinds of dictionary – a local language dictionary, specialized dictionary, and bilingual dictionary – published by Badan Bahasa. The process has continued by developing a digitalization project targeting the digitalization of print versions of specialized dictionaries, Indonesian-local language dictionaries, and local language-Indonesian dictionaries. This article aims to discuss some challenges regarding the digitalization of dictionaries arising from the print versions, the dictionary structure, and the dictionary interface, and puts forward some solutions to deal with the issues. The research method uses qualitative methods for observing dictionary files and examining microstructure issues throughout the whole process. The results of this study are expected to support the digitalization process and dictionary development in Indonesia.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"7 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87690298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"use of lexical bundles in an online comprehensive dictionary of Indonesian (KBBI Daring)","authors":"Adi Budiwiyanto","doi":"10.1558/lexi.25177","DOIUrl":"https://doi.org/10.1558/lexi.25177","url":null,"abstract":"Research on lexical bundles in the last few decades has focused mostly on written registers, especially academic writing. In this study, I investigate the use of lexical bundles in a different genre – dictionaries. As a lexical bundle is formulaic language specific to a particular register, I hypothesize that particular lexical bundles are used in dictionaries. The research focus of this study is the extent to which lexical bundles are used in the online version of the Kamus Besar Bahasa Indonesia (KBBI Daring; Comprehensive dictionary of Indonesian online), especially in the lemma, definition, and example sections. In addition, the study examines the characteristics of lexical bundles in dictionaries. The approach used is corpus based. As reference bundles, a total of 517 lexical bundles were extracted from the IndonesianWeb Corpus (available from SketchEngine). The bundles were then analyzed for their use in KBBI Daring in terms of their frequency, structure, and function. The results showed that the use of lexical bundles in KBBI Daring was mostly found in the definition section. The bundles found were generally in the form of phrases rather than clauses. In terms of structure, the lexical bundles were dominated by incomplete structures. The bundles, either in the definition or example sections, were mostly in the form of yang-clause fragments, such as yang digunakan untuk “that is used for,” yang terdiri atas “that consists of,” yang terbuat dari “that is made of/from,” yang berasal dari “that comes from,” and yang berhubungan dengan “that relates to.” In terms of function, presenting content is the dominant function in the KBBI, while organizing text is the least prevalent function. In a nutshell, each section in the dictionary, especially in the KBBI, has its own character.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"35 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75706202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Avoiding Recursion in the Representation of Subsenses and Subentries in Dictionaries","authors":"M. Mechura","doi":"10.1093/ijl/ecad012","DOIUrl":"https://doi.org/10.1093/ijl/ecad012","url":null,"abstract":"\u0000 Recursion, and recursion-like design patterns, are used in the entry schemas of dictionaries to model subsenses and subentries. Recursion occurs when elements of a given type, such as sense, are allowed to contain elements of the same or similar type, such as sense or subsense. This article argues that recursion unnecessarily increases the computational complexity of entries, making dictionaries less easily processable by machines. The article will show how entry schemas can be simplified by re-engineering subsenses and subentries as relations (like in a relational database) such that we only have flat lists of senses and entries, while the is-subsense-of and is-subentry-of relations are encoded using pairs of unique identifiers. This design pattern losslessly records the same information as recursion (including – importantly – the listing order of items inside an entry) but decreases the complexity of the entry structure and makes dictionary entries more easily machine-processable.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43354780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sebastian Przybyszewski, Iwona Kosek, Monika Czerepowicka
{"title":"Not Only Meaning… Verbel: The Electronic Dictionary of Paradigms of Polish Verbal Multiword Expressions","authors":"Sebastian Przybyszewski, Iwona Kosek, Monika Czerepowicka","doi":"10.1093/ijl/ecad008","DOIUrl":"https://doi.org/10.1093/ijl/ecad008","url":null,"abstract":"\u0000 This article presents Verbel: The Electronic Dictionary of Paradigms of Polish Verbal Multiword Expressions (MWEs) and discusses theoretical problems connected with compiling such a dictionary for inflectionally complex languages such as Polish. The dictionary includes over 5,000 Polish verbal MWEs and explicitly presents their forms and constraints in inflection. It also provides grammatical, semantic, pragmatic and prescriptive commentaries. The first part of the article covers the theoretical and methodological basis used in the compilation of the dictionary. Generally, a verbal MWE is inflected according to the paradigm of the verb which is its main component. However, MWEs may have some specific inflectional constraints connected with other factors (e.g. semantic, pragmatic), which result in different paradigms for verbal MWEs and for the verbs that are their main components. In the second part, the conception and content of the dictionary are discussed. Finally, the natural language processing tools that underlie the work on the dictionary are described.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49103709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Wheat or Chaff? A Compound Selection Model Based on Look-Up Data","authors":"Mikkel Ekeland Paulsen","doi":"10.1093/ijl/ecad013","DOIUrl":"https://doi.org/10.1093/ijl/ecad013","url":null,"abstract":"Abstract Which compounds should be included in general-purpose dictionaries is often an open question that is answered with a case-by-case consideration of all compounds above a certain corpus frequency threshold. Another way to determine which compounds should be listed, is to examine which compounds, or rather which compound properties, are in demand by the users. This study uses look-up data from the two officially sanctioned, general-purpose dictionaries of Norwegian (Bokmålsordboka and Nynorskordboka) to derive an explicit compound selection model that performs with comparable sensitivity and specificity as the traditional procedure. These findings demonstrate that it is indeed possible to arrive at a fully operational and explicit compound selection model that meets the needs of users. With such a tool at their disposal, lexicographers would be able to separate the wheat from the chaff in the boundless field that is the compound lexicon of North Germanic Languages.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135693008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hausa Dictionary for Everyday Use: Hausa - English/ English - Hausa. Ƙamusun Hausa na yau da kullum: Hausa - Inglilishi/ Ingilishi - Hausa. Paul Newman and Roxana Ma Newman","authors":"C. Schmaling","doi":"10.1093/ijl/ecad010","DOIUrl":"https://doi.org/10.1093/ijl/ecad010","url":null,"abstract":"","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45152399","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Definition, Presentation and Automatic Generation of Contextual Data in Lexicography","authors":"M. J. Domínguez, R. Gouws","doi":"10.1093/ijl/ecac020","DOIUrl":"https://doi.org/10.1093/ijl/ecac020","url":null,"abstract":"\u0000 This paper deals with several aspects of context in lexicography. Section 1 briefly mentions some different approaches to the concept context in various fields. Section 2 puts the focus on different uses and perceptions of the concept context in lexicography, contrasting it with related concepts, such as cotext, contextualization and contextual information. A more comprehensive discussion also covers different aspects of the occurrence of the concept context in dictionary research, with specific reference to central aspects of the so-called inner and outer context. Various portals, dictionaries and dictionary entries will illustrate the above-mentioned approaches. Section 3 approaches the subject from a user perspective. Section 4 addresses the question How can contextual data be extracted or generated? To answer this question, some methods and tools for (automatic) acquisition and analysis of contextual data, – in particular of the local contextual data in terms of Faber and León-Araúz (2016) – are introduced. Examples of these are lexical databases or semantic networks, like WordNet, and corpora, like Sketch Engine, or predictive methods, like Word2vec and similar ones. Some advantages and disadvantages of specific data acquisition tools used for the analysis of local contextual data are indicated. This section also contributes to a more detailed discussion of the automatic generation of the so-called local syntactic-semantic context or word environment, specifically of the building of syntactic-semantic argument patterns and their examples.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41887432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Compilation of the English–Slovene Dictionary of Criminal Justice and Security Abbreviations","authors":"Mojca Kompara Lukančič","doi":"10.1093/ijl/ecad009","DOIUrl":"https://doi.org/10.1093/ijl/ecad009","url":null,"abstract":"\u0000 The article highlights the compilation of specialised dictionaries, focusing mainly on the characteristics of dictionary entries of abbreviations in terminological dictionaries for Slovene. Special attention is given to the compilation and characteristics of the dictionary entries of abbreviations in the Slovene terminological dictionaries published by the Fran Ramovš Institute of the Slovenian language. The paper presents an overview of the characteristics of the terminological dictionaries for Slovene and provides examples of good practice that were used in the compilation of the English–Slovene Dictionary of Criminal Justice and Security Abbreviations. The paper presents the microstructure of the dictionary, and selected examples of Slovene and English–Slovene dictionary entries, as well as the overall compilation process, the selection of terms and the inclusion of related subfields. The English–Slovene Dictionary of Criminal Justice and Security Abbreviations is a growing dictionary compiled in the freely available dictionary mask Termania, and freely accessible on the portal Slovarji.si. The dictionary is the first Slovene terminological dictionary of abbreviations for the field of criminal justice and security.","PeriodicalId":45657,"journal":{"name":"International Journal of Lexicography","volume":" ","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46870637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}