{"title":"Utilising heterogeneous language resources for term extraction in maritime domains","authors":"Gisle Andersen","doi":"10.1075/term.20024.and","DOIUrl":null,"url":null,"abstract":"\n The development of terminologies for domains where these are lacking is a time-consuming and costly task. This\n article takes a methodological perspective and addresses a general methodological question: how can we, with limited funding,\n utilise to a maximal degree, existing language resources to create a terminology at a relatively low cost? Although an important\n player in the maritime industries for many centuries, Norway has not prioritised the systematic development of an official\n maritime terminology. The article therefore focuses specifically on efforts to develop a national resource for maritime domains.\n The article describes efforts to create a corpus of popular science and a parallel corpus of technical texts. Six different term\n extraction methods are applied. These include corpus-based statistical analyses of frequency, collocation and keyness, as well as\n bilingual term extraction. Finally, the pros and cons of each method are evaluated by means of a cost-benefit analysis.","PeriodicalId":44429,"journal":{"name":"Terminology","volume":" ","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2021-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Terminology","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/term.20024.and","RegionNum":4,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 2
Abstract
The development of terminologies for domains where these are lacking is a time-consuming and costly task. This
article takes a methodological perspective and addresses a general methodological question: how can we, with limited funding,
utilise to a maximal degree, existing language resources to create a terminology at a relatively low cost? Although an important
player in the maritime industries for many centuries, Norway has not prioritised the systematic development of an official
maritime terminology. The article therefore focuses specifically on efforts to develop a national resource for maritime domains.
The article describes efforts to create a corpus of popular science and a parallel corpus of technical texts. Six different term
extraction methods are applied. These include corpus-based statistical analyses of frequency, collocation and keyness, as well as
bilingual term extraction. Finally, the pros and cons of each method are evaluated by means of a cost-benefit analysis.
期刊介绍:
Terminology is an independent journal with a cross-cultural and cross-disciplinary scope. It focusses on the discussion of (systematic) solutions not only of language problems encountered in translation, but also, for example, of (monolingual) problems of ambiguity, reference and developments in multidisciplinary communication. Particular attention will be given to new and developing subject areas such as knowledge representation and transfer, information technology tools, expert systems and terminological databases. Terminology encompasses terminology both in general (theory and practice) and in specialized fields (LSP), such as physics.