Distribution of Terms Across Genres in the Annotated Lithuanian Cybersecurity Corpus

Q2 Arts and Humanities
Sigita Rackevičienė, A. Utka, Agnė Bielinskienė, A. Rokas
{"title":"Distribution of Terms Across Genres in the Annotated Lithuanian Cybersecurity Corpus","authors":"Sigita Rackevičienė, A. Utka, Agnė Bielinskienė, A. Rokas","doi":"10.15388/respectus.2022.41.46.105","DOIUrl":null,"url":null,"abstract":"The paper provides results of the frequential distribution analysis of cybersecurity terms used in the Lithuanian cybersecurity corpus composed of texts of different genres. The research focuses on the following aspects: overall distribution of cybersecurity terms (their density and diversity) across genres, distribution of English and English-Lithuanian terms and their usage patterns in Lithuanian sentences, and, finally, the most frequent cybersecurity terms and their thematic groups in each genre. The research was performed in several stages: compilation of a cybersecurity corpus and its subdivision into genre-specific subcorpora, manual annotation of cybersecurity terms, automatic lemmatisation of annotated terms and, finally, quantitative analysis of the distribution of the terms across the subcorpora. The results reveal the similarities and differences of the use of cybersecurity terminology across genres which are important to consider to get a complete picture of terminology usage trends in this domain.","PeriodicalId":36933,"journal":{"name":"Respectus Philologicus","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Respectus Philologicus","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15388/respectus.2022.41.46.105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Arts and Humanities","Score":null,"Total":0}
引用次数: 0

Abstract

The paper provides results of the frequential distribution analysis of cybersecurity terms used in the Lithuanian cybersecurity corpus composed of texts of different genres. The research focuses on the following aspects: overall distribution of cybersecurity terms (their density and diversity) across genres, distribution of English and English-Lithuanian terms and their usage patterns in Lithuanian sentences, and, finally, the most frequent cybersecurity terms and their thematic groups in each genre. The research was performed in several stages: compilation of a cybersecurity corpus and its subdivision into genre-specific subcorpora, manual annotation of cybersecurity terms, automatic lemmatisation of annotated terms and, finally, quantitative analysis of the distribution of the terms across the subcorpora. The results reveal the similarities and differences of the use of cybersecurity terminology across genres which are important to consider to get a complete picture of terminology usage trends in this domain.
在标注立陶宛网络安全语料库中的跨体裁术语分布
本文提供了由不同体裁文本组成的立陶宛网络安全语料库中使用的网络安全术语的频率分布分析结果。研究重点关注以下几个方面:网络安全术语在不同类型中的总体分布(密度和多样性),英语和英语-立陶宛语术语的分布及其在立陶宛语句子中的使用模式,以及每种类型中最常见的网络安全术语及其主题组。该研究分几个阶段进行:网络安全语料库的编写及其细分为特定体裁的子语料库,网络安全术语的手动注释,注释术语的自动词法化,最后对术语在子语料库中的分布进行定量分析。结果揭示了不同类型的网络安全术语使用的异同,这对于全面了解该领域的术语使用趋势非常重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Respectus Philologicus
Respectus Philologicus Arts and Humanities-Literature and Literary Theory
CiteScore
0.30
自引率
0.00%
发文量
31
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信