Թվայնացված գրականությունից մինչեվ լեզվական շտեմարաններ. դրանց նշանակությունը, հեռանկարներն ու մարտահրավերները

Gayane Hovhannisyan, Srbuhi Aydinyan
{"title":"Թվայնացված գրականությունից մինչեվ լեզվական շտեմարաններ. դրանց նշանակությունը, հեռանկարներն ու մարտահրավերները","authors":"Gayane Hovhannisyan, Srbuhi Aydinyan","doi":"10.52027/18294685-hga2023.sp","DOIUrl":null,"url":null,"abstract":"The Republic of Armenia has traditionally had a solid culture of preserving written heritage. However, in the last few decades, due to various geopolitical circumstances, we have lagged behind the global advancement of digitization, information, and communication technologies in terms of natural language processing for the Armenian language, access to the expanding opportunities of systematising and utilising the collated enormous amount of literature. This paper presents the current global state of natural language processing and the related methodological problems of the field in our country. The process of dataset development and consumption, from the primary stage of text digitising collected in physical databases, libraries, and archives, to the very use of the knowledge and information stored in these texts for various scientific, educational and practical purposes, requires a clear scientific-methodological program introducing not only the importance and perspectives of the field but will also ensure the conditions for its sustainable development, including the processing of written (and spoken) digitized language data and the retrieval and “industrialization” of the information and knowledge contained in the language corpora.","PeriodicalId":189164,"journal":{"name":"Bulletin of Armenian Libraries","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bulletin of Armenian Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52027/18294685-hga2023.sp","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The Republic of Armenia has traditionally had a solid culture of preserving written heritage. However, in the last few decades, due to various geopolitical circumstances, we have lagged behind the global advancement of digitization, information, and communication technologies in terms of natural language processing for the Armenian language, access to the expanding opportunities of systematising and utilising the collated enormous amount of literature. This paper presents the current global state of natural language processing and the related methodological problems of the field in our country. The process of dataset development and consumption, from the primary stage of text digitising collected in physical databases, libraries, and archives, to the very use of the knowledge and information stored in these texts for various scientific, educational and practical purposes, requires a clear scientific-methodological program introducing not only the importance and perspectives of the field but will also ensure the conditions for its sustainable development, including the processing of written (and spoken) digitized language data and the retrieval and “industrialization” of the information and knowledge contained in the language corpora.
Թվայնացվածգրականությունիցմինչեվլեզվականշտեմարաններ。դրանցնշանակությունը,հեռանկարներնումարտահրավերները
亚美尼亚共和国传统上具有保存书面遗产的坚实文化。然而,在过去几十年,由于各种地缘政治环境,我们在亚美尼亚语的自然语言处理方面落后于全球数位化、资讯和通讯技术的进步,也无法获得系统化和利用整理后的大量文献的机会。本文介绍了自然语言处理在国际上的发展现状和我国在该领域存在的相关方法论问题。从物理数据库、图书馆和档案馆中收集的文本数字化的初级阶段,到将这些文本中存储的知识和信息用于各种科学、教育和实践目的,数据集的开发和消费过程需要一个明确的科学方法计划,不仅要介绍该领域的重要性和观点,还要确保其可持续发展的条件。包括对书面(和口头)数字化语言数据的处理,以及语料库中包含的信息和知识的检索和“产业化”。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信