INFORMATION SYSTEM OF IDENTIFICATION OF TERMS AND ABBREVIATIONS IN TEXT DOCUMENTS

Vitalii Danylyk, V. Lytvyn, Solomiya Mushasta
Journal: Herald of Khmelnytskyi National University. Technical sciences
Published: 2023-04-27
DOI: 10.31891/2307-5732-2023-319-1-81-83 (https://doi.org/10.31891/2307-5732-2023-319-1-81-83)

Abstract

This paper describes the design and operation of a system for identifying terms and abbreviations in text documents. Developing such a system is a pressing task, since this identification problem often arises in the military sphere. The implementation takes into account that a single term or abbreviation may be explained differently in different regulatory documents: all available explanations are attached to the term or abbreviation and are used while the system runs. A distinguishing feature of the system is its use of natural language processing methods, since terms can occur in different grammatical cases. The system is implemented with ready-made Python packages suited to these tasks: Tkinter and PyMuPDF. Examples of the system's operation are given. The developed system is used in practice. In the course of the work, the problems are studied and solutions are sought, and an information system is developed that processes documents by integrating definitions of potentially unknown terms and abbreviations into them, so that officers can use any literature without difficulty, since every term and abbreviation is explained. To generalize the documentation, all necessary requirements for the system are defined. To design the architecture correctly and allocate the functional tasks of the system, a system analysis is performed and a conceptual model is built. Based on this information, the necessary diagrams are built in UML notation; they depict the relationships between objects and the overall architecture of the system. The architecture is designed so that the component subsystems and the system as a whole can be easily extended. Development concludes with testing and deployment of the project. The paper describes how the end user operates the system's components and how the end user deploys the information system. The object of the study is the factors that slow down command and control by commanders of tactical units; these factors can delay decisions and affect their correctness. The subject of the study is the elimination of these slowing factors by working with military data.
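
The two ideas at the core of the abstract, keeping every available explanation for a term and finding terms even when they appear in an inflected form, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the glossary entries, the source names "Document A" and "Document B", and the functions `add_definition`, `build_pattern`, and `identify` are hypothetical. The suffix-tolerant regular expression is only a crude stand-in for the natural language processing the paper refers to, and plain strings stand in for text that the real system would presumably extract from PDF files with PyMuPDF.

```python
import re
from collections import defaultdict


def add_definition(glossary, term, explanation, source):
    """Attach one more explanation to a term; earlier ones are kept."""
    glossary[term].append((explanation, source))


def build_pattern(term):
    """Build a regex for a term (assumed matching rules, not the paper's)."""
    if term.isupper():
        # Abbreviations: exact, case-sensitive, whole-word match.
        return re.compile(r"\b" + re.escape(term) + r"\b")
    # Ordinary terms: allow up to three extra trailing letters per word
    # as a rough approximation of inflected forms.
    words = [re.escape(word) + r"\w{0,3}" for word in term.split()]
    return re.compile(r"\b" + r"\s+".join(words) + r"\b", re.IGNORECASE)


def identify(text, glossary):
    """Return every glossary term found in text with all its explanations."""
    found = {}
    for term, explanations in glossary.items():
        if build_pattern(term).search(text):
            found[term] = list(explanations)
    return found


# Made-up illustrative entries; "CP" deliberately has two explanations.
glossary = defaultdict(list)
add_definition(glossary, "CP", "command post", "Document A")
add_definition(glossary, "CP", "checkpoint", "Document B")
add_definition(glossary, "tactical unit", "a unit that performs tactical tasks", "Document A")

sample = "The CP was moved after two tactical units arrived."
for term, explanations in identify(sample, glossary).items():
    for explanation, source in explanations:
        print(f"{term}: {explanation} ({source})")
```

Storing explanations as a list per term means that a second regulatory document never overwrites the first, so the reader is shown every candidate meaning. For a language with rich case morphology, a proper lemmatizer would be a more reliable choice than the suffix wildcard used here.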