{"title":"Approach to Creating an HTML Version of a Scientific Article from a Manuscript in MS Word Format for a Low-Budget Publisher","authors":"R. Y. Skornyakova","doi":"10.3103/S0005105525700475","DOIUrl":"10.3103/S0005105525700475","url":null,"abstract":"<p>The most common approach to creating an HTML version of a journal article used by scientific publishers is to first create an XML version of the article in accordance with the NISO Journal Article Tag Suite (JATS) standard, followed by automatic conversion to HTML and PDF. However, obtaining an XML version from a manuscript in the MS Word .docx format, often used by authors, when it contains a large number of complex formulas and tables, is a difficult task. Existing software either cannot cope with it in full or is expensive and inaccessible to small publishers on a limited budget. This paper proposes an approach to creating an HTML version of a journal article from a manuscript in .docx format containing formulas in MathType format, which does not require significant financial or time costs from the publisher. It also describes a prototype converter from .docx format to HTML and JATS XML formats that implements this approach and applicable to KIAM preprints.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 6 supplement","pages":"S376 - S388"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automatic Annotation of Training Datasets in Computer Vision Using Machine Learning Methods","authors":"A. K. Zhuravlyov, K. A. Grigorian","doi":"10.3103/S0005105525700347","DOIUrl":"10.3103/S0005105525700347","url":null,"abstract":"<p>This paper addresses the automatic annotation of training datasets in the field of computer vision using machine learning methods. Data annotation is a key stage in the development and training of deep learning models, but creating labeled data often requires significant time and labor. This paper proposes a mechanism for automatic annotation based on the use of convolutional neural networks and active learning methods. The proposed methodology includes the analysis and evaluation of existing approaches to automatic annotation. The effectiveness of the proposed solutions is assessed using publicly available datasets. The results demonstrate that the proposed method significantly reduces the time required for data annotation, although operator intervention is still necessary. The literature review presents an analysis of modern annotation methods and existing automatic systems, providing a better understanding of the context and advantages of the proposed approach. The conclusion discusses the study achievements, its limitations, and possible directions for future research in this field.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 5 supplement","pages":"S279 - S282"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Criteria for Ranking Conferences","authors":"A. S. Kozitsin","doi":"10.3103/S000510552570044X","DOIUrl":"10.3103/S000510552570044X","url":null,"abstract":"<p>The ranking of scientific conferences plays a key role in the academic world, determining the significance and prestige of each event. The main results of ranking from the point of view of personalities are as follows: determining the quality and influence of the scientific conference; a guide for selecting a conference; encouragement to conduct quality research; formation of a scientific community; and improvement of the visibility and influence of the conference for the scientific community. This paper provides an overview of currently existing conference catalogs and conference ranking systems, both automatic ones and with the participation of expert councils. It is noted that the purpose of creating a national ranking system is to promote and popularize domestic conferences and journals. From a review of currently existing conference catalogs and conference ranking systems, the following criteria for ranking conferences can be formulated: indicators of publication activity, based on the results of the analysis of published conference materials; credibility of the speakers and the organizing committee of the conference; number of presentations and the ratio of the number of presentations to the number of conference participants; time for reviewing applications submitted to the conference; ratio of submitted and accepted applications; and retrospective and geographical parameters.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 6 supplement","pages":"S352 - S363"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using Semantic Search to Select and Rank Geological Publications","authors":"M. I. Patuk, V. V. Naumova","doi":"10.3103/S0005105525700372","DOIUrl":"10.3103/S0005105525700372","url":null,"abstract":"<p>It is essential to aggregate scientific information for a comprehensive analysis of geological objects. This paper explores the potential and possibilities of semantic search to select thematically similar publications in the geological domain. Various language models are examined in the context of identifying similarities and differences in texts describing mineral deposits. After additional training, a significant improvement in search results from language models is demonstrated. Two web services are presented, based on a method for calculating the semantic similarity between texts and providing a quantitative assessment of their similarity.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 5 supplement","pages":"S294 - S298"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143835593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. P. Tuchkova, K. P. Belyaev, G. M. Mikhaylov, K. A. Romashina
{"title":"Analysis of Intra-Annual Variability of Heat Fluxes in the North Atlantic Based on Approximation of Trajectories of the Stochastic Diffusion Process","authors":"N. P. Tuchkova, K. P. Belyaev, G. M. Mikhaylov, K. A. Romashina","doi":"10.3103/S0005105525700487","DOIUrl":"10.3103/S0005105525700487","url":null,"abstract":"<p>To analyze heat fluxes, observational data for 1979–2018 were used for the North Atlantic. The spatiotemporal variability of the total heat flux was modeled using stochastic diffusion. The coefficients of the stochastic differential equation were estimated by using nonparametric statistics. Previously, the existence and uniqueness of a solution in the strong sense of the stochastic differential equation generated by the constructed diffusion process was proven when Kolmogorov’s conditions were met. In this work, the coefficients of the equation were approximated using trigonometric polynomials whose amplitudes and phases depended on the flow values. Using a given series of 40 years in length from 1979 to 2018, spatial maps and time curves were constructed. The results are shown for 1999 and 2018, and their comparative analysis is also carried out. Numerical calculations were performed on the Lomonosov-2 supercomputer at Lomonosov Moscow State University.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 6 supplement","pages":"S389 - S397"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Presentation of the Results of a Scientific Institute in the Form of a Knowledge Graph in a Semantic Library","authors":"O. M. Ataeva, V. A. Serebryakov, N. P. Tuchkova","doi":"10.3103/S0005105525700396","DOIUrl":"10.3103/S0005105525700396","url":null,"abstract":"<p>The problem of the presentation of the scientific results of academic institute in a digital environment is considered. A new look at the knowledge space of a scientific institute constitutes a natural stage in the development of WEB technologies. The data structure inherent in previous studies allows the organization, searching, and navigation of them using a knowledge graph, like a version of the semantic library LibMeta. The knowledge graph gives a more complete and high-quality idea of the knowledge space, often removing the cognitive load in the perception of complex structures and data connections.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 6 supplement","pages":"S307 - S317"},"PeriodicalIF":0.5,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automation of Footages Sorting by Screenplay Text for Video Editing","authors":"A. D. Nemanov, I. S. Shakhova","doi":"10.3103/S0005105525700281","DOIUrl":"10.3103/S0005105525700281","url":null,"abstract":"<p>The video editing process involves numerous labor-intensive operations for sorting and preparing footages, requiring significant time investment. This article describes the development of a software solution that uses machine learning technology to automate these processes. The primary focus is on creating a system capable of classifying and sorting media files according to the screenplay text, thereby increasing the efficiency of material preparation for editing. The system includes modules for speech recognition, audio and video classification, and algorithms for determining screenplay compliance. Testing showed that the proposed system correctly classifies media files in most cases, significantly reducing rough-cut editing time.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S216 - S226"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Taking the Structure of the Document into Account in the Method of Automatic Annotation of Mathematical Concepts in Educational Texts","authors":"K. S. Nikolaev","doi":"10.3103/S0005105525700293","DOIUrl":"10.3103/S0005105525700293","url":null,"abstract":"<p>The enrichment of educational texts with semantic content (in particular, adding hyperlinks to the pages of a service that displays detailed information about concepts in the text) helps increase efficiency in students’ assimilation of the material. Existing methods of semantic markup of educational texts do not take into account the structural features of such documents, and this leads to an excessive recognition of concepts. This article describes the development of the method of automatic annotation of mathematical concepts in educational and mathematical texts by adding functionality to account for the structure of an educational document. The main purpose of this method is to process educational materials of the distance education course Technology for Solving Planimetric Problems. Following a single template when creating course pages allows you to apply an analysis of webpage markup and keywords that are used by the course creators. The main task in this process is to determine the type of table cell containing text fragments of educational materials. In accordance with the recommendations of the course creators, definitions should be highlighted in the cells that contain the task statement, as well as in the blocks where the input data of the task is indicated. The type of table cells is determined by analyzing their attributes and searching for keywords in their contents. This limitation of recognizable text fragments will improve the student’s perception of the course pages and improve the quality of their learning.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S227 - S233"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A New Approach to Creating a Corpus of Video Game Texts","authors":"N. R. Nurlygaianov, V. V. Kugurakova","doi":"10.3103/S000510552570030X","DOIUrl":"10.3103/S000510552570030X","url":null,"abstract":"<p>The problem of high and increasing cost of video game development is considered, and to solve it is proposed to apply procedural content generation, which will reduce development costs. The work is a part of a large-scale research on automatic prototyping of video games and is devoted to the processing of game scenarios, i.e. natural language texts. It is proposed to extract the necessary entities from the scripts and pass them to further steps of the algorithm, which will generate game resources based on the textual descriptions. There are several publications devoted to game text processing, in which several different structures for storing the extracted information are proposed. In this paper we propose a universal format that is suitable for processing the text of any video game and allows to create a corpus of texts for use in further research and automatic generation of game prototypes.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S234 - S240"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Approach to Detecting Pedestrian Movement Using the Method of Histograms of Oriented Gradients","authors":"M. V. Bobyr, N. A. Milostnaya, N. I. Khrapova","doi":"10.3103/S0005105525700244","DOIUrl":"10.3103/S0005105525700244","url":null,"abstract":"<p>An approach to automatically recognizing the movement of people at a pedestrian crossing is presented in this article. This approach includes two main procedures, for each of which commands are given in the C# programming language using the EMGU computer vision library. In the first procedure, pedestrian detection is performed using the combination of a directional gradient histogram and support vector methods. The second procedure allows you to read frames from a video sequence and process them. This approach allows for the detection of the movements of people at a pedestrian crossing without using specialized neural networks. At the same time, the method proposed in this article had sufficient reliability in terms of human movement recognition, which indicates its applicability to real conditions.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S169 - S176"},"PeriodicalIF":0.5,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143707155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}