{"title":"Comparison of Approaches to the Extraction of Mathematical Methods from Scientific Texts","authors":"Z. S. Ismagulov, D. V. Kosyakov, A. E. Guskov","doi":"10.3103/S0005105524700328","DOIUrl":null,"url":null,"abstract":"<p>The processes of extracting and comparing mathematical methods from scientific publications using different approaches—large language models, machine learning based classification method, and probabilistic topic modelling—are discussed. The superiority of the model obtained with probabilistic topic modelling when studying each article separately and of the large language model when studying whole projects is revealed, as well as the significant superiority of combining the results of these two approaches.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 6","pages":"441 - 452"},"PeriodicalIF":0.5000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0005105524700328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The processes of extracting and comparing mathematical methods from scientific publications using different approaches—large language models, machine learning based classification method, and probabilistic topic modelling—are discussed. The superiority of the model obtained with probabilistic topic modelling when studying each article separately and of the large language model when studying whole projects is revealed, as well as the significant superiority of combining the results of these two approaches.
期刊介绍:
Automatic Documentation and Mathematical Linguistics is an international peer reviewed journal that covers all aspects of automation of information processes and systems, as well as algorithms and methods for automatic language analysis. Emphasis is on the practical applications of new technologies and techniques for information analysis and processing.