{"title":"使用语义搜索选择和排序地质出版物","authors":"M. I. Patuk, V. V. Naumova","doi":"10.3103/S0005105525700372","DOIUrl":null,"url":null,"abstract":"<p>It is essential to aggregate scientific information for a comprehensive analysis of geological objects. This paper explores the potential and possibilities of semantic search to select thematically similar publications in the geological domain. Various language models are examined in the context of identifying similarities and differences in texts describing mineral deposits. After additional training, a significant improvement in search results from language models is demonstrated. Two web services are presented, based on a method for calculating the semantic similarity between texts and providing a quantitative assessment of their similarity.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 5 supplement","pages":"S294 - S298"},"PeriodicalIF":0.5000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Semantic Search to Select and Rank Geological Publications\",\"authors\":\"M. I. Patuk, V. V. Naumova\",\"doi\":\"10.3103/S0005105525700372\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>It is essential to aggregate scientific information for a comprehensive analysis of geological objects. This paper explores the potential and possibilities of semantic search to select thematically similar publications in the geological domain. Various language models are examined in the context of identifying similarities and differences in texts describing mineral deposits. After additional training, a significant improvement in search results from language models is demonstrated. Two web services are presented, based on a method for calculating the semantic similarity between texts and providing a quantitative assessment of their similarity.</p>\",\"PeriodicalId\":42995,\"journal\":{\"name\":\"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS\",\"volume\":\"58 5 supplement\",\"pages\":\"S294 - S298\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2025-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.3103/S0005105525700372\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0005105525700372","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Using Semantic Search to Select and Rank Geological Publications
It is essential to aggregate scientific information for a comprehensive analysis of geological objects. This paper explores the potential and possibilities of semantic search to select thematically similar publications in the geological domain. Various language models are examined in the context of identifying similarities and differences in texts describing mineral deposits. After additional training, a significant improvement in search results from language models is demonstrated. Two web services are presented, based on a method for calculating the semantic similarity between texts and providing a quantitative assessment of their similarity.
期刊介绍:
Automatic Documentation and Mathematical Linguistics is an international peer reviewed journal that covers all aspects of automation of information processes and systems, as well as algorithms and methods for automatic language analysis. Emphasis is on the practical applications of new technologies and techniques for information analysis and processing.