Jesús Cascón Katchadourian, Carlos Rodríguez Domínguez, Francisco Carranza García, Daniel Torres Salinas
{"title":"GeoAcademy: web platform and algorithm for automatic detection and location of geographic coordinates and toponyms in scientific articles","authors":"Jesús Cascón Katchadourian, Carlos Rodríguez Domínguez, Francisco Carranza García, Daniel Torres Salinas","doi":"10.3989/redc.2023.4.1393","DOIUrl":null,"url":null,"abstract":"The following study relates the qualities and uses of the GeoAcademy Project, a program designed with the aim of geolocating scientific articles automatically, such articles would be found in Scopus, Web of Science, or similar databases. An algorithm has been developed with the intention of capturing geographical coordinates or toponyms contained within the documents in order to perform reliable geolocation. In the methodology, we describe the stages of the project that have been necessary so as to build a sample database concerning the Sierra Nevada (Spain), as well as the development of the algorithm. The technical data regarding the employment of the algorithm on the sample documents and its levels of success are included in the results, as is an explanation of the platform containing web maps which can be utilised to show the texts which have been geolocated. In conclusion we outline the obstacles faced, potential bibliometric uses and the advantages it offers as a reference resource and source of information.","PeriodicalId":45937,"journal":{"name":"Revista Espanola De Documentacion Cientifica","volume":"43 1","pages":"0"},"PeriodicalIF":1.0000,"publicationDate":"2023-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista Espanola De Documentacion Cientifica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3989/redc.2023.4.1393","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The following study relates the qualities and uses of the GeoAcademy Project, a program designed with the aim of geolocating scientific articles automatically, such articles would be found in Scopus, Web of Science, or similar databases. An algorithm has been developed with the intention of capturing geographical coordinates or toponyms contained within the documents in order to perform reliable geolocation. In the methodology, we describe the stages of the project that have been necessary so as to build a sample database concerning the Sierra Nevada (Spain), as well as the development of the algorithm. The technical data regarding the employment of the algorithm on the sample documents and its levels of success are included in the results, as is an explanation of the platform containing web maps which can be utilised to show the texts which have been geolocated. In conclusion we outline the obstacles faced, potential bibliometric uses and the advantages it offers as a reference resource and source of information.
下面的研究涉及到GeoAcademy项目的质量和用途,这是一个旨在自动定位科学文章的程序,这些文章可以在Scopus, Web of Science或类似的数据库中找到。已经开发了一种算法,目的是捕获文档中包含的地理坐标或地名,以便执行可靠的地理定位。在方法中,我们描述了为建立关于内华达山脉(西班牙)的样本数据库所必需的项目阶段,以及算法的开发。关于在样本文档上使用算法的技术数据及其成功程度都包含在结果中,这是对包含可用于显示已定位文本的网络地图的平台的解释。最后,我们概述了面临的障碍、潜在的文献计量学用途以及它作为参考资源和信息来源所提供的优势。
期刊介绍:
Revista española de Documentación Científica (REDC) is a journal edited by the Instituto de Estudios Documentales sobre Ciencia y Tecnología (IEDCYT, formerly CINDOC) belonging to the Consejo Superior de Investigaciones Científicas (CSIC). It is published quarterly since 1977. The main objective of this journal is to contribute to the dissemination of knowledge amongst researchers in the field of Library and Information Science and those involved in the use of scientific, technical and strategic information for science policy and decision making. REDC includes research papers dealing with experimental and theoretical topics. The articles published in REDC include titles, abstracts and key-words in English in order to facilitate its international visibility.