Extraction Of Wikidata Knowledge For The Metadata Formation For Documents of Digital Mathematical Collections

Полина Олеговна Гафурова, Александр Михайлович Елизаров, Евгений Константинович Липачёв
{"title":"Extraction Of Wikidata Knowledge For The Metadata Formation For Documents of Digital Mathematical Collections","authors":"Полина Олеговна Гафурова, Александр Михайлович Елизаров, Евгений Константинович Липачёв","doi":"10.26907/1562-5419-2021-24-6-1023-1059","DOIUrl":null,"url":null,"abstract":"Methods for creating digital mathematical collections that include unstructured sets of documents are presented. These sets contain materials from scientific conferences, as well as articles from the archives of mathematical journals of the \"pre-digital\" period.\nUsing the software tools of the metadata factory of the digital mathematical library Lobachevskii DML, a mandatory set of metadata for digital collection documents was formed. To refine and replenish the metadata sets, knowledge extraction methods from Wikidata were used.\nTo search Wikidata for information about digital collection documents and their authors, a system of SPARQL queries has been developed. A set of Wikidata entities is defined, which determine the features of the search, as well as the subsequent filtering of the results.\nMethods for clarifying and supplementing the bibliographic references given in the articles are proposed. When forming the metadata of documents of retrocollections, a search was made in Wikidata for information about the years of life of the authors of articles, as well as URLs of web pages with information about articles and their authors. The results of the formation of several new digital collections of the Lobachevskii-DML digital library are presented.","PeriodicalId":262909,"journal":{"name":"Russian Digital Libraries Journal","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Russian Digital Libraries Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26907/1562-5419-2021-24-6-1023-1059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Methods for creating digital mathematical collections that include unstructured sets of documents are presented. These sets contain materials from scientific conferences, as well as articles from the archives of mathematical journals of the "pre-digital" period. Using the software tools of the metadata factory of the digital mathematical library Lobachevskii DML, a mandatory set of metadata for digital collection documents was formed. To refine and replenish the metadata sets, knowledge extraction methods from Wikidata were used. To search Wikidata for information about digital collection documents and their authors, a system of SPARQL queries has been developed. A set of Wikidata entities is defined, which determine the features of the search, as well as the subsequent filtering of the results. Methods for clarifying and supplementing the bibliographic references given in the articles are proposed. When forming the metadata of documents of retrocollections, a search was made in Wikidata for information about the years of life of the authors of articles, as well as URLs of web pages with information about articles and their authors. The results of the formation of several new digital collections of the Lobachevskii-DML digital library are presented.
面向数字数学馆藏文档元数据形成的维基数据知识提取
提出了创建包含非结构化文档集的数字数学集合的方法。这些集合包含来自科学会议的材料,以及来自“前数字”时期数学期刊档案的文章。利用数字数学图书馆Lobachevskii DML元数据工厂的软件工具,形成了一套用于数字馆藏文档的强制性元数据集。为了完善和补充元数据集,使用了维基数据的知识提取方法。为了在维基数据中搜索有关数字馆藏文档及其作者的信息,开发了一个SPARQL查询系统。定义了一组Wikidata实体,这些实体决定了搜索的特征,以及随后对结果的过滤。提出了澄清和补充文献参考文献的方法。在形成retrocollection文档的元数据时,在Wikidata中搜索文章作者的生活年限信息,以及包含文章及其作者信息的网页的url。本文介绍了Lobachevskii-DML数字图书馆的几个新数字馆藏的形成结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信