An example of empirical approach for bibliographic record linkage
A. Knyazeva, O. Kolobov, I. Turchanovsky
2016 IEEE Tenth International Conference on Research Challenges in Information Science (RCIS), published 2016-06-01
DOI: 10.1109/RCIS.2016.7549290 (https://doi.org/10.1109/RCIS.2016.7549290)
Citation count: 1
Abstract
The record linkage problem is considered as it applies to bibliographic and authority data. The problem commonly arises when data from several libraries are merged. Two approaches based on empirical analysis of the data are tested. Both of them use indirect information about a person. The proposed variant of the decision tree method allows us to deal with inconsistent bibliographic data and to apply particular rules one by one to improve record linkage quality. The study was performed on data from several Russian libraries. The data are in RUSMARC format, a variant of UNIMARC that is popular in Russia.
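The idea of applying particular rules one by one can be illustrated with a minimal sketch. It is not the authors' implementation: the record fields (`surname`, `birth_year`, `coauthors`), the three rules, and their order are all hypothetical, chosen only to show how a rule cascade can tolerate missing or inconsistent fields by letting a rule abstain.

```python
from typing import Any, Callable, Optional

# Hypothetical record layout: a plain dict; any field may be missing or None.
Record = dict[str, Any]
# A rule returns True (match), False (non-match), or None (cannot decide).
Rule = Callable[[Record, Record], Optional[bool]]


def surname_rule(a: Record, b: Record) -> Optional[bool]:
    """Different surnames rule out a match; equal surnames alone decide nothing."""
    sa, sb = a.get("surname"), b.get("surname")
    if not sa or not sb:
        return None
    if sa.casefold() != sb.casefold():
        return False
    return None


def birth_year_rule(a: Record, b: Record) -> Optional[bool]:
    """Compare birth years only when both records carry one."""
    ya, yb = a.get("birth_year"), b.get("birth_year")
    if ya is None or yb is None:
        return None
    return ya == yb


def coauthor_rule(a: Record, b: Record) -> Optional[bool]:
    """Indirect evidence: a shared co-author suggests the same person."""
    common = set(a.get("coauthors") or ()) & set(b.get("coauthors") or ())
    return True if common else None


# Assumed ordering: cheap, decisive rules first; indirect evidence last.
RULES: list[Rule] = [surname_rule, birth_year_rule, coauthor_rule]


def link(a: Record, b: Record, rules: list[Rule] = RULES) -> Optional[bool]:
    """Apply rules in order and return the first definite verdict, if any."""
    for rule in rules:
        verdict = rule(a, b)
        if verdict is not None:
            return verdict
    return None  # undecided: leave the pair for manual review
```

Because each rule may abstain, a record with an empty field falls through to the next rule instead of forcing a wrong decision, and a new rule can be added or reordered without touching the others.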