{"title":"将成对重复转换为实体簇实现高质量重复检测","authors":"DraisbachUwe, ChristenPeter, NaumannFelix","doi":"10.1145/3352591","DOIUrl":null,"url":null,"abstract":"Duplicate detection algorithms produce clusters of database records, each cluster representing a single real-world entity. As most of these algorithms use pairwise comparisons, the resulting (trans...","PeriodicalId":44355,"journal":{"name":"ACM Journal of Data and Information Quality","volume":"12 1","pages":"1-30"},"PeriodicalIF":1.5000,"publicationDate":"2020-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3352591","citationCount":"19","resultStr":"{\"title\":\"Transforming Pairwise Duplicates to Entity Clusters for High-quality Duplicate Detection\",\"authors\":\"DraisbachUwe, ChristenPeter, NaumannFelix\",\"doi\":\"10.1145/3352591\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Duplicate detection algorithms produce clusters of database records, each cluster representing a single real-world entity. As most of these algorithms use pairwise comparisons, the resulting (trans...\",\"PeriodicalId\":44355,\"journal\":{\"name\":\"ACM Journal of Data and Information Quality\",\"volume\":\"12 1\",\"pages\":\"1-30\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2020-01-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1145/3352591\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Journal of Data and Information Quality\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3352591\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Journal of Data and Information Quality","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3352591","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Transforming Pairwise Duplicates to Entity Clusters for High-quality Duplicate Detection
Duplicate detection algorithms produce clusters of database records, each cluster representing a single real-world entity. As most of these algorithms use pairwise comparisons, the resulting (trans...