{"title":"技术角度:实体匹配,保证质量和误差","authors":"B. Kimelfeld, W. Martens","doi":"10.1145/3371316.3371322","DOIUrl":null,"url":null,"abstract":"The challenge of entity matching is that of identifying when different data items (often referred to as records or mentions) refer to the same real-life entity. Popular instantiations of this problem include deduplication, where the items are database records that include duplicate representations of the same entity (e.g., duplicate profiles in a social network) [2], record linkage, where the items come from different data sources that mention overlapping sets of entities (e.g., the profiles of two social networks) [5], and schema matching, where the items are attributes of different database schemas that intersect on their domain of interest (e.g., the database schemas of different social networks) [6].","PeriodicalId":21740,"journal":{"name":"SIGMOD Rec.","volume":"24 1","pages":"23"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Technical Perspective: Entity Matching with Quality and Error Guarantees\",\"authors\":\"B. Kimelfeld, W. Martens\",\"doi\":\"10.1145/3371316.3371322\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The challenge of entity matching is that of identifying when different data items (often referred to as records or mentions) refer to the same real-life entity. Popular instantiations of this problem include deduplication, where the items are database records that include duplicate representations of the same entity (e.g., duplicate profiles in a social network) [2], record linkage, where the items come from different data sources that mention overlapping sets of entities (e.g., the profiles of two social networks) [5], and schema matching, where the items are attributes of different database schemas that intersect on their domain of interest (e.g., the database schemas of different social networks) [6].\",\"PeriodicalId\":21740,\"journal\":{\"name\":\"SIGMOD Rec.\",\"volume\":\"24 1\",\"pages\":\"23\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SIGMOD Rec.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3371316.3371322\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIGMOD Rec.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3371316.3371322","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Technical Perspective: Entity Matching with Quality and Error Guarantees
The challenge of entity matching is that of identifying when different data items (often referred to as records or mentions) refer to the same real-life entity. Popular instantiations of this problem include deduplication, where the items are database records that include duplicate representations of the same entity (e.g., duplicate profiles in a social network) [2], record linkage, where the items come from different data sources that mention overlapping sets of entities (e.g., the profiles of two social networks) [5], and schema matching, where the items are attributes of different database schemas that intersect on their domain of interest (e.g., the database schemas of different social networks) [6].