Content Driven Profile Matching across Online Social Networks

Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017 Pub Date : 2017-07-31 DOI:10.1145/3110025.3110095

R. Roedler, Dennis Kergl, G. Rodosek

{"title":"Content Driven Profile Matching across Online Social Networks","authors":"R. Roedler, Dennis Kergl, G. Rodosek","doi":"10.1145/3110025.3110095","DOIUrl":null,"url":null,"abstract":"Many publications deal with profile matching across online social networks and the approaches become increasingly complex. Almost all of them rely on common profile attributes like names and hobbies or structural attributes like relations to other user profiles. These approaches require high effort concerning computation, because each profile of one network has to be compared to all profiles of the other network. Complex approaches are not well suited to handle large datasets. Therefore, we present an approach to significantly reduce complexity by exploiting special properties of dataset IDs. We provide a proof of concept by an implementation of the use case of matching user profiles accross Twitter and Instagram. Additionally to the complexity problem of existing approaches, many profiles with similar attributes often lead to a restrictive trade-off between precision and recall of the matching strategy. Furthermore, profile attributes and relationships are not trustworthy, as these are due to arbitrary change by profile owners. In contrast to most existing approaches that rely on user definable attributes, we rather focus on timing properties of user publications across social media platforms. There are already profile matching approaches based on timing patterns. However, these do not aim to reduce complexity, what is a necessary requirement to be applicable to real-world online social networks. As we will show, the approach can be easily transferred to other networks.","PeriodicalId":399660,"journal":{"name":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3110025.3110095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Many publications deal with profile matching across online social networks and the approaches become increasingly complex. Almost all of them rely on common profile attributes like names and hobbies or structural attributes like relations to other user profiles. These approaches require high effort concerning computation, because each profile of one network has to be compared to all profiles of the other network. Complex approaches are not well suited to handle large datasets. Therefore, we present an approach to significantly reduce complexity by exploiting special properties of dataset IDs. We provide a proof of concept by an implementation of the use case of matching user profiles accross Twitter and Instagram. Additionally to the complexity problem of existing approaches, many profiles with similar attributes often lead to a restrictive trade-off between precision and recall of the matching strategy. Furthermore, profile attributes and relationships are not trustworthy, as these are due to arbitrary change by profile owners. In contrast to most existing approaches that rely on user definable attributes, we rather focus on timing properties of user publications across social media platforms. There are already profile matching approaches based on timing patterns. However, these do not aim to reduce complexity, what is a necessary requirement to be applicable to real-world online social networks. As we will show, the approach can be easily transferred to other networks.

查看原文本刊更多论文

跨在线社交网络的内容驱动配置文件匹配

许多出版物处理跨在线社会网络的个人资料匹配，方法变得越来越复杂。几乎所有这些都依赖于常见的配置文件属性，如姓名和爱好，或结构属性，如与其他用户配置文件的关系。这些方法在计算方面需要付出很大的努力，因为必须将一个网络的每个概要与其他网络的所有概要进行比较。复杂的方法不太适合处理大型数据集。因此，我们提出了一种通过利用数据集id的特殊属性来显著降低复杂性的方法。我们通过实现跨Twitter和Instagram匹配用户配置文件的用例来提供概念验证。除了现有方法的复杂性问题外，许多具有相似属性的配置文件通常会导致匹配策略的精度和召回率之间的限制性权衡。此外，概要文件的属性和关系是不可信的，因为这些是由于概要文件所有者的任意更改造成的。与大多数依赖于用户自定义属性的现有方法相比，我们更关注用户发布在社交媒体平台上的时间属性。已经有基于时间模式的配置文件匹配方法。然而，这些并不旨在降低复杂性，这是适用于现实世界的在线社交网络的必要要求。正如我们将展示的，这种方法可以很容易地转移到其他网络。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017

自引率

0.00%

发文量