{"title":"Content Driven Profile Matching across Online Social Networks","authors":"R. Roedler, Dennis Kergl, G. Rodosek","doi":"10.1145/3110025.3110095","DOIUrl":null,"url":null,"abstract":"Many publications deal with profile matching across online social networks and the approaches become increasingly complex. Almost all of them rely on common profile attributes like names and hobbies or structural attributes like relations to other user profiles. These approaches require high effort concerning computation, because each profile of one network has to be compared to all profiles of the other network. Complex approaches are not well suited to handle large datasets. Therefore, we present an approach to significantly reduce complexity by exploiting special properties of dataset IDs. We provide a proof of concept by an implementation of the use case of matching user profiles accross Twitter and Instagram. Additionally to the complexity problem of existing approaches, many profiles with similar attributes often lead to a restrictive trade-off between precision and recall of the matching strategy. Furthermore, profile attributes and relationships are not trustworthy, as these are due to arbitrary change by profile owners. In contrast to most existing approaches that rely on user definable attributes, we rather focus on timing properties of user publications across social media platforms. There are already profile matching approaches based on timing patterns. However, these do not aim to reduce complexity, what is a necessary requirement to be applicable to real-world online social networks. As we will show, the approach can be easily transferred to other networks.","PeriodicalId":399660,"journal":{"name":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3110025.3110095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Many publications deal with profile matching across online social networks and the approaches become increasingly complex. Almost all of them rely on common profile attributes like names and hobbies or structural attributes like relations to other user profiles. These approaches require high effort concerning computation, because each profile of one network has to be compared to all profiles of the other network. Complex approaches are not well suited to handle large datasets. Therefore, we present an approach to significantly reduce complexity by exploiting special properties of dataset IDs. We provide a proof of concept by an implementation of the use case of matching user profiles accross Twitter and Instagram. Additionally to the complexity problem of existing approaches, many profiles with similar attributes often lead to a restrictive trade-off between precision and recall of the matching strategy. Furthermore, profile attributes and relationships are not trustworthy, as these are due to arbitrary change by profile owners. In contrast to most existing approaches that rely on user definable attributes, we rather focus on timing properties of user publications across social media platforms. There are already profile matching approaches based on timing patterns. However, these do not aim to reduce complexity, what is a necessary requirement to be applicable to real-world online social networks. As we will show, the approach can be easily transferred to other networks.