Boyu Zhang, Xiangguo Ding, Xiaowen Huang, Yang Cao, J. Sang, Jian Yu
{"title":"Multi-source User Attribute Inference based on Hierarchical Auto-encoder","authors":"Boyu Zhang, Xiangguo Ding, Xiaowen Huang, Yang Cao, J. Sang, Jian Yu","doi":"10.1145/3338533.3366599","DOIUrl":null,"url":null,"abstract":"With the rapid development of Online Social Networks (OSNs), it is crucial to construct users' portraits from their dynamic behaviors to address the increasing needs for customized information services. Previous work on user attribute inference mainly concentrated on developing advanced features/models or exploiting external information and knowledge but ignored the contradiction between dynamic behaviors and stable demographic attributes, which results in deviation of user understanding To address the contradiction and accurately infer the user attributes, we propose a Multi-source User Attribute Inference algorithm based on Hierarchical Auto-encoder (MUAI-HAE). The basic idea is that: the shared patterns among the same individual's behaviors on different OSNs well indicate his/her stable demographic attributes. The hierarchical autoencoder is introduced to realize this idea by discovering the underlying non-linear correlation between different OSNs. The unsupervised scheme in shared pattern learning alleviates the requirements for the cross-OSN user account and improves the practicability. Off-the-shelf classification methods are then utilized to infer user attributes from the derived shared behavior patterns. The experiments on the real-world datasets from three OSNs demonstrate the effectiveness of the proposed method.","PeriodicalId":273086,"journal":{"name":"Proceedings of the ACM Multimedia Asia","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM Multimedia Asia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3338533.3366599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
With the rapid development of Online Social Networks (OSNs), it is crucial to construct users' portraits from their dynamic behaviors to address the increasing needs for customized information services. Previous work on user attribute inference mainly concentrated on developing advanced features/models or exploiting external information and knowledge but ignored the contradiction between dynamic behaviors and stable demographic attributes, which results in deviation of user understanding To address the contradiction and accurately infer the user attributes, we propose a Multi-source User Attribute Inference algorithm based on Hierarchical Auto-encoder (MUAI-HAE). The basic idea is that: the shared patterns among the same individual's behaviors on different OSNs well indicate his/her stable demographic attributes. The hierarchical autoencoder is introduced to realize this idea by discovering the underlying non-linear correlation between different OSNs. The unsupervised scheme in shared pattern learning alleviates the requirements for the cross-OSN user account and improves the practicability. Off-the-shelf classification methods are then utilized to infer user attributes from the derived shared behavior patterns. The experiments on the real-world datasets from three OSNs demonstrate the effectiveness of the proposed method.
随着在线社交网络(Online Social Networks,简称OSNs)的快速发展,从用户的动态行为中构建用户画像,以满足日益增长的个性化信息服务需求至关重要。以往的用户属性推断工作主要集中在开发高级特征/模型或利用外部信息和知识,而忽略了动态行为与稳定的人口统计属性之间的矛盾,导致用户理解偏差。为了解决这一矛盾,准确推断用户属性,我们提出了一种基于分层自编码器(MUAI-HAE)的多源用户属性推断算法。其基本思想是:同一个体在不同osn上的共同行为模式很好地反映了其稳定的人口统计属性。分层自编码器通过发现不同osn之间潜在的非线性相关性来实现这一思想。共享模式学习中的无监督方案减轻了对跨osn用户账号的要求,提高了实用性。然后利用现成的分类方法从派生的共享行为模式推断用户属性。在三个osn的实际数据集上的实验证明了该方法的有效性。