Graph Neural Networks for the Global Economy with Microsoft DeepGraph

Jaewon Yang, Baoxu Shi, A. Samylkin
{"title":"Graph Neural Networks for the Global Economy with Microsoft DeepGraph","authors":"Jaewon Yang, Baoxu Shi, A. Samylkin","doi":"10.1145/3488560.3510020","DOIUrl":null,"url":null,"abstract":"Graph Neural Networks (GNNs) are AI models that learn embeddings for the nodes in a graph and use the embeddings to perform prediction tasks. In this talk, we present how we developed GNNs for the LinkedIn economic graph. LinkedIn economic graph is a digital representation of the global economy with 1B nodes and 200B edges, consisting of social graphs about members' connections, activity graphs between members and other economic entities, and knowledge graphs about members', companies', job postings' attributes. By applying GNN to this graph, we can utilize the full potential of the economic graph in many search and recommendation products across LinkedIn. The biggest challenge was to scale up GNNs to a massive scale of billion nodes and edges. To address this challenge, we developed Microsoft DeepGraph, an open source library for large scale GNN development. DeepGraph allows for training GNNs on large graphs by serving the graph in a distributed fashion with graph engine servers. In this talk, we will highlight the strengths of DeepGraph, such as support for both PyTorch and TensorFlow, and integration with Azure ML and Azure Kubernetes Service. We will share lessons and findings from developing GNNs for various applications around the LinkedIn economic graph. We will explain how we combine graphs such as social graph, activity graph, knowledge graphs into one gigantic heterogeneous graph, and what algorithms we employed for this heterogenous graph. We will present a few case studies, such as how we identify job postings with vague titles and replace them with more specific titles using GNNs.","PeriodicalId":348686,"journal":{"name":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3488560.3510020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Graph Neural Networks (GNNs) are AI models that learn embeddings for the nodes in a graph and use the embeddings to perform prediction tasks. In this talk, we present how we developed GNNs for the LinkedIn economic graph. LinkedIn economic graph is a digital representation of the global economy with 1B nodes and 200B edges, consisting of social graphs about members' connections, activity graphs between members and other economic entities, and knowledge graphs about members', companies', job postings' attributes. By applying GNN to this graph, we can utilize the full potential of the economic graph in many search and recommendation products across LinkedIn. The biggest challenge was to scale up GNNs to a massive scale of billion nodes and edges. To address this challenge, we developed Microsoft DeepGraph, an open source library for large scale GNN development. DeepGraph allows for training GNNs on large graphs by serving the graph in a distributed fashion with graph engine servers. In this talk, we will highlight the strengths of DeepGraph, such as support for both PyTorch and TensorFlow, and integration with Azure ML and Azure Kubernetes Service. We will share lessons and findings from developing GNNs for various applications around the LinkedIn economic graph. We will explain how we combine graphs such as social graph, activity graph, knowledge graphs into one gigantic heterogeneous graph, and what algorithms we employed for this heterogenous graph. We will present a few case studies, such as how we identify job postings with vague titles and replace them with more specific titles using GNNs.
图神经网络的全球经济与微软DeepGraph
图神经网络(gnn)是一种人工智能模型,它学习图中节点的嵌入,并使用嵌入来执行预测任务。在这次演讲中,我们将介绍如何为LinkedIn经济图表开发gnn。LinkedIn经济图是一个拥有1B个节点和200B条边的全球经济的数字表示,包括关于成员关系的社交图,成员与其他经济实体之间的活动图,以及关于成员、公司、职位发布属性的知识图。通过将GNN应用于这个图表,我们可以在LinkedIn上的许多搜索和推荐产品中充分利用经济图表的潜力。最大的挑战是将gnn扩展到数十亿个节点和边缘的大规模。为了应对这一挑战,我们开发了Microsoft DeepGraph,这是一个用于大规模GNN开发的开源库。DeepGraph允许通过使用图引擎服务器以分布式方式提供图来训练大型图上的gnn。在这次演讲中,我们将重点介绍DeepGraph的优势,例如对PyTorch和TensorFlow的支持,以及与Azure ML和Azure Kubernetes Service的集成。我们将分享在围绕LinkedIn经济图表的各种应用中开发gnn的经验和发现。我们将解释我们如何将社交图、活动图、知识图等图组合成一个巨大的异构图,以及我们为这个异构图使用了什么算法。我们将介绍一些案例研究,例如我们如何识别具有模糊标题的招聘启事,并使用gnn将其替换为更具体的标题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信