IPOD:大规模工业和专业职业数据集

Junhua Liu, Yung Chuen Ng, Kwan Hui Lim
{"title":"IPOD:大规模工业和专业职业数据集","authors":"Junhua Liu, Yung Chuen Ng, Kwan Hui Lim","doi":"10.1145/3406865.3418329","DOIUrl":null,"url":null,"abstract":"In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.","PeriodicalId":93424,"journal":{"name":"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...","volume":"54 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"IPOD: A Large-scale Industrial and Professional Occupation Dataset\",\"authors\":\"Junhua Liu, Yung Chuen Ng, Kwan Hui Lim\",\"doi\":\"10.1145/3406865.3418329\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.\",\"PeriodicalId\":93424,\"journal\":{\"name\":\"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...\",\"volume\":\"54 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3406865.3418329\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CSCW '20 Companion : conference companion publication of the 2020 Conference on Computer Supported Cooperative Work and Social Computing : October 17-21, 2020, Virtual Event, USA. Conference on Computer-Supported Cooperative Work and So...","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3406865.3418329","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

在当今的就业市场,职业数据挖掘和分析变得越来越重要,因为它使公司能够预测员工流动率,建立职业轨迹模型,筛选简历和执行其他人力资源任务。因此,人们对利用职业数据挖掘和分析越来越感兴趣,而促进这些任务的一个关键要求是需要一个与职业相关的数据集。然而,大多数研究使用专有数据集或不公开其数据集,从而阻碍了该领域的发展。为了解决这个问题,我们提供了工业和专业职业数据集(IPOD),其中包括属于192,295名Linkedin用户的475,073个职位。除了公开IPOD之外,我们还:(i)手动标注每个职位的相关资历、工作领域和工作地点;(ii)为职位名称提供嵌入并讨论各种用例。该数据集可在https://github.com/junhua/ipod上公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
IPOD: A Large-scale Industrial and Professional Occupation Dataset
In today's job market, occupational data mining and analysis is growing in importance as it enables companies to predict employee turnover, model career trajectories, screen through resumes and perform other human resource tasks. As such, there has been growing interest in utilizing occupational data mining and analysis, and a key requirement to facilitate these tasks is the need for an occupation-related dataset. However, most research use proprietary datasets or do not make their dataset publicly available, thus impeding development in this area. To solve this issue, we present the Industrial and Professional Occupation Dataset (IPOD), which comprises 475,073 job titles belonging to 192,295 Linkedin users. In addition to making IPOD publicly available, we also: (i) manually annotate each job title with its associated level of seniority, domain of work and location; and (ii) provide embedding for job titles and discuss various use cases. This dataset is publicly available at https://github.com/junhua/ipod.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信