雇主行业分类使用职位公告

Mahak Goindani, Qiaoling Liu, Josh Chao, V. Jijkoun
{"title":"雇主行业分类使用职位公告","authors":"Mahak Goindani, Qiaoling Liu, Josh Chao, V. Jijkoun","doi":"10.1109/ICDMW.2017.30","DOIUrl":null,"url":null,"abstract":"In the recruitment domain, knowing the employer industry of jobs is important to get an insight about the demand in each industry. The existing system at CareerBuilder uses an employer name normalization system and an employer knowledge base to infer the employer industry of a job. However, errors may occur during the computation of the job employer and in the construction of the employer knowledge base with the industry attributes. Since the knowledge base is huge, it is not possible to manually detect the errors. Therefore, in this paper we use Machine Learning techniques to automatically detect the errors. With the observation that the main jobs posted by an employer often relate to the employer industry, e.g., truck driver jobs often correspond to employers belonging to the transportation industry, we develop a system that classifies the industry of an employer using job posting data. We aggregate job postings from an employer and use job titles and employer names as features for predicting the industry of the employer. We used two models for classification: (1) Support Vector Machine, and (2) Gradient Boosted Decision Trees, and observed that while both the models perform similarly in classifying job employers that were correctly computed, GBDT is more effective than SVM in identifying job employers that were wrongly computed. We also show the utility of our system in detecting normalization errors and knowledge base errors.","PeriodicalId":389183,"journal":{"name":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Employer Industry Classification Using Job Postings\",\"authors\":\"Mahak Goindani, Qiaoling Liu, Josh Chao, V. Jijkoun\",\"doi\":\"10.1109/ICDMW.2017.30\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the recruitment domain, knowing the employer industry of jobs is important to get an insight about the demand in each industry. The existing system at CareerBuilder uses an employer name normalization system and an employer knowledge base to infer the employer industry of a job. However, errors may occur during the computation of the job employer and in the construction of the employer knowledge base with the industry attributes. Since the knowledge base is huge, it is not possible to manually detect the errors. Therefore, in this paper we use Machine Learning techniques to automatically detect the errors. With the observation that the main jobs posted by an employer often relate to the employer industry, e.g., truck driver jobs often correspond to employers belonging to the transportation industry, we develop a system that classifies the industry of an employer using job posting data. We aggregate job postings from an employer and use job titles and employer names as features for predicting the industry of the employer. We used two models for classification: (1) Support Vector Machine, and (2) Gradient Boosted Decision Trees, and observed that while both the models perform similarly in classifying job employers that were correctly computed, GBDT is more effective than SVM in identifying job employers that were wrongly computed. We also show the utility of our system in detecting normalization errors and knowledge base errors.\",\"PeriodicalId\":389183,\"journal\":{\"name\":\"2017 IEEE International Conference on Data Mining Workshops (ICDMW)\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Data Mining Workshops (ICDMW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDMW.2017.30\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2017.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

在招聘领域,了解工作的雇主行业对于了解每个行业的需求非常重要。凯业必达现有的系统使用雇主名称规范化系统和雇主知识库来推断工作所在的雇主行业。但是,在计算工作单位和构建具有行业属性的雇主知识库时,可能会出现错误。由于知识库非常庞大,不可能手动检测错误。因此,在本文中我们使用机器学习技术来自动检测错误。观察到雇主发布的主要职位通常与雇主所在行业相关,例如,卡车司机的职位通常与运输行业的雇主相对应,我们开发了一个系统,该系统使用职位发布数据对雇主所在行业进行分类。我们汇总来自雇主的招聘信息,并使用职位和雇主名称作为预测雇主所在行业的特征。我们使用两种模型进行分类:(1)支持向量机(Support Vector Machine)和(2)梯度提升决策树(Gradient boosting Decision Trees),并观察到尽管这两种模型在对正确计算的雇主进行分类方面表现相似,但GBDT在识别错误计算的雇主方面比SVM更有效。我们还展示了我们的系统在检测规范化错误和知识库错误方面的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Employer Industry Classification Using Job Postings
In the recruitment domain, knowing the employer industry of jobs is important to get an insight about the demand in each industry. The existing system at CareerBuilder uses an employer name normalization system and an employer knowledge base to infer the employer industry of a job. However, errors may occur during the computation of the job employer and in the construction of the employer knowledge base with the industry attributes. Since the knowledge base is huge, it is not possible to manually detect the errors. Therefore, in this paper we use Machine Learning techniques to automatically detect the errors. With the observation that the main jobs posted by an employer often relate to the employer industry, e.g., truck driver jobs often correspond to employers belonging to the transportation industry, we develop a system that classifies the industry of an employer using job posting data. We aggregate job postings from an employer and use job titles and employer names as features for predicting the industry of the employer. We used two models for classification: (1) Support Vector Machine, and (2) Gradient Boosted Decision Trees, and observed that while both the models perform similarly in classifying job employers that were correctly computed, GBDT is more effective than SVM in identifying job employers that were wrongly computed. We also show the utility of our system in detecting normalization errors and knowledge base errors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信