Analysis and classification of employee attrition and absenteeism in industry: A sequential pattern mining-based methodology

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
M. Saqib Nawaz , M. Zohaib Nawaz , Philippe Fournier-Viger , José María Luna
{"title":"Analysis and classification of employee attrition and absenteeism in industry: A sequential pattern mining-based methodology","authors":"M. Saqib Nawaz ,&nbsp;M. Zohaib Nawaz ,&nbsp;Philippe Fournier-Viger ,&nbsp;José María Luna","doi":"10.1016/j.compind.2024.104106","DOIUrl":null,"url":null,"abstract":"<div><p>Employee attrition and absenteeism are major problems that affect many industries and organizations, resulting in diminished productivity, elevated costs, and losses. These phenomena can be attributed to multiple factors that are difficult to anticipate for human resources or management. Therefore, this paper proposes a content-based methodology for the analysis and classification of employee attrition and absenteeism that can be used for talent analysis and management, a task that is traditionally carried out ex-post. The developed methodology, called E(3A)CSPM, is based on SPM (sequential pattern mining). In the methodology, four public datasets with diversified employee data are adopted, which are initially transformed into a suitable format. Then, SPM algorithms are applied to the transformed datasets to reveal recurring patterns and rules of features. The discovered patterns and rules not only offer information regarding features that have a key role in employee attrition and absenteeism but also their values. These frequent patterns of features are thereafter used to classify/predict employee attrition and absenteeism. Eight classifiers and multiple evaluation metrics are used in experiments. The performance of E(3A)CSPM is contrasted with state-of-the-art approaches for employee attrition and absenteeism and the obtained findings reveal that E(3A)CSPM surpasses these approaches.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104106"},"PeriodicalIF":8.2000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000344","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Employee attrition and absenteeism are major problems that affect many industries and organizations, resulting in diminished productivity, elevated costs, and losses. These phenomena can be attributed to multiple factors that are difficult to anticipate for human resources or management. Therefore, this paper proposes a content-based methodology for the analysis and classification of employee attrition and absenteeism that can be used for talent analysis and management, a task that is traditionally carried out ex-post. The developed methodology, called E(3A)CSPM, is based on SPM (sequential pattern mining). In the methodology, four public datasets with diversified employee data are adopted, which are initially transformed into a suitable format. Then, SPM algorithms are applied to the transformed datasets to reveal recurring patterns and rules of features. The discovered patterns and rules not only offer information regarding features that have a key role in employee attrition and absenteeism but also their values. These frequent patterns of features are thereafter used to classify/predict employee attrition and absenteeism. Eight classifiers and multiple evaluation metrics are used in experiments. The performance of E(3A)CSPM is contrasted with state-of-the-art approaches for employee attrition and absenteeism and the obtained findings reveal that E(3A)CSPM surpasses these approaches.

工业中雇员流失和缺勤的分析与分类:基于序列模式挖掘的方法
员工流失和旷工是影响许多行业和组织的主要问题,会导致生产力下降、成本上升和损失。这些现象可归因于人力资源或管理部门难以预测的多种因素。因此,本文提出了一种基于内容的员工流失和缺勤分析与分类方法,可用于传统的事后人才分析和管理。所开发的方法被称为 E(3A)CSPM ,是基于 SPM(序列模式挖掘)的。在该方法中,采用了四个包含多样化员工数据的公共数据集,并首先将其转换为合适的格式。然后,将 SPM 算法应用于转换后的数据集,以揭示重复出现的模式和特征规则。所发现的模式和规则不仅提供了在员工流失和缺勤中起关键作用的特征信息,还提供了其价值。此后,这些频繁出现的特征模式将用于对员工流失和缺勤情况进行分类/预测。实验中使用了八个分类器和多个评价指标。E(3A)CSPM 的性能与最先进的员工减员和旷工方法进行了对比,结果表明 E(3A)CSPM 的性能超过了这些方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信