M. Saqib Nawaz , M. Zohaib Nawaz , Philippe Fournier-Viger , José María Luna
{"title":"Analysis and classification of employee attrition and absenteeism in industry: A sequential pattern mining-based methodology","authors":"M. Saqib Nawaz , M. Zohaib Nawaz , Philippe Fournier-Viger , José María Luna","doi":"10.1016/j.compind.2024.104106","DOIUrl":null,"url":null,"abstract":"<div><p>Employee attrition and absenteeism are major problems that affect many industries and organizations, resulting in diminished productivity, elevated costs, and losses. These phenomena can be attributed to multiple factors that are difficult to anticipate for human resources or management. Therefore, this paper proposes a content-based methodology for the analysis and classification of employee attrition and absenteeism that can be used for talent analysis and management, a task that is traditionally carried out ex-post. The developed methodology, called E(3A)CSPM, is based on SPM (sequential pattern mining). In the methodology, four public datasets with diversified employee data are adopted, which are initially transformed into a suitable format. Then, SPM algorithms are applied to the transformed datasets to reveal recurring patterns and rules of features. The discovered patterns and rules not only offer information regarding features that have a key role in employee attrition and absenteeism but also their values. These frequent patterns of features are thereafter used to classify/predict employee attrition and absenteeism. Eight classifiers and multiple evaluation metrics are used in experiments. The performance of E(3A)CSPM is contrasted with state-of-the-art approaches for employee attrition and absenteeism and the obtained findings reveal that E(3A)CSPM surpasses these approaches.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104106"},"PeriodicalIF":8.2000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000344","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Employee attrition and absenteeism are major problems that affect many industries and organizations, resulting in diminished productivity, elevated costs, and losses. These phenomena can be attributed to multiple factors that are difficult to anticipate for human resources or management. Therefore, this paper proposes a content-based methodology for the analysis and classification of employee attrition and absenteeism that can be used for talent analysis and management, a task that is traditionally carried out ex-post. The developed methodology, called E(3A)CSPM, is based on SPM (sequential pattern mining). In the methodology, four public datasets with diversified employee data are adopted, which are initially transformed into a suitable format. Then, SPM algorithms are applied to the transformed datasets to reveal recurring patterns and rules of features. The discovered patterns and rules not only offer information regarding features that have a key role in employee attrition and absenteeism but also their values. These frequent patterns of features are thereafter used to classify/predict employee attrition and absenteeism. Eight classifiers and multiple evaluation metrics are used in experiments. The performance of E(3A)CSPM is contrasted with state-of-the-art approaches for employee attrition and absenteeism and the obtained findings reveal that E(3A)CSPM surpasses these approaches.
期刊介绍:
The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that:
• Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry;
• Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry;
• Foster connections or integrations across diverse application areas of ICT in industry.