{"title":"A data-driven approach to customer lifetime value prediction using probability and machine learning models","authors":"Albert Wong, Andres Viloria Garcia, Yew-Wei Lim","doi":"10.1016/j.dajour.2025.100601","DOIUrl":null,"url":null,"abstract":"<div><div>Customer lifetime value is an important marketing metric and has applications in market segmentation, strategy development, and direct marketing programs, especially when customers are not under contract. In this research, we demonstrate the prediction of the lifetime value of patients in a health service portfolio in two separate ways. The probability of a patient being alive and their value in the coming evaluation period are first predicted using a probability model that has been well-established in the marketing community. We then use several machine learning algorithms to perform the same task. The results of these two approaches are compared in terms of accuracy to gain insight into their respective strengths and weaknesses. We believe that the work is one of the first attempts to gain an understanding of the use of machine learning algorithms in this important marketing issue. The results showed that the probability model performs better than the machine learning models, probably due to the assumption required in the probability calculations. It is therefore recommended that an essential step in applying these software approaches is to verify the validity of the key assumption of regularity. In addition, in future studies, consideration should be given to a larger dataset with demographic variables beyond age and gender that were used in this study. Developing specific ML models for dealing with zero-inflated data, which is an inherent feature of customer lifetime data, will also be helpful.</div></div>","PeriodicalId":100357,"journal":{"name":"Decision Analytics Journal","volume":"16 ","pages":"Article 100601"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Decision Analytics Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772662225000578","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Customer lifetime value is an important marketing metric and has applications in market segmentation, strategy development, and direct marketing programs, especially when customers are not under contract. In this research, we demonstrate the prediction of the lifetime value of patients in a health service portfolio in two separate ways. The probability of a patient being alive and their value in the coming evaluation period are first predicted using a probability model that has been well-established in the marketing community. We then use several machine learning algorithms to perform the same task. The results of these two approaches are compared in terms of accuracy to gain insight into their respective strengths and weaknesses. We believe that the work is one of the first attempts to gain an understanding of the use of machine learning algorithms in this important marketing issue. The results showed that the probability model performs better than the machine learning models, probably due to the assumption required in the probability calculations. It is therefore recommended that an essential step in applying these software approaches is to verify the validity of the key assumption of regularity. In addition, in future studies, consideration should be given to a larger dataset with demographic variables beyond age and gender that were used in this study. Developing specific ML models for dealing with zero-inflated data, which is an inherent feature of customer lifetime data, will also be helpful.