A COMPARISON STUDY OF PREDICTIVE MODELS FOR CUSTOMER LIFETIME VALUE AND THE CHANGE OF ITS LEVEL

Authors

  • Pornthip Dechpichai Department of Mathematics, Faculty of Science, King Mongkut’s University of Technology Thonburi
  • Latthapol Chokratprapa Department of Mathematics, Faculty of Science, King Mongkut’s University of Technology Thonburi
  • Supitch Sripath Thanachart insurance public company limited
  • Nathakhun Wiroonsri Department of Mathematics, Faculty of Science, King Mongkut’s University of Technology Thonburi

Keywords:

Customer Lifetime Value, Ordinal Logistic Regression, Random Forest

Abstract

The main aims of this research are to construct and compare models to predict the change of Customer Lifetime Value or CLV level in two consecutive years. The data used in this work is collected by a case study insurance company from 2018 to 2020 containing the following features: gender, the number of consecutive years with the company, the number of active policies, the total insurance premium, the total coverage, the total policy profit and CLV level according to company criteria (Standard, Plus, Extra and Ultima). The data from 2018 and 2019 consisting of 541,371 customers are used to construct the ordinal logistic regression and Random Forest models and to check the model accuracies, respectively. Then the data from 2020 consisting of 1,029,001 customers is used for predicting future CLV.

            The results show that, in term of predicting CLV, the Random Forest model (75.01%) has a higher accuracy than the ordinal logistic regression model (65.60%). However, both models cannot be used in detecting the change of CLV level providing the accuracies of 15.57% and 25.74%, respectively. After adjusting the predicting threshold criterion for the ordinal logistic regression model, the total accuracy and the change of CLV level accuracy are improved to 74.60% and 52.16%, respectively. Therefore, the ordinal logistic regression model with the new threshold criterion is selected as our final model. By using the final model, the 1,029,001 customers are classified as 40,786 Standard customers, 809,951 Plus customers, 168,389 Extra customers, and 9,875 Ultima customers.

References

Breiman, L. (2001). Random Forest s. Machine Learning, 45, 5–32. doi: 10.1023/A:1010933 404324.

Chang, W., Chang, C., & Li, Q. (2012). Customer Lifetime Value: A Review. Social Behavior and Personality: an international journal, 40(7), 1057-1064. doi:10.2224/sbp.2012.40. 7.1057.

Ho, T.K. (1995). Random Decision Forests. Proceedings of the 3rd International Conference on Document Analysis and Recognition 1, 278-282. doi: 10.1109/ICDA R.1995.598994.

Jackson, D. (1989). Determining a customer’s lifetime value. Direct Marketing, 51, 60-63.

James, G., Witten, D., Hastie, T. & Tibshirani, R. (2013). An Introduction to Statistical Learning with Applications in R. New York: Springer.

Office of Insurance Commission. (2019). Annual General Insurance Business Overview Report 2019. Retrieved form https://www.oic.or. th/th/industry /statistic/data/39/2.

RStudio Team. (2020). RStudio: Integrated Development for R, Boston: RStudio, PBC. Retrieved form http://www.rstudio.com/.

Downloads

Published

2022-12-29

How to Cite

Dechpichai, P., Chokratprapa, L. ., Sripath, S., & Wiroonsri, N. (2022). A COMPARISON STUDY OF PREDICTIVE MODELS FOR CUSTOMER LIFETIME VALUE AND THE CHANGE OF ITS LEVEL . Modern Management Journal, 20(2), 130–142. Retrieved from https://so04.tci-thaijo.org/index.php/stou-sms-pr/article/view/256864

Issue

Section

Research Articles