Customer Segmentation Based on Transactional Data Using Stream Clustering

Matthias Carnein &
Heike Trautmann

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 11439)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

Customer segmentation aims to identify groups of customers that share similar interests or behaviour. It is an essential tool in marketing and can be used to target customer segments with tailored marketing strategies. Customer segmentation is often based on clustering techniques. This analysis is typically performed as a snapshot analysis, where segments are identified at a specific point in time. However, this ignores the fact that customer segments are highly volatile and change over time. Once segments change, the entire analysis needs to be repeated and strategies adapted. In this paper, we explore stream clustering as a tool to alleviate this problem. We propose a new stream clustering algorithm that makes it possible to identify and track customer segments over time. The biggest challenge is that customer segmentation often relies on the transaction history of a customer. Since this data changes over time, it is necessary to update customers that have already been incorporated into the clustering. We show how to perform this step incrementally, without the need for periodic re-computations. As a result, customer segmentation can be performed continuously, is faster, and is more scalable. We demonstrate the performance of our algorithm using a large real-life case study.
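To make the idea of updating an already-clustered customer more concrete, here is a minimal Python sketch. It is not the algorithm proposed in the paper, whose details are not given in the abstract. It only illustrates one common way such an update can be done incrementally: each micro-cluster keeps an additive summary (a count and a linear sum), so a customer's old feature vector can be subtracted and the new one added without re-clustering. The names `MicroCluster`, `CustomerStream`, `radius`, and the two-dimensional feature vectors are all assumptions made for illustration.

```python
class MicroCluster:
    """Additive summary of a group of customers (hypothetical structure)."""

    def __init__(self, dim):
        self.n = 0                      # number of customers in the cluster
        self.linear_sum = [0.0] * dim   # per-dimension sum of their features

    def add(self, x):
        self.n += 1
        self.linear_sum = [s + v for s, v in zip(self.linear_sum, x)]

    def remove(self, x):
        self.n -= 1
        self.linear_sum = [s - v for s, v in zip(self.linear_sum, x)]

    def centroid(self):
        return [s / self.n for s in self.linear_sum]


class CustomerStream:
    """Tracks which micro-cluster each customer currently belongs to."""

    def __init__(self, dim, radius):
        self.dim = dim
        self.radius = radius    # assumed distance threshold for joining a cluster
        self.clusters = []
        self.customers = {}     # customer id -> (feature vector, micro-cluster)

    def _nearest(self, x):
        best, best_dist = None, float("inf")
        for cluster in self.clusters:
            dist = sum((a - b) ** 2 for a, b in zip(cluster.centroid(), x)) ** 0.5
            if dist < best_dist:
                best, best_dist = cluster, dist
        return best, best_dist

    def update(self, customer_id, x):
        """Insert a new customer, or move an existing one after a transaction."""
        if customer_id in self.customers:
            # Undo the customer's old contribution instead of re-clustering.
            old_x, old_cluster = self.customers[customer_id]
            old_cluster.remove(old_x)
            if old_cluster.n == 0:
                self.clusters.remove(old_cluster)
        cluster, dist = self._nearest(x)
        if cluster is None or dist > self.radius:
            cluster = MicroCluster(self.dim)
            self.clusters.append(cluster)
        cluster.add(x)
        self.customers[customer_id] = (list(x), cluster)


stream = CustomerStream(dim=2, radius=1.0)
stream.update("a", [1.0, 1.0])
stream.update("b", [1.5, 1.0])     # close to "a", joins the same micro-cluster
stream.update("a", [10.0, 10.0])   # "a" changed behaviour and moves away
print(sorted(c.centroid() for c in stream.clusters))
```

Because the summaries are additive, each update only touches the customer's old and new micro-clusters, so its cost does not grow with the number of customers already seen. A practical stream clustering method would typically also need decay or fading of outdated clusters and a step that merges micro-clusters into final segments, both of which this sketch omits.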


Notes

1. Implementation available at: http://www.matthias-carnein.de/userStream. For reproducibility, we also show how to apply the algorithm to a public dataset.


Author information

Authors and Affiliations

University of Münster, Münster, Germany
Matthias Carnein & Heike Trautmann

Corresponding author

Correspondence to Matthias Carnein.

Editor information

Editors and Affiliations

Hong Kong University of Science and Technology, Hong Kong, China
Qiang Yang
Nanjing University, Nanjing, China
Zhi-Hua Zhou
University of Macau, Taipa, Macau, China
Zhiguo Gong
Southeast University, Nanjing, China
Min-Ling Zhang
Nanjing University of Aeronautics and Astronautics, Nanjing, China
Sheng-Jun Huang

About this paper

Cite this paper

Carnein, M., Trautmann, H. (2019). Customer Segmentation Based on Transactional Data Using Stream Clustering. In: Yang, Q., Zhou, ZH., Gong, Z., Zhang, ML., Huang, SJ. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science, vol 11439. Springer, Cham. https://doi.org/10.1007/978-3-030-16148-4_22

DOI: https://doi.org/10.1007/978-3-030-16148-4_22
Published: 22 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16147-7
Online ISBN: 978-3-030-16148-4
eBook Packages: Computer Science, Computer Science (R0)
