Abstract
Nowadays, grocery lists make part of shopping habits of many customers. With the popularity of e-commerce and plethora of products and promotions available on online stores, it can become increasingly difficult for customers to identify. products that both satisfy their needs and represent the best deals overall. In this paper, we present a grocery recommender system based on the use of traditional machine learning methods aiming at assisting customers with creation of their grocery lists on the MyGroceryTour platform which displays weekly grocery deals in Canada. Our recommender system relies on the individual user purchase histories, as well as the available products’ and stores’ features, to constitute intelligent weekly grocery lists. The use of clustering prior to supervised machine learning methods allowed us to identify customers profiles and reduce the choice of potential products of interest for each customer, thus improving the prediction results. The highest average F-score of 0.499 for the considered dataset of 826 Canadian customers was obtained using the Random Forest prediction model which was compared to the Decision Tree, Gradient Boosting Tree, XGBoost, Logistic Regression, Catboost, Support Vector Machine and Naive Bayes models in our study.
Chapter PDF
Similar content being viewed by others
Keywords
References
Vincent-Wayne, M., Aylott, R.: An exploratory study of grocery shopping stressors. Int. J. Retail. Distrib. Manag. 26, 362–373 (1998)
Newcomb, E., Pashley, T., Stasko, J.: Mobile computing in the retail arena. In: Proceedings of the SIGCHI conference on Human factors in computing systems, pp. 337–344. Association for Computing Machinery, New York (2003)
Sourav, B., Floréen, P., Forsblom, A., Hemminki, S., Myllymäki, P., Nurmi, P., Pulkkinen, T., Salovaara, A.: An Intelligent Mobile Grocery Assistant. In: 2012 Eighth International Conference on Intelligent Environments, pp. 165–172. IEEE, Guanajuato (2012)
Park, Y., Chang, K.: Individual and group behavior-based customer profile model for personalized product recommendation. Expert Systems with Applications 36(2), 1932–1939 (2009)
Ricci, F., Rokach, L., Shapira, B.: Recommender Systems: Introduction and Challenges. In: Recommender Systems Handbook, pp. 1–34. Springer, Boston (2015)
Faggioli, G., Mirko P., Fabio A.: Recency aware collaborative filtering for next basket recommendation. In: Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization, pp. 80–87. Association for Computing Machinery, New York (2020)
Che et al.: Inter-basket and intra-basket adaptive attention network for next basket recommendation. IEEE Access 7, 80644-80650 (2019)
Xia, Y. Giuseppe, D. F., Shikhar, V., Ankur, D.: A content-based recommender system for ecommerce offers and coupons. In: Proceedings of the SIGIR 2017 eCom workshop. eCOM@ SIGIR, Tokyo (2017)
Dou, X.: Online purchase behavior prediction and analysis using ensemble learning. In : 2020 IEEE 5th International Conference on Cloud Computing and Big Data Analytics (ICCCBDA), pp. 532–536. IEEE (2020)
Prokhorenkova, L., Gleb G., Aleksandr V., Anna V. D., Andrey G.: CatBoost: unbiased boosting with categorical features (2017) Available via arXiv
Tahiri, N., Mazoure, B. and Makarenkov, V.: An intelligent shopping list based on the application of partitioning and machine learning algorithms. In: Proceedings of the 18th Python in Science Conference (SCIPY 2019), pp. 85–92. Austin, Texas (2019)
Kotsiantis, S. B., Kanellopoulos, D., Pintelas, P. E.: Data preprocessing for supervised leaning. Int. J. Comput. Sci. 2, 111–117 (2006)
García, S., Luengo, J., Herrera, F.: Data Preprocessing in Data Mining. Springer, Cham, Switzerland (2015)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol. 1, no. 14, pp. 281–297 (1967)
Ester, M., Kriegel, H. P., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. AAAI Press, pp. 226–231. AAAI Press, Portland, Oregon (1996)
Pedregosa et al.: Scikit-learn: Machine Learning in Python. JMLR 12, 2825-2830 (2011)
Davies, D. L., Bouldin, D.W.:ACluster Separation Measure. In: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-1, no. 2, pp. 224–227 (1979)
Van der Maaten, L. J. P., Hinton, G.: Visualizing High-Dimensional Data Using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2023 The Author(s)
About this paper
Cite this paper
Chabane, N., Bouaoune, M.A., Tighilt, R.A.S., Mazoure, B., Tahiri, N., Makarenkov, V. (2023). Using Clustering and Machine Learning Methods to Provide Intelligent Grocery Shopping Recommendations. In: Brito, P., Dias, J.G., Lausen, B., Montanari, A., Nugent, R. (eds) Classification and Data Science in the Digital Age. IFCS 2022. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-031-09034-9_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-09034-9_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-09033-2
Online ISBN: 978-3-031-09034-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)