Abstract
Decision makers in the tourism sector deal with various issues and need high-quality information to support their decisions. We propose a data-centric approach that analyses historical point of interest (POI) check-in data to determine parameters for an Agent Based Model (ABM). ABM simulation is then run multiple times to simulate possible outcomes in terms of the tourist flow. We have tested the proposed approach on the city of Salzburg using check-in data from Salzburg Card users across 29 POIs. These data were used to parameterize the ABM model with the number of people, the number of POIs a person visits per day, and the preference for selecting POIs to visit. The simulation was performed in GAMA ABM platform and the spatial environment was based on buildings and roads from OpenStreetMap (OSM). Simulation for the duration of 1 day has been repeated 50 times to generate POI visiting patterns. The simulation results have been compared to the ground truth data for the same day and they show that the approach can recreate the long-term pattern of POI visits, but has over-estimated several POIs that had lower visitor counts on that specific day.
Supported by FFG project number FO999887513: “Digital Tourism for Sustainability (dTS)”.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Tourism sector deals with various issues and requires reliable information to support decision making. One tool that is established for this task is the ABM [1], which is a computational approach for modeling complex systems consisting of interacting autonomous agents and simulating their activities [2]. Contrary to the top-down approaches that can estimate total number of tourists and provide a high level overview [3], the bottom-up approach of ABM defines and tracks behaviors of each individual tourist and can answer “what if” scenarios. Thus, the simulations generated by ABM can provide valuable insights to decision makers in dealing with current and future situations (e.g., what happens if some POIs are closed or the number of tourists is doubled?).
For ABM to be successful in creating realistic simulations, it first needs to be initialized with the right parameters for agents’ behavior. These parameters can be obtained by analyzing historical data, such as POI check-in data in the case of this study, for tourist flow patterns and behavior. Here we propose an approach for building a spatio-temporal ABM to simulate the behavior of tourists in the city of Salzburg based on the historical data of POI check-ins with tourist card. The main research question of this paper is can ABM adequately simulate tourists’ visiting patterns in terms of visits to POIs in one day? The practical contribution of this study is the extraction of ABM simulation parameters from check-in data format which can be recreated for other similar input datasets. Theoretical novelty, which we have not seen elsewhere, is in the use of frequent itemset mining to define agents’ decision making for the next destination.
2 Related Work
Baktash et al. [1] recently reviewed existing literature on ABM in tourism. According to their classification, our study falls between the tourist flow management and tourist decision making. Recent studies in these fields have looked at ABM for tourist flow management for 41 attractions in Sichuan [4], the spatial spillover effects across 314 Chinese cities [5], and user generated content analysis to deduce desired destinations [6]. What stands out about [1] is that there is no mention nor discussion of the role of machine learning (ML) and artificial intelligence (AI) for ABM in tourism. This shows us that even if such trend exists in other fields, there is a gap in the tourism ABM with potential to improve tourist behavior modeling by using ML and AI on historical data. Additionally, the tourist check-in data that is used here is not commonly found in literature and presents a new challenge.
3 Methodology
The proposed approach starts by analyzing the proprietary POI check-in data obtained from Salzburg CardFootnote 1 users (Fig. 1) who are usually short term visitors to Salzburg. The data hold anonymous check-ins into 29 different POIs from 2017 to 2019 where each row shows a unique user identifier, the name of the POI, and the date of the check-in.
There are three parameters of the ABM environment that need to be determined from the input data via statistical analysis and data mining including the number of tourists, the number of POIs visited per day, and the POI selection preference by tourists. The number of tourists for the day of the simulation is set to be the same as the number of active tourists on the previous day in the data. For the number of POIs that tourists visit per day, we consider the distribution of these values for all the days in the data prior to the simulation day. We then use the mean, and standard deviation values to randomly select a number of POIs that each agent in the simulation has to visit using the gauss (mean, standard_deviation) GAMA function. The POI selection preference is defined by mining frequent itemsets of length 1 on individual POIs. The supports for POIs were calculated using the check-ins up until the day of the simulation and the support expresses the ratio of tourists that have visited the POI. The supports are then used as weights in the weighted random choice function used by tourists in the simulation to select their next destination.
The next step is to run multiple simulations of tourist flow in the city of Salzburg for the specified simulation day using the GAMA ABM platformFootnote 2 which supports the use of spatial datasets for defining the model environment [7]. We use OSM, the most prominent Volunteered Geographic Information (VGI) dataset [8], as a source of 2D vector information about road networks on which agents can move and building footprints as their origins and destinations.
The results of the simulation runs can be aggregated and compared to the ground truth POI check-in data for evaluation.
4 Experiment and Results
The experiment was performed on a standard PC with an I7 processor and 16 GB RAM running a Windows operating system. POIs were represented as point geometries sourced directly from Salzburg Card. The agents were initialized as sleeping at accommodations and would start visiting POIs around 8AM and would finish their visits when they have visited the requested number of POIs for the day. We ran 50 simulations for the simulation day March 1, 2019.
Figure 2 shows the histogram of visited POIs by a single user per day. The leftmost histogram shows the data from January 1, 2017 until March 1, 2019 which were used to train tourist-agents in the ABM simulation. The middle and right histograms show the simulation result for March 1, 2019 and the respective ground truth data. It can be noticed that the counts of simulated POI visits per agent (middle) are distributed similarly to the actual POI visits per tourist on the simulation day (right), which is also confirmed by the chi-squared test value of \(\chi ^2=0.1455\) between these two histograms.
In Fig. 3 we list the 29 POIs within the City of Salzburg and their average visiting probability per tourist (dashed line). We then ran 50 simulation runs and analyzed the quantitative visitor numbers for each POI. Here, the blue bars represent the true visitor numbers for the simulation day (March 1, 2019), and the red line depicts the mean visitor numbers across all 50 simulations with 95% confidence interval indicated by the pink area.
5 Conclusion and Future Work
We proposed a tourist flow simulation approach based on ABM that is parameterized by historical data of POI check-ins. The three parameters for the ABM simulation, the number of people agents, number of POIs each agent visits per day, and the selection of POIs that agents visit were determined from historical data analysis. The simulation was performed for a single day on the model of the city of Salzburg and repeated 50 times. The simulation results were then compared to the ground truth POI check-in data for the same day.
The results show that the simulations are able to create overall realistic patterns of POI check-ins. However, Fig. 3 shows discrepancies in some POIs where the simulated numbers are much larger than true values. The simulated values for these POIs are similar to the long term trend shown with dashed line (e.g., POI Entritt Schloss & Wasserspeile Hellbrunn) which is observed over the training period. Our method is limited in reflecting seasonal or daily changes where POI may be closed on the simulation day. Thus, we need to develop a more sophisticated POI selection approach for the simulation that is based on a more detailed trend analysis. We should also increase the behavior complexity of agents, relying on theories from social sciences.
Notes
- 1.
https://www.salzburg.info/en/hotels-offers/salzburg-card, Data can only be shared within the dTS project consortium.
- 2.
References
Baktash, A., Huang, A., de la Mora Velasco, E., Jahromi, M.F., Bahja, F.: Agent-based modelling for tourism research. Curr. Issues Tourism, 1–13 (2022)
Batty, M., Crooks, A.T., See, L.M., Heppenstall, A.J.: Perspectives on agent-based models and geographical systems. In: Heppenstall, A., Crooks, A., See, L., Batty, M. (eds.) Agent-Based Models of Geographical Systems. Springer, Dordrecht (2012). https://doi.org/10.1007/978-90-481-8927-4_1
Lemmel, J., et al.: Deep-learning vs regression: prediction of tourism flow with limited data. In: International Joint Conference on Artificial Intelligence IJCAI 2022. Workshop AI for Time Series Analysis AI4TS, July 2022
Qiu, R., Xu, W., Li, S.: Agent-based modeling of the spatial diffusion of tourist flow—a case study of Sichuan China. J. China Tourism Res. 12(1), 85–107 (2016)
Li, S., Yang, Y., Zhong, Z., Tang, X.: Agent-based modeling of spatial spillover effects in visitor flows. J. Travel Res. 60(3), 546–563 (2021)
Zhang, S., Zhen, F., Wang, B., Li, Z., Qin, X.: Coupling social media and agent-based modelling: a novel approach for supporting smart tourism planning. J. Urban Technol. 29(2), 79–97 (2022)
Taillandier, P., et al.: Building, composing and experimenting complex spatial models with the GAMA platform. Geoinformatica 23(2), 299–322 (2019). https://doi.org/10.1007/s10707-018-00339-6
Sui, D., Elwood, S., Goodchild, M.: Crowdsourcing Geographic Knowledge: Volunteered Geographic Information (VGI) in Theory and Practice. Springer Science & Business Media, Berlin (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2023 The Author(s)
About this paper
Cite this paper
Majic, I., Scholz, J., Bulbul, R., Wallinger, S. (2023). Tourist Flow Simulation in GAMA Using Historical Data Parameters. In: Ferrer-Rosell, B., Massimo, D., Berezina, K. (eds) Information and Communication Technologies in Tourism 2023. ENTER 2023. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-031-25752-0_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-25752-0_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25751-3
Online ISBN: 978-3-031-25752-0
eBook Packages: Business and ManagementBusiness and Management (R0)