Abstract
Process mining studies data-driven methods to discover, enhance, and monitor business processes by gathering knowledge from event logs recorded by modern IT systems. To gain valuable process insights, it is essential for process mining users to formalize their process questions as executable queries. For this purpose, we present the Celonis Process Query Language (Celonis PQL), which is a domain-specific language tailored toward a special process data model and designed for business users. It translates process-related business questions into queries and executes them on a custom-built query engine. Celonis PQL covers a broad set of more than 150 operators, ranging from process-specific functions to machine learning and mathematical operators. Its syntax is inspired by SQL, but specialized for process-related queries. In addition, we present practical use cases and real-world applications, which demonstrate the expressiveness of the language and how business users can apply it to discover, enhance, and monitor business processes. The maturity and feasibility of Celonis PQL is shown by thousands of users from different industries, who apply it to various process types and huge amounts of event data every day.
Chapter PDF
Similar content being viewed by others
References
Badakhshan, P., Bernhart, G., Geyer-Klingeberg, J., Nakladal, J., Schenk, S., Vogelgesang, T.: The action engine – turning process insights into action. In: ICPM Demos 2019. CEUR Workshop Proceedings, vol. 2374. CEUR-WS.org (2019). http://ceur-ws.org/Vol-2374/paper8.pdf
Bernstein, P.A., Hadzilacos, V., Goodman, N.: Concurrency Control and Recovery in Database Systems. Addison-Wesley (1987). http://research.microsoft.com/en-us/people/philbe/ccontrol.aspx
Berti, A.: Increasing scalability of process mining using event dataframes: How data structure matters. CoRR abs/1907.12817 (2019). http://arxiv.org/abs/1907.12817
Carmona, J., van Dongen, B.F., Solti, A., Weidlich, M.: Conformance Checking - Relating Processes and Models. Springer (2018). https://doi.org/10.1007/978-3-319-99414-7
Council, T.P.P.: TPC Benchmark H (Decision Support) Standard Specification Revision 2.18.0 (2018). http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.18.0.pdf
Esser, S., Fahland, D.: Storing and querying multi-dimensional process event logs using graph databases. In: Process Querying (PQ) Workshop 2019, pp. 283–294 (2019)
Lang, H., Mühlbauer, T., Funke, F., Boncz, P.A., Neumann, T., Kemper, A.: Data blocks: Hybrid OLTP and OLAP on compressed storage using both vectorization and compilation. In: SIGMOD 2016, pp. 311–326. ACM (2016). https://doi.org/10.1145/2882903.2882925
Leemans, S.J.J., Fahland, D., van der Aalst, W.M.P.: Discovering block-structured process models from event logs - A constructive approach. In: PETRI NETS 2013. Lecture Notes in Computer Science, vol. 7927, pp. 311–329. Springer (2013). https://doi.org/10.1007/978-3-642-38697-8_17
Leis, V., Boncz, P.A., Kemper, A., Neumann, T.: Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age. In: SIGMOD 2014, pp. 743–754. ACM (2014). https://doi.org/10.1145/2588555.2610507
Müller, I., Ratsch, C., Färber, F.: Adaptive string dictionary compression in in-memory column-store database systems. In: Proceedings of the EDBT 2014, pp. 283–294. OpenProceedings.org (2014). https://doi.org/10.5441/002/edbt.2014.27
Neumann, T.: Efficiently compiling efficient query plans for modern hardware. PVLDB 4(9), 539–550 (2011). https://doi.org/10.14778/2002938.2002940. http://www.vldb.org/pvldb/vol4/p539-neumann.pdf
OMG: Business Process Model and Notation (BPMN), Version 2.0 (2011). http://www.omg.org/spec/BPMN/2.0
Polyvyanyy, A., Ouyang, C., Barros, A., van der Aalst, W.M.P.: Process querying: Enabling business intelligence through query-based process analytics. Decis. Support Syst. 100, 41–56 (2017). https://doi.org/10.1016/j.dss.2017.04.011
Schüle, M., Schliski, P., Hutzelmann, T., Rosenberger, T., Leis, V., Vorona, D., Kemper, A., Neumann, T.: Monopedia: Staying single is good enough - the hyper way for Web scale applications. PVLDB 10(12), 1921–1924 (2017). https://doi.org/10.14778/3137765.3137809. http://www.vldb.org/pvldb/vol10/p1921-schuele.pdf
van der Aalst, W.M.P.: Process Mining - Data Science in Action, Second Edition. Springer (2016). https://doi.org/10.1007/978-3-662-49851-4
Veit, F., Geyer-Klingeberg, J., Madrzak, J., Haug, M., Thomson, J.: The proactive insights engine: Process mining meets machine learning and artificial intelligence. In: BPM Demos 2017. CEUR Workshop Proceedings, vol. 1920. CEUR-WS.org (2017). http://ceur-ws.org/Vol-1920/BPM_2017_paper_192.pdf
Weijters, A.J.M.M., Ribeiro, J.T.S.: Flexible heuristics miner (FHM). In: Proceedings of the IEEE CIDM 2011, pp. 310–317. IEEE (2011). https://doi.org/10.1109/CIDM.2011.5949453
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2022 The Author(s)
About this chapter
Cite this chapter
Vogelgesang, T., Ambrosy, J., Becher, D., Seilbeck, R., Geyer-Klingeberg, J., Klenk, M. (2022). Celonis PQL: A Query Language for Process Mining. In: Polyvyanyy, A. (eds) Process Querying Methods. Springer, Cham. https://doi.org/10.1007/978-3-030-92875-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-92875-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92874-2
Online ISBN: 978-3-030-92875-9
eBook Packages: Computer ScienceComputer Science (R0)