Abstract
Batch runs corresponding to the same recipe usually have different duration. The data collected by the sensors that equip batch production lines reflects this fact: time series with different lengths and unsynchronized events. Dynamic Time Warping (DTW) is an algorithm successfully used, in batch monitoring too, to synchronize and map to a standard time axis two series, an action called alignment. The online alignment of running batches, although interesting, gives no information on the remaining time frame of the batch, such as its total runtime, or time-to-end. We notice that this problem is similar to the one addressed by Survival Analysis (SA), a statistical technique of standard use in clinical studies to model time-to-event data. Machine Learning (ML) algorithms adapted to survival data exist, with increased predictive performance with respect to classical formulations. We apply a SA-ML-based system to the problem of predicting the time-to-end of a running batch, and show a new application of DTW. The information returned by openended DTW can be used to select relevant data samples for the SA-ML system, without negatively affecting the predictive performance and decreasing the computational cost with respect to the same SA-ML system that uses all the data available. We tested the system on a real-world dataset coming from a chemical plant.
Chapter PDF
Similar content being viewed by others
References
Gradient boosting survival analysis. https://scikit-survival.readthedocs.io/en/latest/generated/sksurv.ensemble.GradientBoostingSurvivalAnalysis.html, accessed: 2019-06-14
Understanding predictions in survival anlysis. https://scikit-survival.readthedocs.io/en/latest/understanding_predictions.html, accessed: 2019-06-14
Cox, D.R.: Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological) 34(2), 187–202 (1972)
Gao, X.: On-line monitoring of batch process with multiway pca/ica. In: Principal Component Analysis. IntechOpen (2012)
Giorgino, T., et al.: Computing and visualizing dynamic time warping alignments in r: the dtw package. Journal of statistical Software 31(7), 1–24 (2009)
Hothorn, T., B¨uhlmann, P., Dudoit, S., Molinaro, A., Van Der Laan, M.J.: Survival ensembles. Biostatistics 7(3), 355–373 (2005)
Hunter, J.D.: Matplotlib: A 2d graphics environment. Computing in science & engineering 9(3), 90–95 (2007)
Jones, E., Oliphant, T., Peterson, P., et al.: SciPy: Open source scientific tools for Python (2001–), https://www.scipy.org/, [Online; accessed: 2019-06-14]
Kassidas, A., Taylor, P.A., MacGregor, J.F.: Off-line diagnosis of deterministic faults in continuous dynamic multivariable processes using speech recognition methods. Journal of Process Control 8(5), 381 – 393 (1998). https://doi.org/https://doi.org/10.1016/S0959-1524(98)00025-0, aDCHEM ’97 IFAC Symposium: Advanced Control of Chemical Processes
McKinney, W., et al.: Data structures for statistical computing in python. In: Proceedings of the 9th Python in Science Conference. vol. 445, pp. 51–56. Austin, TX (2010)
Oliphant, T.E.: A guide to NumPy, vol. 1. Trelgol Publishing USA (2006)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al.: Scikit-learn: Machine learning in python. Journal of machine learning research 12(Oct), 2825–2830 (2011)
Pölsterl, S., Gupta, P., Wang, L., Conjeti, S., Katouzian, A., Navab, N.: Heterogeneous ensembles for predicting survival of metastatic, castrate-resistant prostate cancer patients. F1000Research 5 (2016)
Pölsterl, S., Navab, N., Katouzian, A.: Fast training of support vector machines for survival analysis. In: Appice, A., Rodrigues, P.P., Santos Costa, V., Gama, J., Jorge, A., Soares, C. (eds.) Machine Learning and Knowledge Discovery in Databases. pp. 243–259. Springer International Publishing, Cham (2015)
Pölsterl, S., Navab, N., Katouzian, A.: An efficient training algorithm for kernel survival support vector machines. arXiv preprint arXiv:1611.07054 (2016)
Ramaker, H., van Sprang, E.: Statistical batch process monitoring (01 2004)
Ramaker, H.J., van Sprang, E.N., Westerhuis, J.A., Smilde, A.K.: Dynamic time warping of spectroscopic batch data. Analytica Chimica Acta 498(1), 133 – 153 (2003). https://doi.org/https://doi.org/10.1016/j.aca.2003.08.045
van Rossum, G.: Python tutorial. Tech. Rep. CS-R9526, Centrum voor Wiskunde en Informatica (CWI), Amsterdam (May 1995)
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE transactions on acoustics, speech, and signal processing 26(1), 43–49 (1978)
Wang, P., Li, Y., Reddy, C.K.: Machine learning for survival analysis: A survey. arXiv preprint arXiv:1708.04649 (2017)
Zhao, J., Itti, L.: Shapedtw: shape dynamic time warping. Pattern Recognition 74, 171–184 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2021 The Author(s)
About this paper
Cite this paper
Graniero, P., Gärtler, M. (2021). Prediction of Batch Processes Runtime Applying Dynamic Time Warping and Survival Analysis. In: Beyerer, J., Maier, A., Niggemann, O. (eds) Machine Learning for Cyber Physical Systems. Technologien für die intelligente Automation, vol 13. Springer Vieweg, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-62746-4_6
Download citation
DOI: https://doi.org/10.1007/978-3-662-62746-4_6
Published:
Publisher Name: Springer Vieweg, Berlin, Heidelberg
Print ISBN: 978-3-662-62745-7
Online ISBN: 978-3-662-62746-4
eBook Packages: EngineeringEngineering (R0)