Abstract
Classification of jets with deep learning has gained significant attention in recent times. However, the performance of deep neural networks is often achieved at the cost of interpretability. Here we propose an interpretable network trained on the jet spectrum S2(R) which is a two-point correlation function of the jet constituents. The spectrum can be derived from a functional Taylor series of an arbitrary jet classifier function of energy flows. An interpretable network can be obtained by truncating the series. The intermediate feature of the network is an infrared and collinear safe C-correlator which allows us to estimate the importance of an S2(R) deposit at an angular scale R in the classification. The performance of the architecture is comparable to that of a convolutional neural network (CNN) trained on jet images, although the number of inputs and complexity of the architecture is significantly simpler than the CNN classifier. We consider two examples: one is the classification of two-prong jets which differ in color charge of the mother particle, and the other is a comparison between Pythia 8 and Herwig 7 generated jets.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
J.M. Butterworth, A.R. Davison, M. Rubin and G.P. Salam, Jet substructure as a new Higgs search channel at the LHC, Phys. Rev. Lett.100 (2008) 242001 [arXiv:0802.2470] [INSPIRE].
J. Thaler and L.-T. Wang, Strategies to Identify Boosted Tops, JHEP07 (2008) 092 [arXiv:0806.0023] [INSPIRE].
D.E. Kaplan, K. Rehermann, M.D. Schwartz and B. Tweedie, Top Tagging: A Method for Identifying Boosted Hadronically Decaying Top Quarks, Phys. Rev. Lett.101 (2008) 142001 [arXiv:0806.0848] [INSPIRE].
T. Plehn, G.P. Salam and M. Spannowsky, Fat Jets for a Light Higgs, Phys. Rev. Lett.104 (2010) 111801 [arXiv:0910.5472] [INSPIRE].
T. Plehn, M. Spannowsky, M. Takeuchi and D. Zerwas, Stop Reconstruction with Tagged Tops, JHEP10 (2010) 078 [arXiv:1006.2833] [INSPIRE].
D.E. Soper and M. Spannowsky, Finding physics signals with shower deconstruction, Phys. Rev.D 84 (2011) 074002 [arXiv:1102.3480] [INSPIRE].
D.E. Soper and M. Spannowsky, Finding top quarks with shower deconstruction, Phys. Rev.D 87 (2013) 054012 [arXiv:1211.3140] [INSPIRE].
M. Dasgupta, A. Fregoso, S. Marzani and G.P. Salam, Towards an understanding of jet substructure, JHEP09 (2013) 029 [arXiv:1307.0007] [INSPIRE].
D.E. Soper and M. Spannowsky, Finding physics signals with event deconstruction, Phys. Rev.D 89 (2014) 094005 [arXiv:1402.1189] [INSPIRE].
A.J. Larkoski, S. Marzani, G. Soyez and J. Thaler, Soft Drop, JHEP05 (2014) 146 [arXiv:1402.2657] [INSPIRE].
J. Gallicchio and M.D. Schwartz, Seeing in Color: Jet Superstructure, Phys. Rev. Lett.105 (2010) 022001 [arXiv:1001.5027] [INSPIRE].
J. Thaler and K. Van Tilburg, Identifying Boosted Objects with N-subjettiness, JHEP03 (2011) 015 [arXiv:1011.2268] [INSPIRE].
J. Gallicchio and M.D. Schwartz, Quark and Gluon Tagging at the LHC, Phys. Rev. Lett.107 (2011) 172001 [arXiv:1106.3076] [INSPIRE].
Y.-T. Chien, Telescoping jets: Probing hadronic event structure with multiple R’s, Phys. Rev.D 90 (2014) 054008 [arXiv:1304.5240] [INSPIRE].
A.J. Larkoski, G.P. Salam and J. Thaler, Energy Correlation Functions for Jet Substructure, JHEP06 (2013) 108 [arXiv:1305.0007] [INSPIRE].
A.J. Larkoski, I. Moult and D. Neill, Power Counting to Better Jet Observables, JHEP12 (2014) 009 [arXiv:1409.6298] [INSPIRE].
I. Moult, L. Necib and J. Thaler, New Angles on Energy Correlation Functions, JHEP12 (2016) 153 [arXiv:1609.07483] [INSPIRE].
L.G. Almeida, M. Backović, M. Cliche, S.J. Lee and M. Perelstein, Playing Tag with ANN: Boosted Top Identification with Pattern Recognition, JHEP07 (2015) 086 [arXiv:1501.05968] [INSPIRE].
L. de Oliveira, M. Kagan, L. Mackey, B. Nachman and A. Schwartzman, Jet-images — deep learning edition, JHEP07 (2016) 069 [arXiv:1511.05190] [INSPIRE].
P.T. Komiske, E.M. Metodiev and M.D. Schwartz, Deep learning in color: towards automated quark/gluon jet discrimination, JHEP01 (2017) 110 [arXiv:1612.01551] [INSPIRE].
G. Kasieczka, T. Plehn, M. Russell and T. Schell, Deep-learning Top Taggers or The End of QCD?, JHEP05 (2017) 006 [arXiv:1701.08784] [INSPIRE].
G. Louppe, K. Cho, C. Becot and K. Cranmer, QCD-Aware Recursive Neural Networks for Jet Physics, JHEP01 (2019) 057 [arXiv:1702.00748] [INSPIRE].
P.T. Komiske, E.M. Metodiev, B. Nachman and M.D. Schwartz, Pileup Mitigation with Machine Learning (PUMML), JHEP12 (2017) 051 [arXiv:1707.08600] [INSPIRE].
A. Butter, G. Kasieczka, T. Plehn and M. Russell, Deep-learned Top Tagging with a Lorentz Layer, SciPost Phys.5 (2018) 028 [arXiv:1707.08966] [INSPIRE].
T. Cheng, Recursive Neural Networks in Quark/Gluon Tagging, Comput. Softw. Big Sci.2 (2018) 3 [arXiv:1711.02633] [INSPIRE].
A. Andreassen, I. Feige, C. Frye and M.D. Schwartz, JUNIPR: a Framework for Unsupervised Machine Learning in Particle Physics, Eur. Phys. J.C 79 (2019) 102 [arXiv:1804.09720] [INSPIRE].
S. Choi, S.J. Lee and M. Perelstein, Infrared Safety of a Neural-Net Top Tagging Algorithm, JHEP02 (2019) 132 [arXiv:1806.01263] [INSPIRE].
S.H. Lim and M.M. Nojiri, Spectral Analysis of Jet Substructure with Neural Networks: Boosted Higgs Case, JHEP10 (2018) 181 [arXiv:1807.03312] [INSPIRE].
F.A. Dreyer, G.P. Salam and G. Soyez, The Lund Jet Plane, JHEP12 (2018) 064 [arXiv:1807.04758] [INSPIRE].
J. Lin, M. Freytsis, I. Moult and B. Nachman, Boosting \( H\to b\overline{b} \)with Machine Learning, JHEP10 (2018) 101 [arXiv:1807.10768] [INSPIRE].
P.T. Komiske, E.M. Metodiev and J. Thaler, Energy Flow Networks: Deep Sets for Particle Jets, JHEP01 (2019) 121 [arXiv:1810.05165] [INSPIRE].
J. Arjona Martínez, O. Cerri, M. Pierini, M. Spiropulu and J.-R. Vlimant, Pileup mitigation at the Large Hadron Collider with Graph Neural Networks, Eur. Phys. J. Plus134 (2019) 333 [arXiv:1810.07988] [INSPIRE].
G. Kasieczka, N. Kiefer, T. Plehn and J.M. Thompson, Quark-gluon Tagging: Machine Learning vs. Detector, SciPost Phys.6 (2019) 069 [arXiv:1812.09223] [INSPIRE].
H. Qu and L. Gouskos, ParticleNet: Jet Tagging via Particle Clouds, arXiv:1902.08570 [INSPIRE].
A.J. Larkoski, I. Moult and B. Nachman, Jet Substructure at the Large Hadron Collider: A Review of Recent Advances in Theory and Machine Learning, arXiv:1709.04464 [INSPIRE].
L. Asquith et al., Jet Substructure at the Large Hadron Collider: Experimental Review, arXiv:1803.06991 [INSPIRE].
L.M. Dery, B. Nachman, F. Rubbo and A. Schwartzman, Weakly Supervised Classification in High Energy Physics, JHEP05 (2017) 145 [arXiv:1702.00414] [INSPIRE].
T. Cohen, M. Freytsis and B. Ostdiek, (Machine) Learning to Do More with Less, JHEP02 (2018) 034 [arXiv:1706.09451] [INSPIRE].
E.M. Metodiev, B. Nachman and J. Thaler, Classification without labels: Learning from mixed samples in high energy physics, JHEP10 (2017) 174 [arXiv:1708.02949] [INSPIRE].
G. Louppe, M. Kagan and K. Cranmer, Learning to Pivot with Adversarial Networks, arXiv:1611.01046 [INSPIRE].
C. Shimmin et al., Decorrelated Jet Substructure Tagging using Adversarial Neural Networks, Phys. Rev.D 96 (2017) 074034 [arXiv:1703.03507] [INSPIRE].
A. Chakraborty, A.M. Iyer and T.S. Roy, A Framework for Finding Anomalous Objects at the LHC, Nucl. Phys.B 932 (2018) 439 [arXiv:1707.07084] [INSPIRE].
J. Hajer, Y.-Y. Li, T. Liu and H. Wang, Novelty Detection Meets Collider Physics, arXiv:1807.10261 [INSPIRE].
T. Heimel, G. Kasieczka, T. Plehn and J.M. Thompson, QCD or What?, SciPost Phys.6 (2019) 030 [arXiv:1808.08979] [INSPIRE].
M. Farina, Y. Nakai and D. Shih, Searching for New Physics with Deep Autoencoders, arXiv:1808.08992 [INSPIRE].
O. Cerri, T.Q. Nguyen, M. Pierini, M. Spiropulu and J.-R. Vlimant, Variational Autoencoders for New Physics Mining at the Large Hadron Collider, JHEP05 (2019) 036 [arXiv:1811.10276] [INSPIRE].
T.S. Roy and A.H. Vijay, A robust anomaly finder based on autoencoder, arXiv:1903.02032 [INSPIRE].
T. Roxlo and M. Reece, Opening the black box of neural nets: case studies in stop/top discrimination, arXiv:1804.09278 [INSPIRE].
J. Brehmer, K. Cranmer, G. Louppe and J. Pavez, Constraining Effective Field Theories with Machine Learning, Phys. Rev. Lett.121 (2018) 111801 [arXiv:1805.00013] [INSPIRE].
J. Brehmer, K. Cranmer, G. Louppe and J. Pavez, A Guide to Constraining Effective Field Theories with Machine Learning, Phys. Rev.D 98 (2018) 052004 [arXiv:1805.00020] [INSPIRE].
J. Guo, J. Li, T. Li, F. Xu and W. Zhang, Deep learning for R-parity violating supersymmetry searches at the LHC, Phys. Rev.D 98 (2018) 076017 [arXiv:1805.10730] [INSPIRE].
J.H. Collins, K. Howe and B. Nachman, Anomaly Detection for Resonant New Physics with Machine Learning, Phys. Rev. Lett.121 (2018) 241803 [arXiv:1805.02664] [INSPIRE].
R.T. D’Agnolo and A. Wulzer, Learning New Physics from a Machine, Phys. Rev.D 99 (2019) 015014 [arXiv:1806.02350] [INSPIRE].
A. De Simone and T. Jacques, Guiding New Physics Searches with Unsupervised Learning, Eur. Phys. J.C 79 (2019) 289 [arXiv:1807.06038] [INSPIRE].
C. Englert, P. Galler, A. Pilkington and M. Spannowsky, Approaching robust EFT limits for CP-violation in the Higgs sector, Phys. Rev.D 99 (2019) 095007 [arXiv:1901.05982] [INSPIRE].
J.H. Collins, K. Howe and B. Nachman, Extending the search for new resonances with machine learning, Phys. Rev.D 99 (2019) 014038 [arXiv:1902.02634] [INSPIRE].
Z.C. Lipton, The mythos of model interpretability, in proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, U.S.A., 23 June 2016, arXiv:1606.03490.
K. Datta, A. Larkoski and B. Nachman, Automating the Construction of Jet Observables with Machine Learning, arXiv:1902.07180 [INSPIRE].
D. Krohn, J. Thaler and L.-T. Wang, Jet Trimming, JHEP02 (2010) 084 [arXiv:0912.1342] [INSPIRE].
F.V. Tkachov, Measuring multijet structure of hadronic energy flow or, what is a jet?, Int. J. Mod. Phys.A 12 (1997) 5411 [hep-ph/9601308] [INSPIRE].
M.M. Bronstein, J. Bruna, Y. LeCun, A. Szlam and P. Vandergheynst, Geometric Deep Learning: Going beyond Euclidean data, IEEE Sig. Proc. Mag.34 (2017) 18 [arXiv:1611.08097] [INSPIRE].
J. Cogan, M. Kagan, E. Strauss and A. Schwarztman, Jet-Images: Computer Vision Inspired Techniques for Jet Tagging, JHEP02 (2015) 118 [arXiv:1407.5675] [INSPIRE].
S.D. Ellis, C.K. Vermilion and J.R. Walsh, Techniques for improved heavy particle searches with jet substructure, Phys. Rev.D 80 (2009) 051501 [arXiv:0903.5081] [INSPIRE].
S.D. Ellis, C.K. Vermilion and J.R. Walsh, Recombination Algorithms and Jet Substructure: Pruning as a Tool for Heavy Particle Searches, Phys. Rev.D 81 (2010) 094023 [arXiv:0912.0033] [INSPIRE].
T. Sjöstrand et al., An Introduction to PYTHIA 8.2, Comput. Phys. Commun.191 (2015) 159 [arXiv:1410.3012] [INSPIRE].
J. Bellm et al., HERWIG 7.0/HERWIG++ 3.0 release note, Eur. Phys. J.C 76 (2016) 196 [arXiv:1512.01178] [INSPIRE].
M. Bahr et al., HERWIG++ Physics and Manual, Eur. Phys. J.C 58 (2008) 639 [arXiv:0803.0883] [INSPIRE].
P.T. Komiske, E.M. Metodiev and J. Thaler, Energy flow polynomials: A complete linear basis for jet substructure, JHEP04 (2018) 013 [arXiv:1712.07124] [INSPIRE].
J. Alwall et al., The automated computation of tree-level and next-to-leading order differential cross sections and their matching to parton shower simulations, JHEP07 (2014) 079 [arXiv:1405.0301] [INSPIRE].
C. Degrande, B. Fuks, V. Hirschi, J. Proudom and H.-S. Shao, Automated next-to-leading order predictions for new physics at the LHC: the case of colored scalar pair production, Phys. Rev.D 91 (2015) 094005 [arXiv:1412.5589] [INSPIRE].
FeynRules models to be used for NLO calculations with aMC@NLO, (2019) https://feynrules.irmp.ucl.ac.be/wiki/NLOModels.
A. Alloul, N.D. Christensen, C. Degrande, C. Duhr and B. Fuks, FeynRules 2.0 — A complete toolbox for tree-level phenomenology, Comput. Phys. Commun.185 (2014) 2250 [arXiv:1310.1921] [INSPIRE].
C. Degrande, C. Duhr, B. Fuks, D. Grellscheid, O. Mattelaer and T. Reiter, UFO — The Universal FeynRules Output, Comput. Phys. Commun.183 (2012) 1201 [arXiv:1108.2040] [INSPIRE].
DELPHES 3 collaboration, DELPHES 3, A modular framework for fast simulation of a generic collider experiment, JHEP02 (2014) 057 [arXiv:1307.6346] [INSPIRE].
M. Cacciari, G.P. Salam and G. Soyez, FastJet User Manual, Eur. Phys. J.C 72 (2012) 1896 [arXiv:1111.6097] [INSPIRE].
M. Cacciari and G.P. Salam, Dispelling the N 3myth for the k tjet-finder, Phys. Lett.B 641 (2006) 57 [hep-ph/0512210] [INSPIRE].
M. Cacciari, G.P. Salam and G. Soyez, The anti-k tjet clustering algorithm, JHEP04 (2008) 063 [arXiv:0802.1189] [INSPIRE].
B.R. Webber, QCD Jets and Parton Showers, in proceedings of the Gribov-80 Memorial Workshop on Quantum Chromodynamics and Beyond, Trieste, Italy, 26–28 May 2010, pp. 82–92 [https://doi.org/10.1142/9789814350198_0010] [arXiv:1009.5871] [INSPIRE].
B. Bhattacherjee, S. Mukhopadhyay, M.M. Nojiri, Y. Sakaki and B.R. Webber, Associated jet and subjet rates in light-quark and gluon jet discrimination, JHEP04 (2015) 131 [arXiv:1501.04794] [INSPIRE].
A. Krogh and J.A. Hertz, A simple weight decay can improve generalization, in proceedings of the Advances in Neural Information Processing Systems 4 (NIPS 1991), Denver, Colorado, U.S.A., 2–5 December 1991, J.E. Moody, S.J. Hanson and R.P. Lippmann eds., Morgan-Kaufmann (1992), pp. 950–957.
D.P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization, arXiv:1412.6980 [INSPIRE].
X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, in proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Chia Laguna Resort, Sardinia, Italy, 13–15 May 2010, Y.W. Teh and M. Titterington eds., pp. 249–256 [Proc. Mach. Learn. Res.9 (2010) 249].
F. Chollet et al., Keras, (2015) https://keras.io.
M. Abadi et al., TensorFlow: Large-scale machine learning on heterogeneous systems, (2015) https://www.tensorflow.org/.
D. Alvarez Melis and T. Jaakkola, Towards robust interpretability with self-explaining neural networks, in proceedings of the Advances in Neural Information Processing Systems 31 (NIPS 2018), Montreal, Canada, 3–8 December 2018, S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi and R. Garnett eds., Curran Associates, Inc. (2018), pp. 7786–7795 [arXiv:1806.07538].
D. Clevert, T. Unterthiner and S. Hochreiter, Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs), in proceedings of the International Conference on Learning Representations (ICLR), Caribe Hilton, San Juan, Puerto Rico, 2–4 May 2016, arXiv:1511.07289.
K. He, X. Zhang, S. Ren and J. Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, in proceedings of the IEEE International Conference on Computer Vision (ICCV), Washington, D.C., U.S.A., 7–13 December 2015, arXiv:1502.01852 [INSPIRE].
M.T. Ribeiro, S. Singh and C. Guestrin, Model-Agnostic Interpretability of Machine Learning, in proceedings of the 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, U.S.A., 23 June 2016, arXiv:1606.05386.
R.D. Ball et al., Parton distributions with LHC data, Nucl. Phys.B 867 (2013) 244 [arXiv:1207.1303] [INSPIRE].
P. Skands, S. Carrazza and J. Rojo, Tuning PYTHIA 8.1: the Monash 2013 Tune, Eur. Phys. J.C 74 (2014) 3024 [arXiv:1404.5630] [INSPIRE].
Herwig collaboration, Minimum-bias and underlying-event tunes, (2015) https://herwig.hepforge.org/tutorials/mpi/tunes.html.
S. Gieseke, C. Rohr and A. Siodmok, Colour reconnections in HERWIG++, Eur. Phys. J.C 72 (2012) 2225 [arXiv:1206.0041] [INSPIRE].
Open Access
This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.
Author information
Authors and Affiliations
Corresponding author
Additional information
ArXiv ePrint: 1904.02092
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chakraborty, A., Lim, S.H. & Nojiri, M.M. Interpretable deep learning for two-prong jet classification with jet spectra. J. High Energ. Phys. 2019, 135 (2019). https://doi.org/10.1007/JHEP07(2019)135
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/JHEP07(2019)135