Abstract
Analyses of collider data, often assisted by modern Machine Learning methods, condense a number of observables into a few powerful discriminants for the separation of the targeted signal process from the contributing backgrounds. These discriminants are highly correlated with important physical observables; using them in the event selection thus leads to the distortion of physically relevant distributions. We present a novel method based on a differentiable estimate of mutual information, a measure of non-linear dependency between variables, to construct a discriminant that is statistically independent of a number of selected observables, and so manages to preserve their distributions in the event selection. Our strategy is evaluated in a realistic setting, the analysis of the Standard Model Higgs boson decaying into a pair of bottom quarks. Using the distribution of the invariant mass of the di-b-jet system to extract the Higgs boson signal strength, our method achieves state-of-the-art performance compared to other decorrelation techniques, while significantly improving the sensitivity of a similar, cut-based, analysis published by ATLAS.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
ATLAS collaboration, Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC, Phys. Lett. B 716 (2012) 1 [arXiv:1207.7214] [INSPIRE].
CMS collaboration, Observation of a New Boson at a Mass of 125 GeV with the CMS Experiment at the LHC, Phys. Lett. B 716 (2012) 30 [arXiv:1207.7235] [INSPIRE].
ATLAS collaboration, Observation of Higgs boson production in association with a top quark pair at the LHC with the ATLAS detector, Phys. Lett. B 784 (2018) 173 [arXiv:1806.00425] [INSPIRE].
CMS collaboration, Observation of the Higgs boson decay to a pair of τ leptons with the CMS detector, Phys. Lett. B 779 (2018) 283 [arXiv:1708.00373] [INSPIRE].
ATLAS collaboration, Observation and measurement of Higgs boson decays to WW* with the ATLAS detector, Phys. Rev. D 92 (2015) 012006 [arXiv:1412.2641] [INSPIRE].
ATLAS and CMS collaborations, Measurements of the Higgs boson production and decay rates and constraints on its couplings from a combined ATLAS and CMS analysis of the LHC pp collision data at \( \sqrt{s} \) = 7 and 8 TeV, JHEP 08 (2016) 045 [arXiv:1606.02266] [INSPIRE].
ATLAS collaboration, Combined measurements of Higgs boson production and decay using up to 80 fb−1 of proton-proton collision data at \( \sqrt{s} \) = 13 TeV collected with the ATLAS experiment, ATLAS-CONF-2018-031 (2018) [INSPIRE].
CMS collaboration, Observation of Higgs boson decay to bottom quarks, Phys. Rev. Lett. 121 (2018) 121801 [arXiv:1808.08242] [INSPIRE].
ATLAS collaboration, Observation of H → \( b\overline{b} \) decays and VH production with the ATLAS detector, Phys. Lett. B 786 (2018) 59 [arXiv:1808.08238] [INSPIRE].
K. Cranmer, J. Pavez and G. Louppe, Approximating Likelihood Ratios with Calibrated Discriminative Classifiers, arXiv:1506.02169 [INSPIRE].
ATLAS collaboration, Performance of mass-decorrelated jet substructure observables for hadronic two-body decay tagging in ATLAS, ATL-PHYS-PUB-2018-014 (2018) [INSPIRE].
C. Shimmin et al., Decorrelated Jet Substructure Tagging using Adversarial Neural Networks, Phys. Rev. D 96 (2017) 074034 [arXiv:1703.03507] [INSPIRE].
G. Kasieczka and D. Shih, DisCo Fever: Robust Networks Through Distance Correlation, arXiv:2001.05310 [INSPIRE].
G. Louppe, M. Kagan and K. Cranmer, Learning to Pivot with Adversarial Networks, arXiv:1611.01046 [INSPIRE].
J. Alwall et al., The automated computation of tree-level and next-to-leading order differential cross sections and their matching to parton shower simulations, JHEP 07 (2014) 079 [arXiv:1405.0301] [INSPIRE].
T. Sjöstrand, S. Mrenna and P.Z. Skands, A Brief Introduction to PYTHIA 8.1, Comput. Phys. Commun. 178 (2008) 852 [arXiv:0710.3820] [INSPIRE].
J. Butterworth et al., Single Boson and Diboson Production Cross Sections in pp Collisions at \( \sqrt{s} \) = 7 TeV, ATL-COM-PHYS-2010-695 (2010).
M. Czakon and A. Mitov, Top++: A Program for the Calculation of the Top-Pair Cross-Section at Hadron Colliders, Comput. Phys. Commun. 185 (2014) 2930 [arXiv:1112.5675] [INSPIRE].
DELPHES 3 collaboration, DELPHES 3, A modular framework for fast simulation of a generic collider experiment, JHEP 02 (2014) 057 [arXiv:1307.6346] [INSPIRE].
ATLAS collaboration, Evidence for the H → \( b\overline{b} \) decay with the ATLAS detector, JHEP 12 (2017) 024 [arXiv:1708.03299] [INSPIRE].
ATLAS collaboration, Expected performance of the ATLAS b-tagging algorithms in Run-2, ATL-PHYS-PUB-2015-022 (2015) [INSPIRE].
F. Nogueira, A python implementation of global optimization with gaussian processes, (2019) https://github.com/fmfn/BayesianOptimization.
B.H. Zhang, B. Lemoine and M. Mitchell, Mitigating unwanted biases with adversarial learning, arXiv:1801.07593.
S. Kullback and R.A. Leibler, On information and sufficiency, Ann. Math. Stat. 22 (1951) 79.
L. Paninski, Estimation of entropy and mutual information, Neural Comput. 15 (2003) 1191.
C.J. Cellucci, A.M. Albano and P.E. Rapp, Statistical validation of mutual information calculations: Comparison of alternative numerical algorithms, Phys. Rev. E 71 (2005) 066208.
X. Nguyen, M.J. Wainwright and M.I. Jordan, Estimating divergence functionals and the likelihood ratio by convex risk minimization, IEEE Trans. Inf. Theory 56 (2010) 5847.
S. Nowozin, B. Cseke and R. Tomioka, f-GAN: Training generative neural samplers using variational divergence minimization, arXiv:1606.00709.
M.I. Belghazi et al., MINE: Mutual information neural estimation, arXiv:1801.04062.
D.J. Rezende and S. Mohamed, Variational inference with normalising flows, in proceedings of the 32nd International Conference on Machine Learning, Lille, France, 7–9 July 2015, pp. 1530–1538 [arXiv:1505.05770].
N. Thiemann, C. Igel, O. Wintenberger and Y. Seldin, A strongly quasiconvex PAC-Bayesian bound, arXiv:1608.05610.
A. Agarwal, A. Beygelzimer, M. Dudik, J. Langford and H. Wallach, A reductions approach to fair classification, in proceedings of the 35th International Conference on Machine Learning, Stockholmsm¨assan, Stockholm, Sweden, 10–15 July 2018, pp. 60–69 [arXiv:1803.02453].
I. Goodfellow et al., Generative adversarial nets, in Advances in neural information processing systems 27, proceedings of the 28th Annual Conference on Neural Information Processing Systems 2014 (NIPS 2014), Palais des Congrès de Montréal, Montréal, Canada, 8–13 December 2014, pp. 2672–2680 [arXiv:1406.2661].
X. Glorot, A. Bordes and Y. Bengio, Deep sparse rectifier neural networks, in proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, U.S.A., 11–13 April 2011, volume 15, pp. 315–323 and online pdf version at http://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf.
D.P. Kingma and J. Ba, Adam: A Method for Stochastic Optimization, arXiv:1412.6980 [INSPIRE].
P. Windischhofer, M. Zgubič and D. Bortoletto, Code repository for the paper “Preserving physically important variables in optimal event selections: A case study in Higgs physics”, (2019) https://github.com/philippwindischhofer/HiggsPivoting.
M. Abadi et al., TensorFlow: Large-scale machine learning on heterogeneous systems, arXiv:1603.04467 and online at https://www.tensorflow.org.
J.S. Bendat and A.G. Piersol, Measurement and analysis of random data, John Wiley, New York U.S.A. (1966).
F. Mosteller and J.W. Tukey, Data analysis and regression, Addison-Weasly (1977).
G. Cowan, K. Cranmer, E. Gross and O. Vitells, Asymptotic formulae for likelihood-based tests of new physics, Eur. Phys. J. C 71 (2011) 1554 [Erratum ibid. 73 (2013) 2501] [arXiv:1007.1727] [INSPIRE].
Open Access
This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
ArXiv ePrint: 1907.02098
Rights and permissions
Open Access . This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Windischhofer, P., Zgubič, M. & Bortoletto, D. Preserving physically important variables in optimal event selections: a case study in Higgs physics. J. High Energ. Phys. 2020, 1 (2020). https://doi.org/10.1007/JHEP07(2020)001
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/JHEP07(2020)001