A Caltech Library Service

Adversarially Learned Anomaly Detection on CMS open data: re-discovering the top quark

Knapp, O. and Cerri, O. and Dissertori, G. and Nguyen, T. Q. and Pierini, M. and Vlimant, J. R. (2021) Adversarially Learned Anomaly Detection on CMS open data: re-discovering the top quark. European Physical Journal Plus, 136 (2). Art. No. 236. ISSN 2190-5444. doi:10.1140/epjp/s13360-021-01109-4.

[img] PDF - Published Version
Creative Commons Attribution.

[img] PDF - Accepted Version
See Usage Policy.


Use this Persistent URL to link to this item:


We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton–proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb⁻¹ of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the tt̄ experimental signature at the LHC.

Item Type:Article
Related URLs:
URLURL TypeDescription Paper ItemData ItemData
Cerri, O.0000-0002-2191-0666
Dissertori, G.0000-0002-4549-2569
Nguyen, T. Q.0000-0003-3954-5131
Pierini, M.0000-0003-1939-4268
Vlimant, J. R.0000-0002-9705-101X
Additional Information:© The Author(s) 2021. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit Received: 22 September 2020 / Accepted: 11 January 2021. This work was possible thanks to the commitment of the CMS collaboration to release its data and MC samples through the CERN Open Data portal. We would like to thank our CMS colleagues and the CERN Open Data team for their effort to promote open access to science. In particular, we thank Kati Lassila-Perini for her precious help. This project is partially supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant Agreement No. 772369) and by the United States Department of Energy, Office of High Energy Physics Research under Caltech Contract No. DE-SC0011925. This work was conducted at “iBanks,” the AI GPU cluster at Caltech. We acknowledge NVIDIA, SuperMicro and the Kavli Foundation for their support of “iBanks.” Open Access funding provided by CERN. Data Availability Statement: This manuscript has associated data in a data repository. [Authors’ comment: the Delphes datasets is released on Zenodo and available at The CMS Open Data are released on the CERN Open Data portal at]
Funding AgencyGrant Number
European Research Council (ERC)772369
Department of Energy (DOE)DE-SC0011925
SuperMicro CorporationUNSPECIFIED
Kavli FoundationUNSPECIFIED
Issue or Number:2
Record Number:CaltechAUTHORS:20210222-100903670
Persistent URL:
Official Citation:Knapp, O., Cerri, O., Dissertori, G. et al. Adversarially Learned Anomaly Detection on CMS open data: re-discovering the top quark. Eur. Phys. J. Plus 136, 236 (2021).
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:108139
Deposited By: George Porter
Deposited On:23 Feb 2021 17:31
Last Modified:16 Nov 2021 19:09

Repository Staff Only: item control page