A Caltech Library Service

Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment

Pol, Adrian Alan and Azzolini, Virginia and Cerminara, Gianluca and De Guio, Federico and Franzoni, Giovanni and Pierini, Maurizio and Siroký, Filip and Vlimant, Jean-Roch (2019) Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment. EPJ Web of Conferences, 214 . Art. No. 06008. ISSN 2100-014X. doi:10.1051/epjconf/201921406008.

[img] PDF - Published Version
Creative Commons Attribution.


Use this Persistent URL to link to this item:


The certification of the CMS experiment data as usable for physics analysis is a crucial task to ensure the quality of all physics results published by the collaboration. Currently, the certification conducted by human experts is labor intensive and based on the scrutiny of distributions integrated on several hours of data taking. This contribution focuses on the design and prototype of an automated certification system assessing data quality on a per-luminosity section (i.e. 23 seconds of data taking) basis. Anomalies caused by detector malfunctioning or sub-optimal reconstruction are difficult to enumerate a priori and occur rarely, making it difficult to use classical supervised classification methods such as feedforward neural networks. We base our prototype on a semi-supervised approach which employs deep autoencoders. This approach has been qualified successfully on CMS data collected during the 2016 LHC run: we demonstrate its ability to detect anomalies with high accuracy and low false positive rate, when compared against the outcome of the manual certification by experts. A key advantage of this approach over other machine learning technologies is the great interpretability of the results, which can be further used to ascribe the origin of the problems in the data to a specific sub-detector or physics objects.

Item Type:Article
Related URLs:
URLURL TypeDescription
Pierini, Maurizio0000-0003-1939-4268
Vlimant, Jean-Roch0000-0002-9705-101X
Additional Information:© 2019 The Authors, published by EDP Sciences. This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Published online 17 September 2019. We thank the CMS collaboration for providing the data set used in this study. We are thankful to the members of the CMS Physics Performance and Dataset project for useful discussions, suggestions, and support. We acknowledge the support of the CMS CERN group for providing the computing resources to train our models. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation (grant agreement no. 772369).
Funding AgencyGrant Number
European Research Council (ERC)772369
Record Number:CaltechAUTHORS:20200310-081419478
Persistent URL:
Official Citation:Anomaly detection using Deep Autoencoders for the assessment of the quality of the data acquired by the CMS experiment. Adrian Alan Pol, Virginia Azzolini, Gianluca Cerminara, Federico De Guio, Giovanni Franzoni, Maurizio Pierini, Filip Siroký and Jean-Roch Vlimant. EPJ Web Conf., 214 (2019) 06008; DOI:
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:101809
Deposited By: Tony Diaz
Deposited On:10 Mar 2020 16:15
Last Modified:16 Nov 2021 18:06

Repository Staff Only: item control page