CaltechAUTHORS
  A Caltech Library Service

Alert Messaging in the CMS Distributed Workflow System

Maxa, Zdenek (2012) Alert Messaging in the CMS Distributed Workflow System. Journal of Physics: Conference Series, 396 . Art. No. 032074. ISSN 1742-6596. doi:10.1088/1742-6596/396/3/032074. https://resolver.caltech.edu/CaltechAUTHORS:20130320-140600825

Full text is not posted in this repository. Consult Related URLs below.

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20130320-140600825

Abstract

WMAgent is the core component of the CMS workload management system. One of the features of this job managing platform is a configurable messaging system aimed at generating, distributing and processing alerts: short messages describing a given alert-worthy information or pathological condition. Apart from the framework's sub-components running within the WMAgent instances, there is a stand-alone application collecting alerts from all WMAgent instances running across the CMS distributed computing environment. The alert framework has a versatile design that allows for receiving alert messages also from other CMS production applications, such as PhEDEx data transfer manager. We present implementation details of the system, including its Python implementation using ZeroMQ, CouchDB message storage and future visions as well as operational experiences. Inter-operation with monitoring platforms such as Dashboard or Lemon is described.


Item Type:Article
Related URLs:
URLURL TypeDescription
http://dx.doi.org/10.1088/1742-6596/396/3/032074DOIUNSPECIFIED
http://iopscience.iop.org/1742-6596/396/3/032074/PublisherUNSPECIFIED
Additional Information:© 2013 IOP Publishing Ltd. This work was supported by the US CMS Operations Program funded by the US Department of Energy.
Funders:
Funding AgencyGrant Number
Department of Energy (DOE)UNSPECIFIED
Classification Code:PACS: 29.40.Gx; 07.05.-t
DOI:10.1088/1742-6596/396/3/032074
Record Number:CaltechAUTHORS:20130320-140600825
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20130320-140600825
Official Citation:Alert Messaging in the CMS Distributed Workflow System Zdenek Maxa 2012 J. Phys.: Conf. Ser. 396 032074
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:37577
Collection:CaltechAUTHORS
Deposited By: Jason Perez
Deposited On:03 Apr 2013 21:21
Last Modified:09 Nov 2021 23:29

Repository Staff Only: item control page