CaltechAUTHORS
A Caltech Library Service

Workflow management in large distributed systems

Legrand, I. and Newman, H. and Voicu, R. and Dobre, C. and Grigoras, C. (2011) Workflow management in large distributed systems. Journal of Physics Conference Series , 331 . Art. No. 072022. ISSN 1742-6588 http://resolver.caltech.edu/CaltechAUTHORS:20120413-132252697

[img]
Preview
PDF - Published Version
See Usage Policy.

920Kb

Use this Persistent URL to link to this item: http://resolver.caltech.edu/CaltechAUTHORS:20120413-132252697

Abstract

The MonALISA (Monitoring Agents using a Large Integrated Services Architecture) framework provides a distributed service system capable of controlling and optimizing large-scale, data-intensive applications. An essential part of managing large-scale, distributed data-processing facilities is a monitoring system for computing facilities, storage, networks, and the very large number of applications running on these systems in near realtime. All this monitoring information gathered for all the subsystems is essential for developing the required higher-level services—the components that provide decision support and some degree of automated decisions—and for maintaining and optimizing workflow in large-scale distributed systems. These management and global optimization functions are performed by higher-level agent-based services. We present several applications of MonALISA's higher-level services including optimized dynamic routing, control, data-transfer scheduling, distributed job scheduling, dynamic allocation of storage resource to running jobs and automated management of remote services among a large set of grid facilities.


Item Type:Article
Additional Information:© 2011 Institute of Physics. Published under licence by IOP Publishing Ltd.
Classification Code:PACS: 07.05.Tp; 07.05.Kf; 29.50.+v; 84.40.Ua; 02.60.Pn; 89.70.-a; 29.40.Gx
Record Number:CaltechAUTHORS:20120413-132252697
Persistent URL:http://resolver.caltech.edu/CaltechAUTHORS:20120413-132252697
Related URLs:
Official Citation:Workflow management in large distributed systems I Legrand et al 2011 J. Phys.: Conf. Ser. 331 072022
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:30080
Collection:CaltechAUTHORS
Deposited By: Tony Diaz
Deposited On:17 Apr 2012 20:40
Last Modified:26 Dec 2012 15:03

Repository Staff Only: item control page