
Fault-tolerant switched local area networks

LeMahieu, Paul and Bohossian, Vasken and Bruck, Jehoshua (1998) Fault-tolerant switched local area networks. In: Proceedings of the first merged International Parallel Processing Symposium & Symposium on Parallel and Distributed Processing. IEEE Computer Society, Los Alamitos, CA, pp. 747-751. ISBN 0-8186-8403-8. http://resolver.caltech.edu/CaltechAUTHORS:20111215-115455804

Full text is not posted in this repository. Consult Related URLs below.

Use this Persistent URL to link to this item: http://resolver.caltech.edu/CaltechAUTHORS:20111215-115455804

Abstract

The RAIN (Reliable Array of Independent Nodes) project at Caltech focuses on creating highly reliable distributed systems by leveraging commercially available personal computers, workstations and interconnect technologies. In particular, the issue of reliable communication is addressed by introducing redundancy in the form of multiple network interfaces per compute node. When compute nodes have multiple network connections, the question arises of how best to connect these nodes to a given network of switches. We examine networks of switches (e.g. based on Myrinet technology) and focus on degree-two compute nodes (two network adaptor cards per node). Our primary goal is to create networks that are as resistant as possible to partitioning. Our main contributions are: (i) a construction for degree-2 compute nodes connected by a ring network of switches of degree 4 that can tolerate any 3 switch failures without partitioning the nodes into disjoint sets; (ii) a proof that this construction is optimal in the sense that no construction can tolerate more switch failures while avoiding partitioning; and (iii) generalizations of this construction to arbitrary switch and node degrees and to other switch networks, in particular to a fully-connected network of switches.
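The partitioning question raised in the abstract can be explored by brute force on small networks. The following is a minimal sketch, not the authors' construction or proof: it assumes that a compute node survives if at least one of its two switches is alive, and that a network is "partitioned" when the surviving nodes no longer all lie in one connected component. The function names (`ring_links`, `adjacent_attachment`, `is_partitioned`, `max_tolerated_failures`) and the baseline attachment, in which node i connects to neighbouring switches i and i+1, are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def ring_links(n):
    """Switch-to-switch links of an n-switch ring."""
    return [(i, (i + 1) % n) for i in range(n)]


def adjacent_attachment(n):
    """Hypothetical baseline: node i attaches to neighbouring switches i and i+1."""
    return [(i, (i + 1) % n) for i in range(n)]


def is_partitioned(n, links, attachments, failed):
    """True if the surviving compute nodes do not all lie in one component.

    Assumption: a node survives if at least one of its two switches is alive.
    """
    failed = set(failed)
    # Union-find over the switches that are still alive.
    parent = {s: s for s in range(n) if s not in failed}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        if a in parent and b in parent:
            parent[find(a)] = find(b)

    for a, b in links:
        union(a, b)
    # A node whose two switches are both alive bridges them.
    for a, b in attachments:
        union(a, b)

    # Components that contain at least one surviving node.
    roots = {find(s) for pair in attachments for s in pair if s in parent}
    return len(roots) > 1


def max_tolerated_failures(n, links, attachments):
    """Largest k such that no set of at most k switch failures partitions the nodes."""
    for k in range(1, n + 1):
        if any(is_partitioned(n, links, attachments, f)
               for f in combinations(range(n), k)):
            return k - 1
    return n


if __name__ == "__main__":
    n = 8
    print(max_tolerated_failures(n, ring_links(n), adjacent_attachment(n)))
```

Under these assumptions the adjacent baseline is already split by two non-adjacent switch failures, which illustrates why the choice of attachment matters. Any other list of switch pairs can be passed as `attachments` to compare candidate wirings on small rings.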


Item Type: Book Section
Related URLs:
URL | URL Type | Description
http://dx.doi.org/10.1109/IPPS.1998.670011 | DOI | UNSPECIFIED
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=670011 | Publisher | UNSPECIFIED
Additional Information: © 1998 IEEE. Date of Current Version: 06 August 2002. Supported in part by the NSF Young Investigator Award CCR-9457811, by the Sloan Research Fellowship, and by DARPA and BMDO through an agreement with NASA/OSAT.
Funders:
Funding Agency | Grant Number
NSF Young Investigator Award | CCR-9457811
Sloan Research Fellowship | UNSPECIFIED
Defense Advanced Research Projects Agency (DARPA) | UNSPECIFIED
Ballistic Missile Defense Organization (BMDO) | UNSPECIFIED
Other Numbering System:
Other Numbering System Name | Other Numbering System ID
INSPEC Accession Number | 5907589
Record Number: CaltechAUTHORS:20111215-115455804
Persistent URL: http://resolver.caltech.edu/CaltechAUTHORS:20111215-115455804
Official Citation: LeMahieu, P.; Bohossian, V.; Bruck, J., "Fault-tolerant switched local area networks," Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998, pp. 747-751, 30 Mar-3 Apr 1998. doi: 10.1109/IPPS.1998.670011 URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=670011&isnumber=14764
Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 28478
Collection: CaltechAUTHORS
Deposited By: Tony Diaz
Deposited On: 15 Dec 2011 23:30
Last Modified: 15 Dec 2011 23:30
