
Title:
FAULT TOLERANCE IN SUPERCOMPUTER THROUGH DYNAMIC REPARTITIONING
Document Type and Number:
Japanese Patent JP2007220147
Kind Code:
A
Abstract:

To provide fault tolerance in the interconnection networks of a parallel computer through software-controlled dynamic repartitioning.

A multiprocessor parallel computer is made tolerant of hardware failures by providing extra groups of redundant standby processors and by designing the system so that any of these extra groups can be swapped with any group that experiences a hardware failure. The swapping can be performed under software control, so the computer as a whole can sustain the hardware failure and, once the standby processors have been swapped in, still appear to software as a pristine, fully functioning system.

COPYRIGHT: (C)2007,JPO&INPIT
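
The swap described in the abstract above can be illustrated with a minimal sketch. It is not taken from the patent: the class name `Partition`, the method `report_failure`, and the group labels are all hypothetical, and the sketch assumes that software sees only logical group indices while a table maps each index to a physical processor group.

```python
from dataclasses import dataclass, field


@dataclass
class Partition:
    """Hypothetical map from logical processor groups to physical groups."""

    # logical group index -> physical group currently backing it
    mapping: dict
    # standby physical groups held in reserve
    spares: list = field(default_factory=list)
    # physical groups that have been taken out of service
    failed: list = field(default_factory=list)

    def report_failure(self, physical: str) -> int:
        """Swap a standby group in for a failed active group.

        Returns the logical index that was remapped. Software that
        addresses groups by logical index sees no change.
        """
        for logical, current in self.mapping.items():
            if current == physical:
                break
        else:
            raise KeyError(f"{physical} is not an active group")
        if not self.spares:
            raise RuntimeError("no standby group left to swap in")
        self.mapping[logical] = self.spares.pop(0)
        self.failed.append(physical)
        return logical


# Three active groups and one standby group (labels are illustrative only).
part = Partition(mapping={0: "group-A", 1: "group-B", 2: "group-C"},
                 spares=["standby-0"])
part.report_failure("group-B")
print(part.mapping)  # {0: 'group-A', 1: 'standby-0', 2: 'group-C'}
```

The logical indices 0 to 2 stay valid after the failure, which is the sense in which the repartitioned machine would still look fully functional to software. How the physical interconnect is actually rerouted is outside the scope of this sketch.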


Inventors:
CHEN DONG
COTEUS PAUL W
GARA ALAN G
TAKKEN TODD E
Application Number:
JP2007144007A
Publication Date:
August 30, 2007
Filing Date:
May 30, 2007
Assignee:
IBM
International Classes:
G06F11/10; G06F9/46; G06F11/20; G06F9/52; G06F11/00; G06F12/00; G06F12/02; G06F12/08; G06F12/10; G06F13/00; G06F13/24; G06F13/38; G06F15/173; G06F15/177; G06F15/80; G06F17/14; H04L1/00; H04L1/22; H04L7/02; H04L7/033; H04L12/28; H04L12/56; H04L25/02; H05K7/20
Domestic Patent References:
JPS61201365A    1986-09-06
JPS62274454A    1987-11-28
JPH03132861A    1991-06-06
JPH06290158A    1994-10-18
JP2004532447A   2004-10-21
JPH0635872A     1994-02-10
JP2002568482A   2002-02-25
Attorney, Agent or Firm:
Takeshi Ueno
Taneichi Tasa
Yoshihiro Ichii
Hiroshi Sakaguchi