Publications

An Investigation into Reliability, Availability, and Serviceability (RAS) Features for Massively Parallel Processor Systems

Kelly, Suzanne M.; Ogden, Jeffry B.

A study has been completed of the Reliability, Availability, and Serviceability (RAS) features necessary for Massively Parallel Processor (MPP) systems. As part of this research, a use case model was built of how RAS features would be employed in an operational MPP system. Use cases are an effective way to specify requirements so that all involved parties can easily understand them. This technique contrasts with laundry lists of requirements, which are subject to misunderstanding because they lack context. As documented in the use case model, the study also considered incorporating system software and end-user applications, as well as hardware, into the RAS system.