This document discusses reliability specification and metrics. It describes how to identify types of system failure, estimate costs and consequences, and identify root causes to generate reliability specifications. Types of failures include loss of service, incorrect service, and system/data corruption. Reliability metrics are discussed such as probability of failure on demand, rate of occurrence of failures/mean time to failure, and availability. These metrics provide measurements of system reliability.
Related topics: