Availability in Numbers

Byzackozack|March 3, 2025

Listen as Podcast

00:00 / 00:00

What is availabilty?

Availabilty of an application refers to the percentage of the time application provides operations expected from it, and is accessible to all the users.

Measuring Availability

  • Availability is measured as percentage of uptime over a given period, usually a month or a year.
  • The formulae is typically Availability = (Total Service Time - Downtime) / (Total Service Time)
  • The value is expected to be extremely close to 100% i.e. 99%+, the higher the better..

9s of Availability

Availability is generally represented as the numbers of 9s in the fractional plus integer part of the availabilty (eg. a service with 99.99% availability is described as having four 9s).


The availabilty percentage value is usually very high 99%+ , as any value lesser than that will lead to very significant downtime for the applications.


Here's a representation of how much different availability percentages equals to actual downtime per year.


99.9% availability - three 9s

DurationAcceptable downtime
Downtime per year8h 45min 57s
Downtime per month43m 49.7s
Downtime per week10m 4.8s
Downtime per day1m 26.4s

99.99% availability - four / quad 9s

DurationAcceptable downtime
Downtime per year52min 35.7s
Downtime per month4m 23s
Downtime per week1m 5s
Downtime per day8.6s

Availability of connected components

When multiple components are connected together the availability of the system starts to depend on the components that it's made of.


In your application's architecture, you have a couple of ways to connect components together.

1. Components connected in Sequence

When components are connected in a sequence, the functioning of a component directly depends on response or data of other components.


e.g. Consider a normal 3 tier application, consisting of only 3 servers (frontend, backend and database) where functioning of the application / frontend depends on the response from the backend server and the response from the backend server depends upon the response from the database.


i.e. in this case, if the database is down it will make the whole application unresponsive.


Overall availability of the system decreases when two components with availability < 100% are in sequence.


Availability (Total) = Availability (Foo) * Availability (Bar)


This multiplication occurs because the probabilities of all components being available must be combined.


e.g. If Foo crashes for 3 times a day, and Bar crashes 4 times a day. Your whole system will be unavailable for 3*4 = 12 times a day. The system becomes unavailable if either of the Foo or Bar are down, thus decreasing the availability even more.


If both Foo and Bar each had 99.9% availability, their total availability in sequence would be 99.8%.

2. Components connected in Parallel

When components are connected in parallel, the application depends on multiple copies of each component, rather than a single component, thus removing the direct dependecy between the components.


e.g. Consider the same 3 tier application as before, but there exists a copy of each server in a different AZs.


i.e. in this case, if the database is down, the application will failover to the database hosted in the other AZ, the application will be unresponsive only if databases are down in both AZs.


Overall availability increases when two components with availability < 100% are in parallel.


Availability (Total) = 1 - (1 - Availability (Foo)) * (1 - Availability (Bar))


Here, the formulae is different because this formula calculates the probability that not all components will fail simultaneously. i.e The system's is only unavailable if either system is down, thus making the availability even more than either foo or bar.


If both Foo and Bar each had 99.9% availability, their total availability in parallel would be 99.9999%.

Conclusion

If you want to make your application more available, you can introduce redundency in your architecture.

Calculating availability beforehand helps you efficitvely communicate availability with your team and helps business commit to realistic SLAs.

Application availability is defined as the percentage of time an application functions as expected and is accessible to users. Availability is measured by calculating uptime as a percentage, often expressed in "nines," indicating the level of uptime (e.g., 99.9%). The availability of connected components differs whether they are connected in sequence or parallel; components connected in a sequence decrease overall availability, while components connected in parallel increase it. Redundancy in architecture helps increase application availability. Calculating availability helps with communication and setting realistic SLAs.
Generated by AI