Observability Glossary

Mean Time Between Incidents (MTBI)

Linkedin icon
Reddit icon

Mean Time Between Incidents (MTBI) is a metric that measures how often an application experiences outages or incidents. For example, if a system experiences an incident every 10 days on average, then its MTBI is 10 days.

MTBI can be calculated by dividing the total time of observation by the number of incidents that occurred during that period. For instance, if a system has been observed for 100 days and experienced 10 incidents, the MTBI would be 10 days. This metric is essential for building resilient and reliable systems that can effectively handle failures and incidents without disrupting the user experience.

Measuring and improving the MTBI of an application over time enables developers to continiously address issues before they lead to significant downtime or poor user experience.

Explore related concepts
Start resolving issues today.
Without the hassle.