p50, p95, p99 Latencies

By Pradyumna Chippigiri

February 1st, 2026


p50, p95, and p99 are super important latency metrics for production performance monitoring.


Percentile latency distribution showing the tail latency spike after p90

As you can see in the graph, latency spikes after p90. This is the tail: the 10% of requests that are slower than p90.


So we wouldn't have been able to catch this if we had only looked at the p50 latency.


Even if we had also looked at the p95 latency, we would only have seen that there is a small latency spike, not the full tail latency.


So we look at p50 + p95 + p99 to get a complete picture of the latency distribution.
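To make this concrete, here is a minimal sketch of computing the three percentiles from raw request latencies. It assumes the nearest-rank definition of a percentile (monitoring tools may interpolate or use histograms instead), and the `percentile` helper and the sample latencies are made up for illustration.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Hypothetical latencies in milliseconds (illustrative, not real data):
# 90 fast requests, 8 slower ones, and 2 very slow ones in the tail.
latencies_ms = [20] * 90 + [80] * 8 + [900] * 2

summary = {
    name: percentile(latencies_ms, p)
    for name, p in [("p50", 50), ("p95", 95), ("p99", 99)]
}
print(summary)  # {'p50': 20, 'p95': 80, 'p99': 900}
```

With this sample, p50 looks perfectly healthy, p95 hints that something is a bit slower, and only p99 exposes the really slow requests.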


As a gist, we can say: