SLE vs. SLA: What Engineering Teams Get Wrong
Most engineering teams have heard of SLAs. Fewer have heard of SLEs. Almost none use them correctly. The confusion between these two concepts is one of the most common β and most costly β misunderstandings in software delivery management. Getting this distinction right is the difference between teams that can predict delivery outcomes and teams that are perpetually surprised when things take longer than expected.
What an SLA actually is
A Service Level Agreement is a contractual commitment between a service provider and a customer. It defines the minimum acceptable performance standard β typically around uptime, response time, or resolution time. SLAs carry legal and financial consequences. If you breach an SLA, there are penalties: credits, refunds, or contract terminations.
SLAs are external-facing. They exist to protect the customer and create accountability for the provider. They are negotiated, documented, and enforced. Most engineering teams encounter SLAs in the context of infrastructure β cloud provider uptime guarantees, support ticket response windows, or API availability commitments.
What an SLE is β and why it matters more
A Service Level Expectation is an internal commitment a team makes to itself about how long work items should take to complete. Unlike an SLA, an SLE is not a contract β it is a data-driven target derived from the team's own historical delivery performance. Specifically, an SLE is defined as a cycle time threshold at a given percentile.
For example: "85% of our work items should complete within 12 days." This is not a guess or a management-imposed deadline. It is a statistical statement derived from the team's actual cycle time distribution. If the team has been completing 85% of items within 12 days historically, then 12 days at the 85th percentile becomes their SLE. Any item that exceeds this threshold is statistically anomalous and worth investigating.
The power of the SLE is that it gives teams a self-calibrating mechanism for managing delivery risk. Instead of asking "will this be done by Friday?" β a question that invites optimism bias and political pressure β teams ask "is this item aging past our SLE threshold?" The answer is objective, data-derived, and decoupled from individual blame.
Why teams confuse SLEs with SLAs
The confusion usually happens in one of two ways. First, teams treat SLAs as if they apply to internal delivery. They set hard deadlines on work items and treat every missed date as a failure, regardless of context. This creates a culture of defensive estimation where teams pad timelines to avoid looking bad rather than using data to understand their actual delivery capability.
Second, teams hear about SLEs and implement them as rigid rules instead of probabilistic expectations. They set a 12-day SLE and then treat every item that takes 13 days as a crisis. This misses the point entirely. An SLE at the 85th percentile means that roughly 15% of items will naturally exceed the threshold. That is expected behavior, not a failure. The question is whether the rate of SLE violations is increasing β which signals systemic degradation β or stable, which signals a healthy system operating within its natural variance.
How SLEs connect to flow metrics
SLEs do not exist in isolation. They are the output of a healthy flow system. A team's ability to meet its SLE consistently depends on three upstream conditions: controlled work in progress, stable throughput, and low blocked-item ratios. When any of these degrade, cycle times increase and SLE violations spike.
This is why SLE integrity is one of the six dimensions we evaluate in every delivery assessment at The Agility Doctor. We do not just check whether a team has defined an SLE β we measure whether they are meeting it, whether it is calibrated to recent data, and whether violations are being used as actionable signals rather than ignored or explained away.
Teams that track flow metrics without defining an SLE are collecting data with no decision framework. The metrics tell you what is happening. The SLE tells you whether what is happening is acceptable. Without that threshold, cycle time is just a number β interesting but not actionable.
Setting an SLE that works
The process for setting an SLE is straightforward. Pull your team's cycle time data for the past 60 to 90 days. Plot the distribution. Identify the 85th percentile β the cycle time at which 85% of items completed. That number becomes your SLE. Some teams use the 70th or 95th percentile depending on their risk tolerance and the nature of their work, but the 85th percentile is the most common starting point because it balances predictability with practical tolerance for variance.
Review the SLE regularly β every two to four weeks is typical. As the team improves its flow practices, the SLE should tighten naturally. If it is not tightening, something in the system is preventing improvement, and that is worth investigating.
The Kanban Guide defines the SLE as a core element of a mature delivery system. It is not optional or nice-to-have β it is the mechanism by which teams make and manage delivery commitments using real data instead of gut feel.
The operational difference
When a team has a well-defined SLE, their daily standup changes. Instead of going around the room asking what everyone is working on, the team looks at which items are aging past the SLE threshold and focuses attention there. This is a fundamentally different operating model β proactive risk management instead of reactive status reporting.
Sprint planning changes too. Instead of committing to a fixed scope based on velocity estimates, teams can use their SLE to make probabilistic forecasts. "Based on our current SLE, there is an 85% chance these items will complete within the next two weeks." This gives stakeholders a realistic expectation grounded in data rather than a promise that the team may not be able to keep.
The shift from SLA-thinking to SLE-thinking is the shift from external pressure to internal discipline. SLAs say "you must deliver by this date or face consequences." SLEs say "based on our data, here is what we can realistically commit to β and here is how we will know if something is going wrong early enough to intervene."
Start here
If your team does not have an SLE defined, you are managing delivery without a compass. The fastest way to understand where you stand is to run a diagnostic that evaluates your SLE integrity alongside five other delivery dimensions. Our free Lite assessment generates a maturity band in under two minutes and shows you exactly where SLE integrity fits into your overall delivery health. See how the scoring works or start your assessment now.