Professional Documents
Culture Documents
University
KANBAN AT VANGUARD
An Experience Report
KANBANCASESTUDYSERIES
Contents
Vanguard Fast Facts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Vanguard’s Lean Operating Model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
Empirical Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
Recovering from Scrum Stall . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Scrum Stall Causes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Scrum Stall Effects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Team P: Evolution through Systems Thinking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Avoiding Scrum Stall . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Team Q: Exploring the Critical Success Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Never Experiencing Scrum Stall . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Team C: Attacking Variability at Its Source . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
Team V: How NOT to Shift from Scrum to Kanban . . . . . . . . . . . . . . . . . . . . . . . 13
STATIK through No-Stall Scrum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
Team J: A Scrum Team with Kanban Practices. . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
ESP’s Positive Impact on Kanban Maturity Levels . . . . . . . . . . . . . . 15
Vanguard Examples through the KMM Lens . . . . . . . . . . . . . . . . . . . . 16
Manage Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Make Policies Explicit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Method Selection Trends . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Empirical Measures for Value . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Scientific Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Critical Success Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
Promoting Enterprise Evolutionary Change. . . . . . . . . . . . . . . . . . . . . 17
–2–
Vanguard Fast Facts
–3–
systems thinking. So, the indispens- enable an Agile business?” Essential- 2. Assuming that empiricism is central
able upstream flow receives a lot of ly, this is the evolutionary approach to DevOps fast flow, fast feedback,
attention. of the Kanban Method. and fast learning, how are empirical
3. For “Fast Learning,” Vanguard knows Vanguard tested its Lean Operating measures used to promote continu-
it must continue to evolve its nimble Model for achieving business value at ous improvement in delivering value
digital infrastructure. Our CIO, John startup speed along four dimensions: at a reasonable cost?
Marcante, puts it this way: “We con- 1. If self-organizing teams are essential 3. If the scientific method is applied,
tinue to learn more each day. My ad- to enterprise agility, then what Lean are repeatable experiments confirm-
vice to everyone is to make sure you and Agile method selection trends ing expected results?
don’t get too far ahead of yourself. are observed after twelve years of 4. What critical success factors appear
Start at the foundation—do you have “transformation,” including five to be common at Vanguard despite
nimble enough infrastructure and recent years of focusing on Lean varying performance conditions?
software development practices to principles?
Empirical Results
At Vanguard, we tested the effects of • that hierarchical workflow boards method to actually deliver value
training, kanban board flight levels, stacked in “flight levels” or managed early and often.
time-boxed iterations, and new ways through time-boxed iterations were What really works at Vanguard are the
of thinking and working. Under con- less effective for driving favorable following:
trolled test conditions, we found: outcomes for decision making than • Deep-immersion coaching over
• that training classes and workshops classes of service on a single, larger a firm ninety-day commitment
delivered to full teams for Kan- board; window during which the teams and
ban practices and principles were • and that the so-called new ways of their management people can en-
inconclusive when tested against thinking and working first pio- gage deeply using practical systems
subsequent empirical performance neered by Microsoft Holland to thinking techniques.
for lead time and average delivery foster radical decentralization were • Good, old-fashioned systems analy-
rate of work items; sterile without a Lean or Agile sis with a hearty focus on system
context diagramming and workflow
as well as service class visualization.
• Enterprise Services Planning feed-
back opportunities, as shown in
Figure 2,1 with an immediate com-
mitment to a monthly operations
review.
• A robust commitment to upstream
filtering to eliminate unevenness as
close to the source as possible. This
pays enormous dividends in the
form of fast flow downstream under
WiP limits. This is the “secret sauce”
of success!
–4–
Recovering from Scrum Stall
In aeronautics, stall is defined as a Stall is a metaphor for unevenness stall symptoms, it would be “mediocre”
sudden reduction in the lift generated and over-burdening in the flow of (Figures 4 and 5).
by an aerofoil when the critical angle knowledge work. The symptoms of Scrum Stall Causes
of attack is reached or exceeded (see reduced lift and increased drag are Scrum stall has three causes:
Figure 3). The critical angle of attack seen in scatter plot high population
1. Myopia. This is a team-only ori-
usually is 15 degrees. At the stall, the means and wide standard deviations
entation characterized by wearing
airflow across the upper wing ceases to among samples, as well as in cumula-
blinders and sprinting heads-down,
flow smoothly. When this happens, up- tive flow diagrams depicting sluggish
with limited awareness of upstream
per surface airflow becomes turbulent, average delivery rates with or without
or downstream factors affecting
thus greatly reducing lift and increas- evidence of bottlenecks. If I were to use
end-to-end flow.
ing drag. In short, the plane can’t fly. one word to describe Agile teams with
2. Product Owner Bottleneck. Too
much power is required of this
Scrum role. Scrum’s framework
grew out of new product develop-
ment practices, and the need for this
kind of creative power can lead, in
far too many cases, either to a single
point of team failure or bottlenecked
flow due to incompetence. Where
is “encouraging acts of leadership at
every level” in Scrum?
3. Hortator. This is the Latin name
for a Roman galley cadence keeper
using a drum beat. Hortator means
encourager and exhorter. In Lat-
in, it is a masculine noun. Scrum
sprints can be like “ramming speed”
in the movie Ben Hur. Instead of a
Figure 3 Stall (Reference: https://www.skybrary.aero/index.php/Stall) WiP-limited cadence evolving nat-
urally from the team, as in Kanban,
the Scrum cadence often is imposed
and, therefore, generates unevenness
and over-burdening. Naturally, this
leads to waste.
Scrum Stall Effects
The effects from the three Scrum stall
causes are many, but three stand out at
Vanguard.
–6–
Figure 7 is what simplicity and effec-
tiveness look like in practice for a sys-
tem workflow visualization and classes
of service modeling.
Figure 8 is what simplicity and ef-
fectiveness look like in practice for de-
fining enterprise services planning ca-
dences. This was done in one working
session. Note how existing meetings
are repurposed.
Team P had poor upstream flow
control because of manual process-
es and work “push” that had evolved
willy-nilly over years, as depicted in
Figure 9.
After committing to evolutionary
change using the Kanban Method,
Team P improved upstream flow con-
trol, again using system context dia-
Figure 7 Facsimile of affinity grouping
gramming, as depicted in Figure 10.
leading to class of service modeling
An increase in automation in support
of capacity management was an addi-
tional benefit.
Scrum stall disappeared in the first
ninety days after Team P switched to
Kanban. They accelerated their evo-
lution toward fast flow, fast feedback,
and fast learning by committing to
Kanban’s Enterprise Services Planning
(ESP) approach:
• Operations review with mid-level
management and all teams (new
monthly meeting)
• Replenishment meeting including
mid-level management (repurposed
existing meeting)
• Kanban standup meeting with the
team itself, with enhanced collabo-
ration among parallel teams (repur-
posed, refined existing meeting)
• Strategy review with the financial
system governance board (repur-
posed existing meeting)
• Delivery planning with production
release continuous integration/con-
tinuous delivery (CI/CD) pipeline
technical management (repurposed
existing meeting)
• Risk reviews with the project man-
agement office (PMO)
The operations review and replen-
ishment meetings were the most Figure 8 Facsimile of ESP cadences
–7–
Figure 9 Upstream system for the Scrum stall state
significant contributors to improved At Vanguard, we have evolved Kan- The team’s emphasis on upstream
speed and volume of work flow. The ban practices at the team level by doing filtering and classifying work items en-
standup, strategy review, delivery plan- these things simultaneously. abled them to better “right size” their
ning, and risk meetings evolved from Team P’s new classes of service (Fig-
existing meetings. ure 11) grew out of system context dia-
The only ESP cadence not tried was gramming, upstream optimization, and
The operations review
the service delivery review. However, a fresh look at exactly who their cus- and replenishment
its lack has had no apparent adverse tomers were. Using the JIRA platform,
effect. they automated their class of service meetings were the most
Systems thinking focused everyone’s metrics and simultaneously invested significant contributors
attention on workflow visualization in evolving policies for managing val-
and end-to-end optimization. Imme- ue delivery this way. The cost of delay to improved speed and
diate work type affinity grouping led to
strong classes of service which, in turn,
concept was new to them but, with
practice, they improved their decision
volume of work flow.
led to stronger policies, reduced costs making for service class prioritization
of delay, and a far deeper understand- and WiP limits, resulting in better work items and to define an effective
ing of exactly who the customer is. work item flow. commit point. Explicit policy definition
–8–
continued to improve, resulting in faster
downstream flow (Figure 12).
System lead time improved by 78
percent in approximately ninety days!
Note the eradication of upper control
limit violations, as shown in Figure 13.
Standard deviation still was too high a
percentage of sample mean in relative
terms. In absolute terms, however,
Team P became much more predict-
able, with a bias toward “better than
Figure 12 Realistic work flow
expected.” Team P could now commit
to service levels.
All system variables were kept
constant except for improvements in
workflow modeling—an end-to-end
WiP limit and classes of service with
new, explicit pull policies as well as up-
stream work filters—all driving better
decision making.
Team P repeated their improved
system lead time performance three
months in a row, thus achieving statis-
tical significance (Figure 14).
In this experiment, the dramatic
improvement in system lead time was a
function of upstream work item filter-
ing and class of service definitions. In
other words, Team P directly attacked
Figure 13 Flow metrics
variability as close to the source as
possible, and then reinforced improved
flow evenness starting upstream by
implementing policies to reduce total
WiP downstream. Vanguard has ex-
perienced over and over the beneficial
effects of doing both workflow mod-
eling and service class definition early
and together.
Figure 14 Flow metrics repeated month after month
Figure 15 shows the cumulative
flow diagram (CFD) five months into
the Kanban evolution. Bottlenecks
are obvious, but the elevation rate has
improved over the Scrum stall condi-
tion. Team P applied Eliyahu Goldratt’s
Theory of Constraints directly by doing
bottleneck root cause analysis. Van-
guard’s Kanban teams have taken an-
other page out of Goldratt’s book: they
construct simple, practical solutions.
The root cause of the bottlenecks
was traced to poor labor capacity dis- Figure 15 Flow metrics CFD at five months
tribution.
–9–
PRINCIPLES
The other good Kanban behaviors review kept this important issue right Visualize.
only went so far when bottlenecks were out front for both the team and its
Limit WiP.
rooted in poor capacity distribution managers.
across the labor pool. In other words, Team P has remained a maturity Manage flow.
the tension introduced by WiP lim- level 2 team for managing flow and for Make policies explicit.
its failed to achieve creative slack for making policies explicit (Figure 17).
Implement feedback loops.
Team P. The lack of slack was traced Team Q, presented next, learned
to four critical resources handling from Team P and jumped from matu- Improve and evolve.
60 percent of the work, as depicted rity level 1 to maturity level 3 in just
in Figure 16. The monthly operations 90 days. Figure 17 Team P Kanban
Method feedback loop
– 10 –
Figure 18
Visualizing
the
alternative
path to
Agility
Figure 19
The
power of
a physical
board
– 12 –
illustrates an average delivery rate in- The following factors were focused • Kanban Maturity Model (KMM)
crease of 325 percent, and this really on early and all together: The Kanban Maturity Model was (and
got management’s attention. Indeed, • Big, visible physical Kanban board still is) used as a “map back to actual
this is being written up in a company • Upstream filtering practice.” The KMM is not used as a
newsletter as an example of success. • Focus on bottlenecks “map forward to next steps.”
– 13 –
(Figures 27 and 28) indicate that they
are an example of how not to switch
from Scrum to Kanban.
A mindset of a two-week sprint still
can be observed in the system lead
time metrics, with high unpredict-
ability within the ten-working-day
time frame. This team’s low maturity
workflow state definition still is rooted
in Scrum and not in realistic workflow
Figure 27 Lead time mediocrity
states.
Team V also carried over their old
Scrum sprint states, but instead of us-
ing them as their workflow basis, they
have rendered them as Kanban service
class swim lanes:
• Design
• Analytics
• Test
• Development Figure 28 Cumulative flow mediocrity
Fortunately, the population mean of just
under five days is good. Nevertheless, Since work variability is not at- 1. A new focus on backlog refinement
Team V is stuck in maturity level 1 prac- tacked as close to the source as possi- started a few months ago. Now, they
tices with a low interest in evolving. My- ble, bottlenecks downstream are to be strive for doing it two to three times
opia and the product owner bottleneck expected. per week. Before, the team struggled
are deeply ingrained in how they work. They have improved their flow over to do any backlog refinement because
Team V advertised to management their Scrum performance, yes, but they were spending too much time
a 20 to 30 percent improvement in cu- that flow has once again leveled off. “checking the boxes” on the rest of
mulative flow. Yet, “flares,” indicating a Myopia and the product owner bottle- the Scrum framework.
rate of new work arrival exceeding the neck are deeply ingrained in how they 2. They have evolved a risk review with
team’s rate of delivery, are evident, as work. a ninety-day look-ahead window,
are bottlenecks (Figure 28). There are two bright spots: which is very ESP-like.
– 14 –
and non-value-adding work simply
doesn’t make the cut.
Using Kanban-style fast feedback
metrics, Team J avoided Scrum story
point sizing and velocity metrics alto-
gether. Instead, they have pursued con-
tinuous-flow Scrum using one-week
sprints. Figure 29 shows Team J’s sys-
tem lead time performance at just sev-
en weeks of existence. Note that value
is accepted, on average, every two days.
Team J has an aggressive focus on Figure 29 Team J’s flow metrics—Scrum using Kanban-style
how work enters their system up- system lead time
stream. They have a superb product
vision that drives a ruthless commit-
ment to discarding low-value ideas
very early. They implement a simple
“To Do” board to filter out short-dura-
tion work not of direct, tangible value
to their product vision. They engage in
continuous backlog refinement, usually
two to three times per week.
As you can see from the beautiful
cumulative flow diagram in Figure 30,
team J is approaching one-piece flow.
Team J stands out in Vanguard’s ex- Figure 30 Team J’s flow metrics—Scrum using Kanban-style
periments because they are fully com- cumulative flow diagramming
mitted to evolutionary change through
the Three Ways of fast flow, fast feed- The coach has infused their work habits to abandon the Scrum framework rel-
back, and fast learning. They are egoless. with Kanban’s STATIK (systems think- atively soon so they can switch to con-
They have a deeply embedded Lean-Ag- ing approach to introducing Kanban), tinuous flow work habits and ESP-style
ile coach who also is a hands-on coder. which is laying the foundation for them cadences.
– 15 –
We have learned that physical boards levels of transparency within the team The service delivery review is less
invigorate daily Kanban standup meet- itself, and they attract high levels of at- important. Vanguard’s experiments
ings. They also promote unprecedented tention from outside the team. never established a need for it.
Conclusions
From May 2018 through April 2019, default Scrum method, 40 percent “vot- ed to improve as the Kanban Method
Vanguard tested its Lean Operating ed with their feet” and switched or were is adopted in more areas of Vanguard.
Model under scientifically rigorous planning to switch to Kanban. A related We have no evidence that this is hap-
controls along four dimensions: result was even more interesting: nearly pening—yet. The evidence we do have
1. Lean and Agile method selection all of the Kanban teams that recovered suggests large standard deviations for
trends from Scrum stall exhibited flow metrics these metrics, which are measured
2. Empirical measures used to promote that outperformed all but one of the very month to month. This suggests endem-
continuous improvement in deliver- best Scrum teams. ic unpredictability, but again, no em-
ing value at a reasonable cost Empirical Measures for Value pirical evidence yet has been collected
3. Repeatable experiments At the team level, system lead time and and analyzed.
4. Common critical success factors cumulative flow improved dramatically Scientific Method
Method Selection Trends for the post-Scrum Kanban teams that The scientific method has been applied,
Of the roughly twenty-four teams in the embraced these fast feedback loops. and repeatable experiments have been
sample set in a population of 240 teams, At the enterprise level, the four validated at the team level as well as
all of which were started with Vanguard’s DevOps outcome measures are expect- across teams and business areas.
– 16 –
These four logical performance per- • System context diagramming, which downstream WiP limits for smooth,
spectives provided the framework: leads to . . . fast flow, which results in . . .
1. Teams that recovered from Scrum • upstream filtering to eliminate • prudent deferral of commitment to
stall unevenness as close to the source as the last responsible moment, thus
2. Teams that avoided Scrum stall possible, drives . . . preserving options to pivot either in
3. Teams that never experienced • an aggressive focus on frequently pursuit of emerging value or to cut
Scrum stall and continuously refining the back- out waste.
4. Scrum teams that are successful log; and this fosters . . . And over all of this is Vanguard’s in-
because they adopt the STATIK • good behavior patterns and bet- creasing adoption of the thing that sets
approach along with the Scrum ter ways of thinking about affinity Kanban apart from all other methods
framework grouping of work into service class- —encouraging acts of leadership at
es, giving rise to . . . every level. Vanguard’s culture allows
Critical Success Factors
• explicit policy definition, especially this concept to flourish in more places
Finally, the fourth dimension’s experi-
for visualizing a commit point in a than most other companies of compa-
mental results say it all:
robust visual workflow and applying rable size.
– 17 –
Author
Kanban
University
https://www.edu.leankanban.com
KANBAN AT VANGUARD
An Experience Report
CLASSROOMCOMPANIONSERIES
KANBANCASESTUDY
Kanban
University
KANBAN AT VANGUARD
An Experience Report
CLASSROOMCOMPANIONSERIES
KANBANCASESTUDY
Kanban
University
KANBAN AT VANGUARD
An Experience Report
CLASSROOMCOMPANIONSERIES
KANBANCASESTUDY