All guides

The Retrospectives Guide

How to Run a Sprint Retrospective That Actually Changes Things

The five-stage retrospective structure, how to pick a format, anti-patterns to avoid, and how to make sure action items don't die in the doc.

By Estimioo·May 12, 2025·14 min read

The sprint retrospective is the single most important agile ceremony — and the one most teams sleepwalk through. Done right, it's where improvement compounds. Done wrong, it's a 30-minute therapy session that ends with three sticky notes nobody acts on.

This guide walks through the five-stage retrospective structure, how to pick a format that fits your team, the anti-patterns that quietly kill retro effectiveness, and how to make sure action items survive past Friday.

What a retrospective actually is

A retrospective is a structured conversation where a team inspects how it works and identifies one or two specific changes to try. The keyword is tries — retro outputs are experiments, not decisions. Most retros fail because they aim for resolution rather than hypothesis.

It's not a status update. It's not a venting session. It's not a postmortem (those are for specific incidents). It's a recurring inspection of the team's process, with the explicit goal of changing something.

The five-stage structure

Norm Kerth's Agile Retrospectives (still the canonical text, still worth reading) defines five stages every retro should move through:

  1. 01Set the stage — establish psychological safety, agree on the goal
  2. 02Gather data — what actually happened this sprint, factually
  3. 03Generate insights — why it happened, what patterns emerge
  4. 04Decide what to do — one or two specific experiments for next sprint
  5. 05Close the retrospective — confirm action owners, end on appreciation

Skipping stages is the single most common mistake. Teams jump from gathering data straight to deciding what to do without ever asking why. The result: surface-level actions that don't address root causes.

Stage 1 — Setting the stage (5 min)

Before anything else, remind the team of the Prime Directive (also Kerth): Regardless of what we discover, we understand and truly believe that everyone did the best job they could, given what they knew at the time.

This isn't ceremony. It's a contract. Without it, retros become defensive and people stop being honest.

Ask a quick warm-up question — "one word for how you feel about this sprint" — and let everyone answer in turn. You're checking the room's energy and giving every voice an early speaking moment.

Stage 2 — Gathering data (10–15 min)

Pick a format (we'll cover formats below) and let the team add cards privately first, then share. The order matters — discussing before everyone has added their own cards lets one person's framing anchor the room.

Common categories: What went well / What didn't / What confused us / What surprised us. The "confused us" column is underrated — it surfaces things nobody talks about because they assume everyone else understands.

Stage 3 — Generating insights (10 min)

This is the stage teams skip. Once cards are up, vote on which to discuss (three dots per person, applied to the cards they find most important). Then for each top card, ask: why does this happen?

Solutions before understanding produce theater — actions that look productive but don't address the actual cause.

Stage 4 — Deciding what to do (10 min)

For one or two of the top insights, agree on a specific experiment for next sprint. A good action item has:

  • A clear owner — one name, not "the team"
  • A measurable outcome — "reduce code review wait from 2 days to 1 day"
  • A deadline — "end of next sprint"
  • An explicit hypothesis — "we think this will help because…"

Limit yourselves to one or two actions. Five action items mean none of them will get done. One action item that ships is worth more than five that don't.

Stage 5 — Closing the retro (5 min)

Quick round of appreciations — one thing each person wants to thank someone else for. This isn't fluff; it changes how the team carries the conversation into the next sprint.

Then briefly confirm action owners aloud. "Sam owns the PR-review-time experiment, due end of next sprint." Public commitment matters.

Choosing a format

The five-stage structure stays constant; the format you use for gathering data changes. Rotating formats keeps retros fresh. Common ones:

  • Start / Stop / Continue — simplest, best for new teams. Easy to default to and easy to outgrow.
  • 4Ls (Liked / Learned / Lacked / Longed for) — surfaces emotional and structural signals together. Good for mid-stage teams.
  • Mad / Sad / Glad — when team energy is low and you suspect unspoken frustration.
  • Sailboat — what's pushing us forward (wind), what's slowing us (anchors), what risks lie ahead (rocks). Great for retros after a tough sprint.
  • What went well / What didn't / Action items — minimal; works once you've earned trust.

We covered seven formats in detail in Sprint Retrospective Templates Your Team Will Actually Use. Rotate every three to four sprints.

Anti-patterns that kill retros

  • Skipping retros when "nothing happened." Quiet sprints are exactly when calibration is cheap.
  • The same person facilitating every time. Rotate. The facilitator role surfaces blind spots for whoever holds it.
  • No follow-up on last sprint's actions. Open every retro by reviewing last sprint's commitments. If nobody owns follow-up, the team stops trusting the ritual.
  • Retros that become group therapy. Venting once is fine. Three sprints in a row is a signal — escalate to leadership, don't keep relitigating in retro.
  • Manager in the room without trust built. Especially for issues involving the manager's behavior. Consider a peer-only retro before bringing it broader.
  • Action items with no owner. "We should do X" without a name attached means nobody will.

Remote and async retrospectives

For distributed teams, the in-person retrospective doesn't translate directly. Async-first retros have one massive advantage and one real downside.

The advantage: people contribute in writing on their own time, which surfaces voices that get drowned out in synchronous discussion. The downside: synchronous insight generation — the messy, generative back-and-forth — is harder to replicate in async.

A hybrid pattern that works: open the retro board 48 hours before the meeting, let people add cards async, then run a 30-minute sync session focused on stages 3–5 (insights, decide, close). The data-gathering happens before the call.

This is part of a broader pattern we covered in Remote Scrum: 10 Practices That Actually Work.

Following up on actions

An action item that doesn't survive the week is worse than no action item — it teaches the team that retros don't change anything. Three things that help:

  • Add action items to your team's task tracker, not the retro doc
  • Review last sprint's actions at the start of every retro (not the end of the sprint)
  • If an action gets dropped, name it explicitly: "we agreed to X, we didn't do it, why?"

That last point feels uncomfortable. It's also what separates teams that improve from teams that go through the motions.

When retros stop being useful

If retros consistently produce the same insights ("we need better requirements") with no action, something deeper is wrong. Two checks:

  • Is the team empowered to act on what comes up? If not, you're running a complaint forum.
  • Is the retro cadence right? Two-week sprints fit most teams. Some need monthly retros instead.

Don't keep running retros that produce nothing. Pause for a sprint, then restart with a different format and a clear ask: what's one thing this team would change if it could?