Beyond OKRs: Engineering Goal-Setting Frameworks

When OKRs Are the Wrong Tool

OKRs work brilliantly under specific conditions: a stable enough environment to set meaningful quarterly goals, leadership that tolerates aspirational failure, measurable outcomes, and quarterly planning cadence. The model that John Doerr popularized in Measure What Matters and that Google documents in re:Work assumes exactly those conditions. When they are absent, OKRs become bureaucratic overhead that demoralizes teams and produces gamed metrics.

Here are the contexts where OKRs consistently fail:

Early-Stage Startups

In a pre-product-market-fit startup, priorities shift weekly based on customer feedback, funding conversations, and competitive moves. A quarterly objective set in week 1 is often obsolete by week 4. The team either ignores the OKRs (making them theater) or rigidly pursues them despite changed circumstances (making them harmful). Startups need a framework that adapts on a weekly timescale, not a quarterly one.

Firefighting Teams

Teams in a reliability crisis or heavy maintenance mode spend most of their capacity on incidents and unplanned work. Setting forward-looking OKRs when 70% of your time is reactive sets the team up for failure. They cannot deliver on the OKRs because the incidents consume the capacity, and the failed OKRs add demoralization on top of the existing firefighting stress.

Research and Exploration Teams

ML research teams, R&D groups, and exploratory engineering teams produce their best outcomes unpredictably. The whole point of research is that you do not know the answer in advance. Forcing a research team to commit to key results ("achieve 95% model accuracy") either constrains them to safe, incremental work or sets them up to fail at genuinely ambitious exploration. Research needs a framework built around hypotheses and learning, not committed outcomes.

Continuous-Flow Platform Teams

Platform and infrastructure teams often do continuous improvement work that does not decompose neatly into quarterly objectives. Their value is in steady incremental improvement (faster deploys, fewer incidents, better developer experience) measured by trends, not quarterly targets. Forcing discrete OKRs onto continuous-flow work creates artificial milestones that distort the natural rhythm of the work.

Alternative 1: North Star Metric + Weekly Bets

Best for: fast-moving early-stage teams, growth-focused engineering organizations.

A North Star Metric (NSM) is a single number that captures the core value your product delivers. "Weekly active teams" for a collaboration tool. "Successful API calls" for a developer platform. "Nights booked" for a travel product. The entire organization aligns around moving this one metric.

How It Works for Engineering

Below the NSM are input metrics that engineering can directly influence: page load time, feature adoption rate, onboarding completion rate, API reliability. Each input metric contributes to the NSM through a documented causal hypothesis.

Instead of quarterly objectives, the team runs weekly or biweekly "bets": small, time-boxed initiatives aimed at moving an input metric. "We bet that reducing checkout latency from 800ms to 300ms will increase the conversion input metric by 5%." The bet is scoped to fit a 1-2 week cycle. At the end, you measure: did the input metric move? Did it affect the NSM?

Why It Beats OKRs for Fast Teams

The weekly bet cycle adapts to changing priorities far faster than quarterly OKRs. When customer feedback reveals a new priority, you adjust next week's bet rather than abandoning a quarterly objective. The NSM provides stable long-term direction (it rarely changes) while the bets provide tactical flexibility (they change weekly).

The Trade-Off

North Star + Bets provides less explicit organizational alignment than OKRs. Different teams might run bets that pull in slightly different directions because the framework prioritizes speed over coordination. This is acceptable for small, fast teams (under 30 engineers) where everyone can see what everyone else is doing. It breaks down at scale where coordination matters more.

Alternative 2: Shape Up

Best for: product engineering teams that struggle with the abstraction of outcome-based goals.

Shape Up, developed at Basecamp, replaces quarterly objectives with 6-week cycles. The core concepts:

Appetite, not estimate: Instead of estimating how long a feature will take, decide how much time it is worth (the "appetite"). A small batch is 2 weeks; a big batch is 6 weeks. Then shape a solution to fit the appetite.
Shaping: Before committing, senior people "shape" the work — define the problem, sketch a solution at the right level of abstraction, and identify risks. Shaped work is concrete enough to bet on but loose enough to leave implementation details to the team.
Betting: Leadership bets on shaped work for the next cycle. A bet is a commitment to give a team uninterrupted time to deliver shaped work within its appetite.
The cool-down: Between 6-week cycles, a 2-week cool-down for bug fixes, exploration, and shaping the next cycle's bets.

Why It Beats OKRs for Some Product Teams

Shape Up is concrete. Engineers commit to building specific things, not achieving abstract outcomes. For teams that find OKRs too abstract ("what does it actually mean to improve developer velocity by 20%?"), Shape Up's tangible project commitments feel more natural. The fixed appetite also prevents the scope creep that plagues OKR-driven teams: when the time is fixed and the scope is variable, teams ship something useful within the cycle rather than perpetually extending a project to hit an outcome target.

The Trade-Off

Shape Up focuses on outputs (what you build) rather than outcomes (what changes for users). It provides less strategic alignment than OKRs because it does not force the team to articulate why the work matters in measurable terms. Pair Shape Up with organizational-level outcome metrics to compensate: use Shape Up for execution, use a North Star or organizational OKRs for direction.

Alternative 3: Continuous DORA Improvement

Best for: platform teams, infrastructure teams, and mature delivery organizations focused on flow.

The DORA metrics (deployment frequency, lead time for changes, change failure rate, time to restore service) measure software delivery performance. Instead of setting quarterly targets for these metrics (an OKR approach), the continuous improvement model monitors the trends and pursues steady incremental improvement without fixed quarterly milestones.

How It Works

The team maintains a dashboard of the four DORA metrics plus a few supporting metrics (developer satisfaction, on-call burden, infrastructure cost). They review the trends weekly. When a metric trends in the wrong direction, they investigate and address the cause. When a metric plateaus, they run experiments to break through the plateau.

There are no quarterly targets. The goal is continuous improvement: every metric should trend in the right direction over time, with the rate of improvement varying based on where the bottlenecks are.

Why It Beats OKRs for Flow Work

Platform and infrastructure work is inherently continuous. There is no natural "done" point for reliability or developer experience — you keep making it better incrementally. Forcing this work into quarterly OKRs creates artificial milestones that distort the work. The continuous improvement model matches the natural rhythm: monitor trends, address bottlenecks, improve steadily.

It also avoids the gaming problem. When deployment frequency is an OKR target, teams game it (deploy trivial changes to inflate the number). When it is a monitored trend in service of genuine flow improvement, the incentive to game disappears because the team owns the metric rather than being measured against it.

The Trade-Off

Continuous improvement provides less forcing function than OKRs. Without targets, complacency is a risk: the team monitors the metrics but does not push hard to improve them. Mitigate this with periodic "improvement sprints" where the team picks the weakest metric and runs a focused effort to improve it. The continuous model provides the steady-state; the improvement sprints provide the occasional push.

Alternative 4: Opportunity Solution Trees

Best for: discovery-driven product engineering, research teams, exploratory work.

Opportunity Solution Trees (OST), from Teresa Torres's continuous discovery work, structure goal-setting around a clear desired outcome at the top, branching into opportunities (user needs or problems), then solutions (ideas to address opportunities), then experiments (tests to validate solutions).

How It Works for Engineering

Start with a desired outcome ("reduce time-to-first-value for new users"). Map the opportunities (the problems preventing fast time-to-value): confusing onboarding, slow initial load, unclear next steps. For each opportunity, brainstorm solutions. For each solution, design experiments to validate it before committing engineering effort.

The framework embraces uncertainty. Instead of committing to "build feature X" (which assumes you know X is the right solution), you commit to "explore the onboarding opportunity through these three experiments." The output is learning, which informs the next round of solution exploration.

Why It Beats OKRs for Exploratory Work

Research and discovery work produces its value through learning, not through predetermined outcomes. OKRs force you to commit to results you cannot predict. OST lets you commit to a learning process toward an outcome, which matches the actual nature of exploratory work. You still have a clear desired outcome (the top of the tree) but the path to it emerges through experimentation rather than being committed in advance.

The Trade-Off

OST requires discipline to avoid becoming an excuse for unfocused exploration. The desired outcome at the top of the tree must be clear and the experiments must have genuine learning goals. Without this discipline, OST degrades into "we are exploring" with no accountability. It works best for teams with strong product discovery skills and weak with teams that need external structure.

Choosing the Right Framework

The right framework depends on two dimensions: how predictable is the work, and how continuous is the work. Map your team and pick accordingly:

Team Context	Best Framework	Why
Stable product team, quarterly planning	OKRs	Predictable enough for quarterly goals; benefits from outcome focus
Fast-moving early-stage team	North Star + Weekly Bets	Adapts to weekly priority shifts; lightweight
Product team that finds OKRs too abstract	Shape Up	Concrete project commitments; fixed appetite prevents scope creep
Platform / infrastructure team	Continuous DORA Improvement	Matches the continuous nature of flow work; avoids gaming
Research / ML / exploratory team	Opportunity Solution Trees	Embraces uncertainty; commits to learning, not outcomes
Firefighting / crisis team	None (or minimal)	Capacity is reactive; goal-setting overhead is counterproductive until stabilized

The Hybrid Model

Mature organizations rarely use a single framework across all teams. The most effective pattern I have seen combines organizational-level OKRs with team-level execution frameworks:

Company and CTO level: OKRs that set strategic direction. "Reduce customer churn by improving product reliability and performance."
Product engineering teams: Shape Up cycles or North Star bets, aligned to the organizational OKRs. The team chooses what to build to contribute to the organizational outcomes.
Platform teams: Continuous DORA improvement, with the understanding that better delivery infrastructure serves the organizational OKRs indirectly.
Research teams: Opportunity Solution Trees, with desired outcomes derived from the organizational OKRs.

The organizational OKRs provide alignment (everyone knows what outcomes the company is pursuing). The team frameworks provide execution flexibility (each team works in the way that fits its nature). This is the maturity: aligning on outcomes while allowing flexible execution, rather than forcing one framework everywhere.

How to Transition Away from OKRs

If your team is using OKRs and they are not working, do not abandon them abruptly. The transition:

Diagnose why OKRs are failing. Is the environment too unstable (early-stage)? Is the work too continuous (platform)? Is it too exploratory (research)? Is the team in firefighting mode? The diagnosis determines the alternative.
Pilot one team. Pick the team where OKRs are most clearly failing and try the matching alternative for one quarter. Measure whether the new framework improves focus, morale, and delivery.
Keep organizational OKRs. Even if individual teams move to other frameworks, maintain company and CTO-level OKRs for strategic alignment. The teams' frameworks should connect to these organizational outcomes.
Expand based on results. If the pilot works, extend the matching framework to similar teams. Do not force every team to the same alternative — match the framework to each team's context.

The Core Principle

Goal-setting frameworks are tools, not religions. OKRs, North Star Metrics, Shape Up, DORA continuous improvement, and Opportunity Solution Trees each work well in their fitting context and poorly outside it. The mistake is not choosing the wrong framework — it is choosing any framework dogmatically and forcing it onto work that does not fit.

The best CTOs I know are framework-pluralists. They understand the conditions under which each framework works, they diagnose their teams' contexts honestly, and they match the framework to the work. They maintain organizational alignment through high-level outcome goals while allowing teams the flexibility to organize their execution in the way that fits their nature.

OKRs are not outdated. They are over-applied. The 2026 maturity is knowing when to use them and when to reach for something else.

Related Guides

Engineering OKRs That Work

Real OKR examples and the anti-patterns that make most engineering OKRs useless.

Technical Debt Prioritization

The framework for deciding which debt to pay down first.

Technical Debt in the AI Era

Managing the new categories of debt created by AI coding tools.

Frequently Asked Questions

Why do OKRs fail for some engineering teams?

OKRs fail when they are imposed without the conditions that make them work: a stable enough environment to set quarterly goals, leadership that tolerates aspirational failure, and metrics that are not gameable. Early-stage startups fail with OKRs because priorities shift weekly, making quarterly objectives obsolete by week 3. Teams in firefighting mode fail because all capacity goes to incidents, leaving nothing for OKR work. Highly creative or research-oriented teams fail because their best outcomes are unpredictable and cannot be captured in advance as key results. For these contexts, a different framework works better.

What is the best alternative to OKRs for engineering teams?

There is no single best alternative. The right framework depends on your context. For fast-moving early-stage teams, North Star Metric + weekly bets works better than quarterly OKRs. For teams focused on flow and continuous improvement, the DORA metrics with a continuous improvement mindset (no fixed targets, just trend monitoring) is more natural. For research-oriented teams, OST (Opportunity Solution Trees) captures the exploratory nature of the work. The key is matching the framework to the predictability and nature of the work, not adopting a framework because it is fashionable.

Are OKRs outdated in 2026?

No, but they are over-applied. OKRs remain excellent for organizations with quarterly planning cycles, stable-enough environments, and outcomes that can be measured. The problem is that OKRs became the default for every team regardless of fit. The 2026 trend is not abandoning OKRs but matching goal-setting frameworks to team context: OKRs for predictable delivery teams, continuous-flow metrics for platform teams, bet-based frameworks for exploratory teams. The maturity is in choosing the right tool, not in rejecting OKRs wholesale.

What is the North Star Metric framework?

A North Star Metric (NSM) is a single metric that captures the core value your product delivers to users (e.g., "weekly active teams" for a collaboration tool, "nights booked" for a travel platform). The framework focuses the entire organization on moving one number, with input metrics that contribute to it. For engineering, the NSM approach works well when paired with weekly or biweekly "bets": small, time-boxed initiatives that aim to move an input metric. It is lighter weight than OKRs and adapts faster to changing priorities, which suits fast-moving teams.

How does Shape Up compare to OKRs for engineering goal-setting?

Shape Up (from Basecamp) is a project-shaping and scheduling methodology, not strictly a goal-setting framework, but it replaces OKRs for some teams. Instead of quarterly objectives, Shape Up uses 6-week cycles with "appetite-based" scoping: decide how much time a problem is worth, shape a solution to fit, and bet on it. It works well for product engineering teams that struggle with the abstraction of OKRs and prefer concrete project commitments. The trade-off: Shape Up provides less organizational alignment than OKRs because it focuses on what to build, not what outcomes to achieve.

Can you mix OKRs with other frameworks?

Yes, and most mature organizations do. A common pattern: organizational OKRs at the company and CTO level (providing strategic alignment), with teams choosing their own execution framework (Shape Up for product teams, continuous DORA improvement for platform teams, bet-based for exploratory teams). The organizational OKRs answer "what outcomes are we pursuing as a company"; the team frameworks answer "how does this team organize its work to contribute." Forcing every team to use the same framework is the mistake. Aligning on outcomes while allowing flexible execution is the goal.

Published Invalid Date · Updated Invalid Date

Thomas Prommer Technology Executive — CTO/CIO/CTAIO

These salary reports are built on firsthand hiring experience across 20+ years of engineering leadership (adidas, $9B platform, 500+ engineers) and a proprietary network of 200+ executive recruiters and headhunters who share placement data with us directly. As a top-1% expert on institutional investor networks, I've conducted 200+ technical due diligence consultations for PE/VC firms including Blackstone, Bain Capital, and Berenberg — work that requires current, accurate compensation benchmarks across every seniority level. Our team cross-references recruiter data with BLS statistics, job board salary disclosures, and executive compensation surveys to produce ranges you can actually negotiate with.

Profile LinkedIn Newsletter

Beyond OKRs: Goal-Setting Frameworks for Engineering Teams in 2026

When OKRs Are the Wrong Tool

Early-Stage Startups

Firefighting Teams

Research and Exploration Teams

Continuous-Flow Platform Teams

Alternative 1: North Star Metric + Weekly Bets

How It Works for Engineering

Why It Beats OKRs for Fast Teams

The Trade-Off

Alternative 2: Shape Up

Why It Beats OKRs for Some Product Teams

The Trade-Off

Alternative 3: Continuous DORA Improvement

How It Works

Why It Beats OKRs for Flow Work

The Trade-Off

Alternative 4: Opportunity Solution Trees

How It Works for Engineering

Why It Beats OKRs for Exploratory Work

The Trade-Off

Choosing the Right Framework

The Hybrid Model

How to Transition Away from OKRs

The Core Principle

Related Guides

Engineering OKRs That Work

Technical Debt Prioritization

Technical Debt in the AI Era

Frequently Asked Questions