Story Points vs. Function Points (FP): Evaluating the Systemic Risk of Using Team-Relative, Semiquantitative Sizing
| Introduction In software development, regardless of the delivery approach, accurately sizing work is crucial for planning, budgeting, and delivery. Nowadays, product and project teams are most of the time temporary, unlike the 1990s internal development teams with members working together for decades, and sometimes retiring from the same organisation that they joined as university graduates. Two widely discussed approaches are Story Points—a team-relative, semiquantitative Agile metric—and Function Points (FP), a more standardised, objective sizing method. While both have their place, the choice between them becomes critically important when organisations use these metrics for high-stakes decisions, such as hard fixed-price contractual cost estimates. This blog post looks at the Story Points and Function Points, highlighting the systemic risks of misapplication, and why using team-relative measures for contracts can be a recipe for disaster. Story Points: A Team-Relative Estimation Tool Story Points are an Agile estimation technique that originated in Extreme Programming (XP). Teams assign a relative value (e.g., 1, 2, 3, 5, 8) to each user story based on complexity, effort, and uncertainty. Key characteristics include:
Function Points: Objective, Standardised Measurement Function Points (FP) provide a standardised, technology-agnostic way to measure the functional size of software. Developed by Allan Albrecht at IBM in the 1970s, Function Point Analysis (FPA) counts the number and complexity of features delivered to the user, such as inputs, outputs, data files, and interfaces. Key attributes of Function Points:
The Systemic Risk: Using Story Points for Fixed-Price Contracts The Temptation Agile’s popularity—and the ease of assigning Story Points—tempts organisations to use these metrics for more than their intended purpose. Project Managers, Program Managers and procurement teams sometimes attempt to translate Story Points into contractual obligations, using them to estimate costs and set fixed prices for software delivery. The Problem This approach introduces systemic risk on multiple fronts:
When Story Points are used as the basis for hard, contractual commitments:
Why Function Points Work Better for Contracts Function Points sidestep many of these pitfalls:
Best Practices: Choosing the Right Metric for the Right Job
The bottom line Story Points and Function Points each have their place in modern software development. Story Points enable Agile teams’ adaptability and learning, but their subjectivity makes them unsuitable for high-stakes contractual cost estimation. Function Points, while not perfect, offer the objectivity and comparability needed to underpin reliable, fair, fixed-price contracts. Attempting to use team-relative, semiquantitative sizing for contractual obligations introduces systemic risk: cost overruns, legal disputes, and project failure. By respecting the strengths and limitations of each metric, organisations can deliver value, build trust, and avoid the pitfalls of metric misapplication in software development contracts. Question for readers: -What is your experience with using story points or function points in cost estimation and contracts? -Have you encountered challenges or successes with these metrics in real-world projects? Share your thoughts and join the conversation below. |
Scaling Agile Frameworks and Lean Principles: Enhancing Agility or Reintroducing Bureaucratic Waste?
| Introduction As Agile methodologies have matured, organizations of all sizes have sought ways to extend their benefits beyond individual teams. Enter the scaling frameworks designed to bring structure and coordination to Agile practices at the enterprise level. Yet, as companies implement these frameworks, a pressing question emerges: Do scaled Agile frameworks truly enhance organizational agility, or do they risk reintroducing the very bureaucratic waste that Lean principles aim to eradicate? This blog post examines the intersection of scaling frameworks and Lean thinking, weighing their benefits and pitfalls, and considers whether agility is being enhanced or undermined in the pursuit of scale. The Promise of Scaling Frameworks Why Scale Agile? Agile excels at the team level—delivering working software quickly, responding to change, and empowering self-organizing teams. However, large organizations face challenges such as:
Scaled Agile Frameworks: A Brief Overview
Lean Principles: The Pursuit of Waste Elimination Lean, originating from Toyota’s Production System, is built on the relentless pursuit of value and the elimination of waste (“muda”). Its core principles include:
The Tension: Frameworks vs. Waste How Scaling Frameworks Can Enhance Agility
However, as scaling frameworks are implemented, there is a real danger that the pendulum swings too far:
Striking the Balance: Lean-Agile at Scale
Sometimes scaled Agile frameworks can be a good option for managing complexity in large organizations. When thoughtfully applied, they can enhance alignment, transparency, and delivery at scale. However, if adopted blindly or enforced rigidly, they risk reintroducing the very bureaucratic waste that Lean thinking seeks to eradicate. The key is not in the framework itself, but in how organizations use it: as a flexible guide in the pursuit of value and excellence, always with Lean principles as the true north. Question for Readers: -Have you worked in organizations that adopted a scaled Agile framework? -Did the scaled Agile framework enhanced Agility and value delivery, or did it create new layers of bureaucracy? Share your experiences and insights in the comments below. |
Managing Measurement Debt Ethically: Leadership’s Duty to Retire Outdated Metrics
| Introduction In an era dominated by dashboards, KPIs, and data-driven decision-making, organizations are awash in metrics. Yet, just as technical debt accrues when legacy code lingers, “measurement debt” builds up when outdated, irrelevant, or misleading metrics persist in an organization’s reporting ecosystem. These obsolete metrics—once helpful, now useless or even harmful—consume team energy, cloud organizational focus, and compromise transparency. Addressing measurement debt isn’t just a matter of operational efficiency; it’s an ethical responsibility of leadership. This blog post explores the dangers of measurement debt, the ethical imperatives for retiring stale metrics, and strategies for fostering a healthy, focused measurement culture. A thought-provoking question for readers is included at the end. What Is Measurement Debt? Measurement debt refers to the cumulative burden of maintaining, reporting, or acting upon metrics that no longer add value. Just as technical debt slows innovation and increases risk, measurement debt can:
The Ethical Dimension: Leadership’s Duty Transparency and Integrity Ethical leadership requires honest reporting and clear communication. Continuing to track or emphasize metrics that are outdated, irrelevant, or misleading violates the principle of transparency. Stakeholders—whether teams, investors, or customers—trust that reported data reflects current reality, not the ghosts of projects past. Respect for People and Time Every metric reported or reviewed represents hours of collection, analysis, and discussion. Requiring teams to maintain useless metrics wastes precious cycles and signals a lack of respect for their time and expertise. Focus and Alignment Leaders have a duty to maintain organizational clarity. Allowing outdated metrics to persist clouds focus, diluting attention from what truly matters and potentially driving harmful or meaningless behaviours. The Hidden Costs of Outdated Metrics Opportunity Cost Every hour spent updating a useless metric is an hour not spent on improvement, innovation, or customer value. Measurement debt diverts energy from high-impact work to low-impact bureaucracy. Decision Paralysis Overloaded dashboards and conflicting metrics make it harder to discern trends or make timely decisions. Leaders may become paralysed by data noise or misled by irrelevant information. Metric Gaming and Distrust When teams see that some metrics are meaningless, they may begin to question the whole measurement system—or game the numbers to minimize effort. This undermines trust in leadership and in the value of measurement itself. Why Do Outdated Metrics Persist?
Measurement debt is more than a nuisance—it's a leadership and ethical challenge. By proactively retiring outdated metrics, leaders demonstrate respect for teams, uphold transparency, and sharpen the organization’s focus on what truly matters. In a world obsessed with numbers, true excellence lies not in tracking more, but in tracking what matters most. Question for Readers: -Have you experienced the burden of measurement debt in your organization? -How did it affect team morale, focus, or decision-making? -What steps have you seen (or wish you’d seen) to retire outdated metrics? Share your stories below. |
The Ethical Trap of the Cookie-Cutter Frameworks
| The Ethical Trap of the "Agile Industrial Complex": Unpacking the Perils of Cookie-Cutter Frameworks
Ignoring Context and Needs Real agility is about adaptation. But firms in the Agile Industrial Complex often apply the same solution to every client, ignoring:
Incentives to Sell, Not Solve Consulting firms profit from selling frameworks and certifications—not necessarily from the client’s long-term success. This misalignment of incentives can lead to:
A shiny new framework, complete with roles, ceremonies, and artifacts, can create the illusion of progress. But without cultural change and real buy-in, teams may simply go through the motions—"doing Agile" without being agile. This is often dubbed “Agile Theatre.” Ethical Dilemmas for Leaders and Champions Leaders and internal champions may feel pressured to implement what the consultants recommend, even when it conflicts with reality. They may witness:
The Agile Industrial Complex thrives on selling certainty in a world defined by change. But real agility cannot be packaged and sold like a product. It demands humility, context-sensitivity, and a relentless focus on people and outcomes. Consulting firms—and the organizations that hire them—must reject the lure of cookie-cutter solutions in favour of genuine, ethical transformation. Only then can Agile’s original promise be realized: better products, happier teams, and real business value. Question for Readers: Have you experienced an “Agile transformation” driven by external consultants or frameworks that didn’t fit your organization’s needs? What lessons did you learn, and what would you do differently next time? Share your stories and advice in the comments below. |
Statistical Misuse of Ordinal Scales: The Mathematical and Ethical Flaws of Averaging Planning Poker Metrics
| Statistical Misuse of Ordinal Scales: The Mathematical and Ethical Flaws of Averaging Planning Poker Metrics Introduction In Agile software development, metrics like Planning Poker story points are widely used to estimate the size and complexity of work items. These metrics are based on ordinal scales—a type of ranking where the relative order of items matters, but the exact differences between them do not. Despite this, it’s common practice to calculate averages, run regressions, and otherwise apply standard mathematical operations to such data. This statistical misuse isn’t just a technical mistake; it has real-world consequences for decision-making and can cross into the realm of ethical misrepresentation. In this blog post, we examine the nature of ordinal data, why treating it as interval data is problematic, and the ethical implications for teams and organizations. We also provide guidance to help avoid these pitfalls, concluding with a question for readers to reflect on their own experiences. Understanding Ordinal Scales in Agile Contexts What Is an Ordinal Scale? An ordinal scale is a way of ranking items or outcomes according to some criterion, but without specifying the degree of difference between them. For example, a restaurant rating system (poor, fair, good, excellent) or a pain scale (mild, moderate, severe) are ordinal. In Agile, Planning Poker uses a sequence of numbers (often Fibonacci: 1, 2, 3, 5, 8, 13, etc.) to estimate effort, but the gaps between these numbers are not consistent or meaningful in a mathematical sense. Why Do Teams Use Ordinal Scales? Ordinal scales like Planning Poker sequences are practical for group estimation, helping to drive consensus and discussion. They acknowledge the uncertainty and subjectivity inherent in software estimation, allowing teams to quickly rank work items from smallest to largest without worrying about precise measurement. Statistical Misuse: Averages and Regressions on Ordinal Data The Mathematics of Ordinal Data Ordinal data only tells us the order of items, not the magnitude of differences. For example, the difference in effort between a 2-point and a 3-point story is not necessarily the same as between a 5-point and an 8-point story. Treating these numbers as if they are evenly spaced (like real numbers on a ruler) violates the fundamental properties of ordinal data. The Flaws of Mathematical Averages Despite this, many teams and organizations calculate the average story point value for a sprint, or the average velocity across sprints. They may even run regressions to forecast future delivery. However, calculating averages or running arithmetic operations on ordinal data is mathematically unsound because:
Some organizations take it further, applying regression analysis or more complex statistical models to ordinal data. These methods assume interval or ratio-level data, where arithmetic operations are valid. Using them on ordinal metrics produces results that are, at best, spurious and, at worst, drive misguided decisions. Real-World Consequences of Statistical Misuse Poor Decision-Making Relying on mathematically flawed averages or projections leads to poor planning, unrealistic commitments, and ultimately, failed projects. Teams may be pushed to deliver "average" story sizes that are not grounded in reality or pressured to meet forecasted velocities that have no statistical validity. Erosion of Trust When stakeholders realize that the numbers don’t add up—or worse, when projects fail due to flawed metrics—trust in the estimation process and in leadership breaks down. Ethical Implications Misrepresenting ordinal metrics as if they were interval or ratio data is more than just a technical error; it’s an ethical lapse. It can:
Best Practices: Using Ordinal Metrics Responsibly
Ordinal metrics like Planning Poker story points have value when used as intended—to facilitate team discussion and consensus. But applying standard mathematical operations to these numbers is both mathematically invalid and ethically questionable. By respecting the true nature of ordinal data and reporting it with integrity, teams and organizations can avoid misleading themselves and their stakeholders, making better decisions and building greater trust. Question for Readers: Have you encountered situations where averages or advanced analytics were applied to ordinal metrics like story points or Planning Poker estimates? How did it affect planning, transparency, or trust in your teams? Share your experiences and insights below. |





