The evidence problem
AI spend is nearly universal. Provable ROI is not.
FinOps Foundation's 2026 State of FinOps reports that 98% of practitioners now manage AI spend — up from 31% just two years earlier. McKinsey's 2025 State of AI finds only 5–6% of organisations seeing clear financial ROI. IBM's 2025 AI in Business study reports that only 29% of executives feel confident measuring AI return. These numbers describe the same problem from three directions: cost is becoming real faster than value is becoming governable.
Why AI needs value management
AI has moved from experimentation into financial, operating, and portfolio governance.
Why AI needs value management
AI is no longer a model-selection problem or a cloud-cost problem. It is a management problem that cuts across SaaS, cloud, data platforms, labour models, software delivery, security, sovereignty, procurement, and strategic portfolio planning — simultaneously, in most large enterprises, with no single discipline responsible for the whole.
The structural consequence is that AI investment accumulates faster than the governance needed to evaluate it. Deloitte's 2025 State of Generative AI in the Enterprise found that only 16% of organisations have fully designed the roles, processes, and operating models required for AI integration. Kyndryl's 2025 survey found 61% of CEOs facing increased pressure to prove AI returns within the year. Both findings point to the same gap: the appetite to invest has outpaced the organisational capacity to govern.
The question has shifted accordingly. Leaders are no longer only asking how much the organisation spends on AI. They are asking which investments deserve to exist, which should scale, which should be optimised, and which are generating cost, risk, or dependency faster than value.
The five layers
The layers are cumulative. Each one depends on cleaner evidence from the layers below it.
The five layers of AI value management
Usage Transparency
Question
What are we spending, where is demand appearing, and who is consuming AI capability?
Evidence Signal
AI cost can be normalised across SaaS, APIs, cloud platforms, private infrastructure, and shared services.
Decisions It Supports
Allocate, forecast, charge back, cap, investigate anomalies, and separate pilots from run-rate demand.
Output Quality
Question
Is the AI producing work that can be trusted, used, and improved?
Evidence Signal
Leaders can see useful output, discarded output, review burden, model fit, and quality risk rather than only call volume.
Decisions It Supports
Route models, tune prompts, strengthen evaluation, set human review thresholds, and stop low-quality use cases.
Productivity Value
Question
Is AI improving the workflow, or creating new work around the workflow?
Evidence Signal
Usage intensity is compared with delivery throughput, rework, cycle time, quality, and the human effort needed to manage AI.
Decisions It Supports
Redesign work, change enablement, compare alternatives, and distinguish capacity release from activity growth.
Delivery Alignment
Question
Is AI effort aligned to funded initiatives, product outcomes, and strategic portfolio commitments?
Evidence Signal
AI spend and effort can be mapped to epics, capabilities, products, value streams, PI objectives, or equivalent planning structures.
Decisions It Supports
Prioritise, sequence, fund, defer, remove duplication, and decide whether experimentation should become delivery.
Portfolio Strategy
Question
Does the total AI portfolio justify its cost, risk, dependency, and opportunity cost?
Evidence Signal
Executives can compare AI investments by strategic fit, value evidence, operating risk, vendor exposure, and human capability impact.
Decisions It Supports
Scale, optimise, contain, replatform, renegotiate, or stop initiatives based on evidence rather than enthusiasm.
The layers are a governance sequence, not a procurement checklist. A leadership team cannot make credible portfolio choices if it cannot see usage. It cannot prove productivity value if it does not know whether outputs are useful. It cannot align AI effort to strategy if delivery systems cannot connect AI activity to funded work.
This sequencing matters because skipping layers is the dominant failure pattern. Organisations that measure portfolio strategy before establishing output quality end up with investment decisions based on adoption stories rather than outcome evidence. Those that measure productivity value before building usage transparency cannot tell whether the productivity claim covers the full cost of the capability producing it. The FinOps Foundation's 2026 data shows governance and forecasting rank higher than optimisation as AI management priorities — which is the profile of an industry still working on layers 1 and 2 before reaching 4 and 5.
How the layers connect to existing disciplines
AI value management should connect established management systems rather than create another silo.
How the layers connect to existing disciplines
AI value management is best understood as a connecting discipline. It does not replace FinOps, ITFM, TBM, SPM, ITAM, security, or platform engineering. It gives them a shared operating question: is AI creating enough value to justify the full cost and risk of the capability?
How AI value management layers connect to enterprise management disciplines
Usage transparency
- Primary discipline
- FinOps, ITFM, procurement, SaaS management
- Evidence needed
- Spend, consumption, user, team, product, vendor, model, and environment data
- Failure pattern
- AI cost becomes visible only after demand is already embedded
Output quality
- Primary discipline
- AI operations, engineering, risk, product management
- Evidence needed
- Acceptance, rejection, rework, evaluation, incident, and review signals
- Failure pattern
- Cheap output is mistaken for valuable output
Productivity value
- Primary discipline
- CIO office, COO office, engineering leadership, HR
- Evidence needed
- Workflow baseline, cycle time, throughput, quality, supervision, and capacity data
- Failure pattern
- Hours saved are counted without showing what those hours become
Delivery alignment
- Primary discipline
- SPM, product operations, programme leadership, TBM
- Evidence needed
- Initiative, epic, value stream, PI objective, product, and funding alignment
- Failure pattern
- AI work grows outside the portfolio choices leaders thought they had made
Portfolio strategy
- Primary discipline
- CFO, CIO, CAIO, CEO, board, risk committee
- Evidence needed
- Value proof, risk posture, vendor exposure, sovereignty, scenario, and opportunity-cost data
- Failure pattern
- AI becomes strategic by narrative, but not by evidence
The full cost stack
The value question is incomplete unless the cost model includes the operating burden around AI.
The full cost stack behind AI value
Most weak AI value cases begin with a narrow cost base: the licence fee or model bill, without the recurring burden needed to make the capability reliable, governed, and useful. IDC warns that large enterprises routinely underestimate AI infrastructure cost as estates scale. The FinOps Foundation finds that more than half of AI tool spending sits outside formal IT budgets. These are not edge cases — they describe the standard enterprise starting position.
A complete cost view covers software subscriptions, usage charges, model calls, embeddings, vector search, cloud and private infrastructure, data preparation, evaluation, security, legal review, change management, platform support, human oversight, energy, and opportunity cost. The AI TCO Framework organises this into seven layers and provides illustrative cost shares for each.
The cost stack changes the meaning of value. A low-cost model is not cheap if it increases review burden or produces unreliable outputs. A high-cost model may be justified if it eliminates human rework in a material workflow. A SaaS AI add-on at thirty dollars per seat creates real economic exposure when no one tracks whether those seats are actively used.
Personal accountability
Enterprise AI value management also needs a human version: reflective habits that help people use AI deliberately.
Personal accountability: the human layer inside every layer
AI value management usually starts at enterprise level, but it cannot stay there. AI changes individual work habits before it changes portfolio economics. If people use AI without reflection, the organisation may gain activity while losing judgement, learning, or quality control.
The useful analogy is not employee monitoring. It is personal habit visibility. Just as a person may track sleep, training, or recovery to understand whether a routine is helping them, knowledge workers need a way to see whether AI is improving their work or quietly changing their dependence on it.
This matters because the most valuable AI behaviour is not maximum usage. It is appropriate usage: knowing when to delegate, when to review, when to challenge the answer, when to learn from it, and when to do the thinking directly.
Board questions
At the highest layer, value management becomes a small number of repeatable executive questions.
Questions every leadership team should be able to answer
- What is our total AI exposure across SaaS, cloud, data centre, private cloud, labour substitution, and shared platform services?
- Which AI investments have evidence of value, and which remain speculative?
- What happens to our economics if usage rises tenfold, quality thresholds require more expensive models, or vendors change price structures?
- Which AI systems are material to customers, employees, regulatory obligations, or operational resilience?
- Where are prompts, completions, embeddings, logs, and agent actions processed and governed?
- Which teams are gaining real capacity, and which are absorbing new review, security, or coordination burden?
- Are we redesigning work around AI, or simply layering AI tools on top of existing work?
- What is the stop, scale, optimise, or contain decision for each material AI initiative?
- Are we protecting the learning path for junior and entry-level roles as easier work becomes automated?
- Is AI improving trust, quality, accountability, and resilience, or weakening them in ways our metrics do not yet show?
How to use the framework
The practical value of the model is deciding what evidence is needed next.
How to use the framework
Start by identifying the layer where the organisation is currently guessing. If leaders cannot see usage, start with instrumentation and allocation. If output quality is unclear, invest in evaluation and review signals. If productivity claims rely on sentiment, build workflow baselines. If AI work is not tied to funded initiatives, connect it to portfolio structures. If the executive team cannot compare investments, create a portfolio review cadence with stop, scale, optimise, and contain decisions.
The goal is not to build the most elaborate measurement system. It is to make the next management decision less dependent on narrative.
Related reading
The AI Value Gap
Start with the core diagnosis: why visibility, accountability, and proof drift apart.
AI Economics Maturity Model
Use the maturity model to assess how far the organisation has progressed beyond fragmented AI activity.
AI Economics KPIs
Choose practical metrics for cost, quality, value, ownership, and portfolio governance.
AI TCO Framework
Pair the value layers with a full view of the cost stack behind AI capability.
FinOps & AI
Understand how usage transparency and unit economics become operating disciplines.
SPM & AI
Connect the higher layers to portfolio sequencing, funding, and benefits realisation.