Angus Knight Group

Data & AI The Data Playbook

Strategy, governance, architecture, quality, AI and adoption - built for Angus Knight Group.

Strategy Governance Architecture Quality AI & Analytics People Measurement

Empowering you to write your story. akguk.co.uk

How to use the AKG Data & AI Playbook

Eight parts, one thread. Built specifically for Angus Knight Group. Each part builds on the last. You can read it cover to cover or jump to the part that matches your current problem. The only rule: if something in your chosen part feels abstract, the answer is usually in an earlier one.

Part 1

Strategy & Foundations

Why data programmes exist, what principles hold them together, and how the operating framework makes ownership visible.

Part 2

Governance & Accountability

The eight-step sequence, the three roles that get confused, the shape of the team, and the 24-month roadmap.

Part 3

Architecture & Engineering

The layered platform model, stack vs platform, the governed data stack, and how to choose between mesh, fabric and lakehouse.

Part 4

Quality, Trust & Observability

The five pillars, the quality framework, the observability cycle, and data contracts that make change safe.

Part 5

Metadata, Privacy & Lifecycle

The data lifecycle, the metadata stack, privacy classification, and GDPR's seven principles.

Part 6

AI & Analytics Enablement

The AI maturity ladder, governance landscape, model lifecycle, and the BI journey from raw data to decision.

Part 7

People, Culture & Adoption

Data literacy, culture, the two mindsets for building a team, and how to communicate findings to different audiences.

Part 8

Measurement & Maturity

The maturity pyramid, metrics by level, the KPI hierarchy, and how to prove return on investment credibly.

Where to start if you only have 20 minutes

New to the programme

Start with the case for change

Interlude - The Decision Test
1.1 - Why data matters
1.3 - Strategy on one page
8.1 - The maturity pyramid

Setting up governance

Start with ownership

2.1 - Eight-step implementation
2.2 - Three roles
2.3 - Owners and stewards
2.5 - 24-month roadmap

Preparing for AI

Start with the foundation

6.1 - AI maturity ladder
4.1 - Five pillars of quality
4.4 - Data contracts
6.3 - Model lifecycle

Interlude

The Decision Test

You've sat in the meeting where someone asks for more data and no one asks what for. This is the question that ends it.

You've sat in the meeting where someone asks for more data and no one asks what for. This is the question that ends it - and why it's so hard to ask.

Someone senior leans back and says, I think we need to be capturing a lot more data here. Heads nod. It sounds responsible - prudent, even. And the thing you're not supposed to say starts to form, and you swallow it, because saying it out loud would make you the person who's against data: more data for what, exactly?

Nobody asks. So the field gets added, the pipeline gets built, the table gets wider - and eighteen months later it's still there, unqueried and unowned, quietly aging into a compliance problem nobody volunteers to touch. It was never used for anything, because there was never anything it was for. It just felt safer to collect it than to be the one who said no.

That's the whole problem, and it has a cheap fix: one question, asked before anything gets built. Not how much can we get? but what would we do differently if we had it? If someone can answer - we'd intervene with that cohort earlier, we'd reassign that caseload - then go, build it, it's worth every penny. And if the room goes quiet, you've just saved yourself eighteen months and a compliance problem. The quiet is the answer.

It's the same question everywhere, only asked at different moments. A dashboard nobody acts on, a metric that has never once changed what someone did, a field you're keeping because deleting it feels riskier than holding it - they're all the same thing: stuff you pay to keep that isn't earning its place. Ask the question early, before you collect, and you never pay the cost. Ask it late, of what you've already got, and at least you find out how much of it to switch off.

How the filter works

The question only does anything if you actually ask it - out loud, every time, before the building starts. That's all the diagram is: one gate, and the two things that can come out the other side.

Asking it without being a pain about it:

Ask before you build, not after. Once the pipeline exists, the effort already spent argues for keeping it. The only cheap moment to say no is before anything's been made.

A good answer is a specific thing someone would do. We'd reprice these accounts. We'd reroute this stock. If they can tell you the action, you're finished - build it, and gladly.

"It might be useful one day" is a no. So is "good to have." Those aren't decisions, they're hopes - and the data sits there costing money against a decision that never arrives.

Notice who has to do the answering. The person asking names what they'd do. You stop having to argue why you don't need something - which is an argument you can never really win.

The pitfallSomeone asks for "a fuller picture of the customer," so forty fields go onto the table. A year later you check: nineteen of them have never been queried, not once. But they're in the backups now, in the access reviews, in the privacy assessment - and pulling them back out is its own project nobody will fund.

The practiceMake them name what they'd do with the field before it goes on, not after. The question costs you one slightly awkward moment in a meeting. The field costs you every quarter, for as long as you keep it.

If nobody can say what they'd do with it, you're not collecting an asset. You're volunteering for a cost.

· The "more data" request

It comes back every quarter, because asking for more data always sounds responsible and saying no never does. The question doesn't make you the difficult one - that's the point of it. It just moves the awkward moment onto the person who should have been able to say what it was for.

How the conversation actually goes:

Ask what they'd do, not what they want. What would change if you had it? is a different question from do you want it? - and only one of them has a useful answer.

Let it go quiet. The honest answer is often a pause, and the pause is the result: the decision was never there. You don't have to rescue the silence. Let them sit in it.

When the answer's good, stop testing and build. We'd stop chasing the accounts that always churn. Done. The test is a gate, not a toll booth - once they're through it, you're on their side.

Sometimes they don't want data, they want cover. They could make the call today, but they'd like a number to point at if it goes wrong. Worth saying so, kindly - the data isn't the thing standing in the way, the decision is.

Write the question down as the rule. Then saying no isn't you being awkward, it's just the process - which is the only version of it that survives you having a bad week.

The pitfall"Can we capture more of the customer journey?" comes up at every planning meeting. Nobody can say for which decision, but nobody wants to be the one blocking "more data," so it goes on the roadmap again. Three roadmaps later the tracking has doubled and not one decision has changed.

The practiceNext time it lands, ask what they'd do differently with it. If the room goes quiet, that isn't you being obstructive - it's the room answering the question. Spend the quarter on something that can name a decision.

Nobody really wants more data. They want to feel sure. Find the decision and you usually find they needed far less than they asked for.

Everything after this - governance, architecture, quality, all of it - is machinery for making decisions you can stand behind. Worth being clear on what you're deciding before you go and build the machine. Part 2 starts with who gets to decide.

Part 1 of 8

Strategy & Foundations

Before any tool is chosen or any pipeline is built, a data programme needs a reason to exist, a few principles it will live by, a strategy that fits on one page, and an operating model that makes ownership visible. This part covers all four, in that order.

1.1 · Why data matters: the business case

Most data programmes are sold on tools. A new warehouse. A streaming engine. A catalog. A purchase order goes through, dashboards multiply, and a year later leadership asks the uncomfortable question: where is the value?

The honest answer is almost always that the buyer skipped what sits beneath the waterline. Tools are the visible tip of the iceberg. Strategy is the mass underneath: the problem worth solving, the value proposition, the people who will do the work, the systems that will support them, the cost it takes, and the outcomes that justify it.

A useful test before any procurement decision: can you describe the nine elements below the waterline for the use case in front of you? If you cannot, the tool is not the right next step.

The pitfallThe data team built a sophisticated platform nobody asked for. When the programme director asked "what did this get us?", the only answer was a list of pipelines and dashboards nobody could name.

The practiceLead with the business problem the data solves. Tie every layer of investment to a visible outcome someone in the business actually cares about.

Tools live above the waterline. Strategy lives beneath it. If you cannot name the problem, the people, and the outcome, the tool will not save you.

1.2 · Principles: what 'good' looks like

Principles are how a programme stays coherent when the people running it change. They are tool-agnostic. They give engineers, analysts, and leaders a shared answer to "why are we doing it this way?" The ten below are adapted from IBM's modern data architecture work and are the most useful we have seen in practice.

How to use these principles:

Print them. Put them on the wall of the room where data decisions get made.

Reference them by number in design reviews. "This proposal conflicts with #3 and #7" is a faster conversation than re-arguing first principles every time.

Revisit them annually. Two or three will need refining as the programme matures; the rest will hold.

The pitfallA company adopted ten data principles from a consulting deck. They were printed, framed, and ignored, because nobody could say what any of them changed about a Tuesday.

The practiceAdopt only the principles you will actually enforce. A principle that doesn't change a decision fails the Decision Test, cut it.

AKG in practice The five principles for data at AKG

Single version of the truth

One agreed definition wins. Local workarounds and report-level overrides are not permitted.

Definitions before metrics

Agree what a thing means before you start counting it. Metrics built on undefined terms produce noise, not insight.

Governance before tooling

A new platform does not fix an ownership problem. Get the decision rights right first.

Business meaning over technical convenience

Data structures should reflect what the business means, not what is easiest to implement in Snowflake.

Change is controlled, not improvised

All changes to definitions, metrics or data models go through HALO. No undocumented logic, no quiet fixes.

1.3 · Your data strategy on one page

Long strategy documents tend to die in shared drives. A one-page version survives, and more importantly, it can be reviewed. The seven steps below sequence the work in the order that compounds.

A few notes on the sequence:

Steps 1 - 3 are diagnostic. Skipping them is the single most common reason data programmes lose momentum in year two.

Step 4 is where the first real value lands. Pick a use case where the decision is obvious and the data is roughly available, not the hardest problem in the company.

Step 5 (govern) earns its place after step 4. Governing data nobody uses yet is theatre.

Steps 6 - 7 are how a one-off win becomes a programme.

The pitfallThe data strategy was a 40-slide deck. Six people had read it, three disagreed about what it meant, and the team kept building whatever felt most urgent that week.

The practiceGet the strategy onto one page anyone can hold in their head. If it doesn't fit on a page, it won't fit into a decision.

Goal first, tools later. Start tiny. Share results in plain language. Repeat what works; drop what doesn't.

Visual The seven-step strategy sequence

Define the problem worth solving

What decision will be made differently? Who makes it? What does good look like?

Diagnostic

Audit current data and capability

What do you have? What is trusted? Where are the gaps between now and the decision you need to make?

Diagnostic

Identify the highest-value use case

Pick the problem where the decision is obvious and the data is roughly available. Not the hardest problem.

Diagnostic

Deliver a visible win

Ship something real. A working dashboard, a trusted metric, a decision made differently. This earns the mandate.

Deliver

Govern what you built

Now add ownership, definitions, quality checks. Governing data nobody uses yet is theatre. Earn it first.

Deliver

Scale what worked

Repeat the pattern across more domains. Use the first win as proof of the model, not as a one-off.

Scale

Measure and improve

Track maturity, report on decisions affected, prove ROI. The programme that cannot measure itself cannot improve itself.

Scale

1.4 · The operating framework

Strategy says what to do. The operating framework says who decides, who does, and who is accountable when things go wrong. The picture below is the cleanest single view we have of how the layers fit together.

Reading the rings, from the centre outward:

Data Foundation; the raw material. Source systems, master data, transactions, events. Most companies inherit this layer; few control it as well as they think.

Data Management, what turns raw data into something usable. Quality, metadata, lineage, cataloging. This is where governance does its actual day-to-day work.

Decision Authority, who has the right to define a metric, change a definition, or override a control. This is the layer most often missing and most often the cause of conflict.

Analytics & AI; the visible output: dashboards, advanced analytics, AI models, RAG applications. None of it is reliable without the layers below.

The four corner boxes are enablers, not afterthoughts:

Technical Enablement; the platforms and access models that make the rings physically possible.

Business Outcomes, what the programme is being held to. Trusted metrics, faster decisions, safe AI adoption.

Operating Model; the human structure: roles, responsibilities, governance cadence, stewardship.

Risk & Control, regulatory, audit, ethics, bias. Increasingly non-optional as AI scales.

The pitfallStrategy said "become data-driven." But nobody owned it, no cadence reviewed it, and no budget backed it. A year later it was the same slide with the date changed.

The practiceWire the strategy into an operating model, named owners, a review cadence, and funding. A strategy without an operating model is a wish.

If ownership is unclear, nothing above it scales. Governance is decision clarity, not documentation.

One short idea before the machinery starts. It's the one the rest of the book quietly leans on, so it's worth the page; the interlude that follows, The Decision Test.

End of Part 1 sample

This is the first of eight parts. The remaining parts will cover: Governance & Accountability; Architecture & Engineering; Quality, Trust & Observability; Metadata, Privacy & Lifecycle; AI & Analytics Enablement; People, Culture & Adoption; and Measurement & Maturity. An appendix with reference grids, checklists, and source attributions will follow.

Part 2 of 8

Governance & Accountability

Governance is what turns a strategy into something that survives change of personnel, change of tools, and change of mood. Done well it is invisible, decisions happen quickly because everyone knows who decides. Done badly it becomes a folder of policies nobody reads, and an inbox of approval requests nobody acts on. This part covers the five things to get right: the implementation sequence, the three roles that get confused, the people who actually do the work, the shape of the team, and the multi-year roadmap.

2.1 · An eight-step implementation playbook

Governance programmes fail in predictable ways. The most common failure is starting with policy instead of with the business case. The second most common is starting with the framework instead of with the sponsor. The eight steps below sequence the work so that each one earns the right to the next.

Three notes on how to run this sequence:

Steps 1 and 2 are sales, first to the wider business, then to a single named executive. If you cannot complete step 2, do not proceed to step 3. Without a sponsor, the programme will be killed by the first conflict.

Step 4, benchmarking maturity, is also a politically useful artefact. It gives leadership a tangible "where we are now" and "where we want to be in 18 months" without anyone losing face. We return to maturity in Part 8.

Step 7 is where most programmes go wrong. Guardrails should empower decisions, not block them. If your governance documents are a list of things you can't do, you'll be ignored within a quarter. Write what you can do, and what the path is to do something new.

The pitfallA team spent six months writing a beautiful governance policy. It lives on SharePoint. Nobody has opened it since the launch email, and the same quality issues keep recurring, because no policy ever changed what anyone actually does.

The practiceStart with one painful, visible problem. Fix it with a lightweight rule that changes a real decision. Point to that win when you ask for the mandate to do more.

Govern at the level of decisions, not at the level of documents. If a policy doesn't change a decision, it doesn't exist.

2.2 · Three roles that keep getting confused

Most governance friction comes from a single confusion. The words governance, ownership, and stewardship are used as if they were synonyms, and the result is owners who don't have authority, stewards who are expected to decide, and governance that has no accountability. The fix is a shared definition of which is which.

The mental model that works in practice: a steward catches a quality issue this morning. They follow up with the affected consumers and document what happened. If the issue requires a policy change, a new threshold, a different SLA; they take it to the owner of that domain. If the change affects more than one domain, the owner takes it to the governance forum. Governance writes the new rule. The owner accepts it. The steward enforces it.

The signs that the roles are confused in your organisation:

Stewards being held responsible for decisions they have no authority to make.

Owners signing off on decisions but not knowing who actually executes.

Governance meetings that last 90 minutes and produce no decision.

Three different definitions of "employment outcome" across three teams in the same programme.

The pitfallA steward gets blamed in a steering meeting for a wrong number. But she was never given authority to change the definition; that sits with an owner who's been in back-to-back meetings for three weeks. Nothing gets fixed, and everyone quietly concludes the data team has failed again.

The practiceSeparate the three roles explicitly. The steward maintains and escalates, the owner decides, governance sets the rule. When a number is wrong, everyone already knows whose call it is to fix it.

Governance = who decides. Ownership = who is accountable. Stewardship = who executes. Mix them up and you get endless meetings.

2.3 · Owners, custodians, and stewards in practice

Once governance, ownership, and stewardship are separate concepts, the next step is naming the practical roles inside ownership. The three that matter most are the data owner, the data custodian, and the data steward. All three are required. They have different decision rights, different backgrounds, and different failure modes.

How to use this in conversation:

When a quality issue lands, ask: is this a steward problem (execution), a custodian problem (technical reliability), or an owner problem (decision-making)? The answer routes the work.

When hiring or assigning, pay attention to the typical background row. Putting an IT background into a data owner role usually fails; the role needs business authority, not technical depth.

The failure-impact row is also useful for risk reporting. It maps directly to the kind of business consequence each role's failure produces, which makes risk conversations more concrete than the usual "data quality is bad."

The pitfallAn IT manager was made "data owner" for the participant domain because he ran the database. Six months on, operations and commissioning are still arguing about what "employment outcome" means, because the owner has all the technical authority and none of the business authority to settle it.

The practicePut a business leader in the owner seat and IT in the custodian seat. The owner decides what the data means; the custodian keeps it safe and available. Match the role to the kind of authority it actually needs.

Owner decides. Custodian protects. Steward maintains. Name a specific human to each, for each critical dataset. Without a named human, the role is theoretical.

AKG in practice AKG's three-layer data model

AKG separates data into three distinct layers. Each has a different purpose, different ownership, and different decision rights. They must not be collapsed - if the physical layer starts redefining business concepts, meaning drifts towards what is easiest to measure rather than what the business actually needs to know.

Conceptual

Business meaning and shared language. Defines entities (Participant, Employer) and events (Employment Outcome, Disengagement). Exists independent of contracts, metrics and systems. Changes infrequently and deliberately.

Owned by: The Business

Stewarded by: Director of BI

Dimensional

Measurement, metrics and reporting logic. Defines facts, dimensions, valid combinations and standard calculations. Normative - it defines which combinations are approved, not just possible.

Owned by: BI Team

Stewarded by: Head of BI & Analytics

Physical

Technical implementation in Snowflake. Tables, columns, joins, storage and movement. Can change without changing business meaning - but must not redefine concepts or metrics independently.

Owned by: BI & IT

Stewarded by: Head of MI & Infrastructure

Think of it as a cake: the conceptual layer is the list of cakes you want to bake; the dimensional layer is the recipe; the physical layer is the kitchen. You have to know what cakes you want before you write recipes or mess about in the kitchen.

AKG in practice The dimensional layer in practice: STAR schema

At AKG, the dimensional layer is implemented as a STAR schema in Snowflake. A STAR schema has one central fact table surrounded by dimension tables. The fact table holds the numbers you measure (outcomes, starts, earnings). The dimension tables hold the context you filter by (programme, employer, participant, time). This structure keeps reporting fast, readable, and consistent.

The STAR schema is not a technical convenience. It is the direct expression of the dimensional layer: facts are the things we measure, dimensions are the context of the things we measure. The schema enforces this distinction physically.

2.4 · The shape of the data team

Two analysts walk into the same meeting with different numbers for "sustained employment outcomes." Both are sure. Both have the query to prove it. The meeting burns forty minutes, not because either is wrong, but because there is no answer to the only question that matters: whose definition wins? That is not a data-quality problem. It is a question about the shape of the data team. At AKG, where the same participant can appear in PICS, AKG360, and a commissioner return on the same day, this question has a real cost.

There are four common shapes, sitting on a spectrum from one central team that does everything to every business unit running its own, shown below. None is the right answer. Each trades the same two things against each other: speed inside the domains, and consistency across them. A central team gives you one version of the truth and a queue to get anything done. Fully decentralised gives every team what it wants, fast, and three definitions of "sustained outcome" by Christmas. Hub-and-spoke and federated are the attempts to buy some of both.

But the shape is the second question. The first is Wernfeldt's, and it is the one to write on the wall: the model matters less than knowing who decides when people disagree. You can run any of these shapes well, and you can run any of them into the ground; the difference is whether, when two teams collide over a definition, someone can say whose call it is without looking it up. Centralised answers "the centre, always." Federated, done well, answers "the centre breaks the tie; the domains propose." Decentralised, too often, answers nothing at all, and nothing is the answer that quietly costs you a clean set of numbers.

Choosing a shape you can actually run:

Start more central than feels comfortable. Hewing's rule holds, begin centralised, then loosen. It is far easier to hand autonomy out than to claw it back once five teams own five pipelines.

Let the symptom tell you which way to move. Domains idling in a central queue means you have outgrown centralised. The same metric meaning three different things means you have drifted too far the other way.

Name the tie-breaker before you need one. The cheap moment is now, in the calm; the expensive moment is mid-argument with two VPs watching. Write down who settles a cross-domain clash, a named person or a standing forum, not "we'll figure it out."

Don't reorganise your way out of a quality problem. Moving the boxes on the org chart does not fix data nobody trusts; it just changes who gets blamed. Part 3 makes the same point about data mesh.

The pitfallA company went "data mesh" and pushed ownership out to every domain. Eighteen months later there were four definitions of a sustained outcome, no one had the authority to choose between them, and every commissioner return arrived with a footnote. The shape had changed; the question of who decides had never been asked.

The practicePick whichever shape fits how your teams actually work, then name, in writing, who breaks a tie when two domains disagree. The org chart is the easy half. The tie-breaker is the half that turns it into a model.

THE DECISION TEST Team shape is the Decision Test pointed at the org chart. A structure earns its place only if it makes one decision clearer, who settles a tie. If it cannot answer that, you have drawn boxes, not built a model.

There is no correct shape. There is only whether, when two teams disagree, someone can say whose call it is, quickly, and without anyone having to look it up.

AKG in practice The BI team RACI - who decides what

The table below shows how decision rights are split across the three senior BI roles at AKG. R = Responsible (does the work), A = Accountable (owns the outcome), C = Consulted, I = Informed. Where a cell shows A/R, one person is both accountable and doing the work.

Activity	DBI	HoMI	HoBI
Strategy & Governance
Data, MI and insight strategy	A/R	C	C
Physical data layer and infrastructure governance	C	A/R	C
Logical data layer and data product governance	C	C	A/R
Business definitions and KPIs	A/R	C	C
Reporting & Insights Delivery
Snowflake architecture and data models	I	A/R	C
Source system ingestion (PICS, AKG360)	I	A/R	C
Management information (standard MI)	I	C	A/R
Power BI platform, dashboards and reporting	C	I	A/R
Insights demand management and prioritisation	A/R	C	C
Data Quality & Incidents
Quality controls in Snowflake	I	A/R	C
Quality issues at reporting and insight level	I	C	A/R
Incident management (Snowflake layer)	I	A/R	C
Incident management (post-Snowflake)	I	C	A/R

DBI = Director of Business Intelligence (Ben Edgington). HoMI = Head of MI and Data Infrastructure (Alan Beaton). HoBI = Head of BI and Data Analytics (Kevin Colborne).

2.5 · A 24-month governance roadmap

Eight steps tell you what to do in what order. A roadmap tells you how long it should take, and what "good" looks like at each milestone. The phases below are drawn from regulated industry practice, financial services in particular, but the shape applies more broadly. Two-year programmes are typical; faster is possible but rare; slower usually means the sponsor has lost interest.

What each phase optimises for:

Phase 1 (0 - 6 months) optimises for clarity. Who owns what, what's risky, what the operating model looks like. The deliverables are mostly documents, but the substance is naming people.

Phase 2 (6 - 12 months) optimises for embedded controls. The goal is moving governance from a folder of policies to lineage, contracts, and quality checks that live in the platform.

Phase 3 (12 - 18 months) optimises for continuous assurance. The shift is from "we reviewed it last quarter" to "we know now whether it's working."

Phase 4 (18 - 24 months) optimises for invisibility. Machine-readable policies, embedded guardrails, executive dashboards. At this point governance has become part of the architecture rather than a function bolted on top.

The pitfallLeadership was promised "enterprise data governance" in six months. At month five, a tool bought, a policy written, nothing actually embedded; the sponsor lost patience and quietly redirected the budget. The programme died not because it failed, but because it promised the wrong timeline.

The practicePhase it over 18 - 24 months with a visible win at each stage. Anchor ownership first, embed controls second, automate last. Show the sponsor progress they can see before asking them to keep believing.

Governance maturity is a 24-month story. Programmes that promise it in 6 are selling you a document. Programmes that take 5 years have lost their sponsor.

End of Part 2

Next: Part 3, Architecture & Engineering. The platform that has to carry the governance model defined here, and how the choice of stack, mesh, fabric, or lakehouse follows from the bottleneck rather than the brand.

Visual The 24-month governance roadmap

Phase 1

Months 0 - 6

Optimises for: Clarity

Business case and sponsor secured

Data domains identified

Owners named per domain

Maturity baseline established

Operating model documented

First governance forum running

Phase 2

Months 6 - 12

Optimises for: Embedded controls

Data contracts on critical datasets

Lineage mapped for key pipelines

Quality checks in platform

Data dictionary live

Classification applied at ingestion

First quality KPIs reported

Phase 3

Months 12 - 18

Optimises for: Continuous assurance

Observability cycle operational

SLAs tracked automatically

Incidents trigger defined workflow

Self-service expanding

Maturity reassessed vs baseline

ROI portfolio started

Phase 4

Months 18 - 24

Optimises for: Invisibility

Governance embedded in architecture

Policy checks automated in CI

Executive dashboard running

AI use cases on trusted data

Level 3-4 maturity demonstrated

Programme self-sustaining

Part 3 of 8

Architecture & Engineering

Architecture is where strategy meets reality. The right architecture is not the one with the most components or the trendiest names; it is the one that matches the bottleneck you actually have. This part covers the four architectural decisions that matter most: the layered platform model, the difference between a stack and a platform, the governed view of what sits between source data and the board deck, and how to choose between mesh, fabric, and lakehouse when the question is which paradigm to invest in.

3.1 · The layered model of a modern data platform

Almost every modern data platform, regardless of vendor, cloud, or scale, has the same seven layers. The names vary; the function does not. Understanding the layers is useful because it gives a vocabulary for diagnosing problems ("the issue is in curation, not in storage") and because it makes clear which layers are optional and which are not.

Four things worth noticing in the model:

The top layer (Experience & Consumption) is where the business sees value. Every layer below it exists to serve that one. If a programme cannot draw a line from a layer-7 investment to layer-1 outcomes, the investment is wrong-sized.

Layer 7 (Source Systems) is usually inherited. You don't control it, but you inherit all its problems. A meaningful share of data quality issues are actually source system issues. Naming this honestly saves arguments.

Layers 4 (Processing & Orchestration) is where most modernisation projects get stuck. The pipelines work in dev, the schedules drift, the dependencies are unclear, error handling becomes an inbox of alerts nobody acts on.

The cross-cutting capabilities on the right, governance, metadata, DataOps, are not add-ons. Treating them as Phase Two work is the single most reliable way to produce an unusable platform.

The pitfallThe team poured a year into a slick consumption layer, dashboards, a semantic model, the lot, on top of ingestion that silently dropped rows. The dashboards were beautiful, and wrong.

The practiceBuild the lower layers before the visible ones. A flawless consumption layer sitting on broken ingestion is just a confident way to be wrong.

Every modern data platform has these layers. The difference between a good one and a swamp is whether someone owns each layer, or just hopes it works.

3.2 · Stack vs platform

Two companies can buy exactly the same tools and end up in very different places. One has a reliable data platform people trust. The other has a stack, a chain of tools held together with glue and weekend work. The difference is not the tools. It is what was designed around them.

A modern data stack is a flexible, fast way to get started. Pick a tool for ingestion, one for storage, one for transformation, one for analytics. Each tool does one job well. New teams can be productive within weeks. This is genuinely valuable; most companies should start as a stack.

What changes is what the stack becomes after a year. Once there are 30 pipelines, 200 tables, and 50 consumers, the gaps that didn't matter at the start start to matter a lot. Who owns the customer table? Why is the dashboard wrong? How much is this costing us? A stack does not answer those questions. A platform does.

The transition from stack to platform is usually triggered by one of these:

A senior leader stops trusting a dashboard because they spotted a discrepancy.

Cloud spend grows faster than business value and someone in finance starts asking questions.

A compliance event forces a question of who has access to what, and the answer is unclear.

A new analyst joins and spends their first week asking which table holds the right participant record, what worked for a team of 5 does not scale to 15.

The pitfallThey bought best-in-class tools for every job. Eighteen months later, three engineers spent most of their week as human glue, nobody could say which numbers were trusted, and the cloud bill arrived as a surprise every month.

The practiceDesign how the tools run together, ownership, contracts, lineage, cost visibility, not just which tools you own. A stack becomes a platform when you engineer the seams.

A stack is what you build with. A platform is what makes it reliable at scale. Don't just buy tools, design how they run.

AKG in practice Why STAR schema over the alternatives

AKG uses a STAR schema in Snowflake rather than Data Vault, wide tables, or a pure normalised model. The choice reflects the primary use case: consistent, fast reporting to commissioners and leadership, not exploratory data science or raw audit trails.

3.3 · From source system to board deck

The seven-layer platform model is engineering-focused. The view that follows is the same system seen through a governance lens, six layers between the raw data leadership inherits and the numbers leadership puts in board decks. Each layer is a place where ownership can be clear or unclear. Most organisations stop being explicit about ownership at the third layer up. They then wonder why the numbers at the top don't reconcile.

The diagnostic question that comes out of this view is simple: can you trace any number at the top all the way back to its source? Not in theory, in practice, within the next thirty minutes, with documented links, named owners, and a quality check at each transition.

Reading the stack from the inside out:

Source Systems exist whether you admit it or not. The ERP doesn't care about your data programme.

Lineage & Transparency is where many organisations stop and think they're governed. They have a lineage tool, therefore they have lineage. The tool is necessary; on its own it is not sufficient.

Data Quality is what makes change safe. Without it, every schema change in the source becomes a fire drill downstream. With it, broken contracts trigger alerts before broken dashboards.

Definitions & Ownership is the governance layer. Business definitions, metric logic, named owners, escalation paths. It is an operating model, not a policy folder.

Trust at Scale is what leadership thinks they're paying for when they buy a platform. They're not wrong to want it; they're wrong to think it can exist without the layers underneath.

AI, Analytics & Automation is only possible when every layer below exists. Putting an LLM on top of layer-1 data is a known way to fail expensively.

The pitfallAsked to trace a commissioner-return number back to its source, the team needed four days and three Slack threads. The number had been wrong for two quarters, and nobody could prove where it broke.

The practiceMake every number traceable from board deck to source, named owners and a quality check at each layer. If you can't trace it, you can't defend it.

Every company has this stack. The difference is whether someone owns each layer, or just hopes it works.

3.4 · Mesh, fabric, or lakehouse, pick by bottleneck

Three architectural paradigms get talked about as if they were competitors. They are not. Each answers a different question, and the right one for an organisation depends on which question is actually being asked. The framing that works: start with the bottleneck, not with the brand.

Each paradigm changes a different dimension. Data Mesh is an organisational shift; it changes who is accountable. Data Fabric is an architectural shift; it changes how you find and access data without moving it. Data Lakehouse is a technological shift; it changes the underlying storage model. They can coexist. A company can run a lakehouse, expose it through a fabric, and govern it as a mesh. What matters is which constraint you are actually trying to relieve.

How to decide which is the right next bet:

If the central data team is the bottleneck, work piles up, domains can't move; the answer is organisational. Mesh.

If integration and discovery are the bottleneck, data is everywhere but hard to find or join; the answer is architectural. Fabric.

If storage cost, duplication, or the BI-vs-ML split is the bottleneck, multiple copies, slow analytics on lake data, expensive warehouse; the answer is technological. Lakehouse.

If you cannot name the bottleneck specifically, the answer is not any of them. The answer is to fix governance and quality first. None of these paradigms will save a programme that hasn't done the work in Parts 1 and 2.

The pitfallLeadership read an article and mandated a data mesh. Eighteen months on, the central bottleneck was untouched; they had reorganised the org chart but never fixed the actual constraint, which was data quality.

The practiceName the real bottleneck first, then pick the paradigm that relieves it. Mesh, fabric, and lakehouse each fix a different problem, buying the wrong one is expensive.

If your strategy starts with "which tool," it's already wrong. Start with which bottleneck. The paradigm follows.

THE DECISION TEST "Which paradigm?" is the same trap as "which tool?", a purchase hunting for a justification. Name the bottleneck a paradigm would actually relieve: the central queue, the discovery problem, the storage bill. If you can't name one, no architecture is the answer, and the real work is still back in Parts 1 and 2.

3.5 · The medallion architecture

A team stands up a warehouse, points every source at it, and lets analysts build straight on top of whatever lands. For a while it works. Then a source system renames a field, a board dashboard breaks, and nobody can say whether the number was ever right, because there was no layer between the raw feed and the report where anyone checked. The medallion architecture puts those layers back. It splits the journey from raw data to analytics into three deliberate stages, Bronze, Silver, and Gold, each with one job, so that quality is built in rather than hoped for.

The names are a convention, not a product you buy. Bronze is raw, Silver is refined, Gold is ready, and data moves one way through them, each layer improving on the one before. The whole discipline is in keeping the stages separate. The moment you clean data in Bronze, or let an analyst query raw rows from Gold, the layering stops protecting you and the old problem walks back in.

The medallion architecture: sources flow through Bronze, Silver and Gold to BI and ML

What each layer is for:

Bronze is the landing zone. It takes an exact, append-only copy of the source, schema quirks and all, and changes nothing. That is the point: because it is untouched, you can always reprocess from it and recover when something downstream breaks. The one rule here is the one most often broken. Do not clean or transform data in Bronze. The day you do, you lose the faithful copy you were keeping it for. This is the data engineer's floor.

Silver is where data becomes trustworthy. Records are deduplicated, formats standardised, invalid rows filtered out, and business logic and joins applied, so that a row in Silver means what it claims to mean. This is the layer programmes skip when they are in a hurry, building Gold straight on top of Bronze, and that skip is exactly what produces dashboards that are confidently wrong. Silver is where analytics engineers and data scientists do their work.

Gold is built for decision-making. It holds aggregated, business-level metrics in shapes optimised for querying, star schemas and the curated tables behind a dashboard or a model. It is the only layer most analysts and business users ever touch, and the only one where a number is ready to be acted on. Everything beneath it exists to make Gold worth trusting.

The pitfallAn analyst, short on time, pointed a board dashboard straight at the Bronze feed. It looked fine in the demo. Three weeks later the source renamed a column, the dashboard quietly showed missing values as zero, and a decision was made on a number nobody had validated.

The practiceLet each layer do its one job. Raw stays raw in Bronze, gets cleaned and validated in Silver, and is shaped for use in Gold. Never transform in Bronze, never skip Silver, never explore raw data straight from Gold.

The takeaways worth keeping:

Bronze is raw, Silver is clean, Gold is insight. Each name describes a job, not a tool.

Every layer has one clear purpose, and the value is in keeping them separate.

Transforming in Bronze, or skipping Silver, quietly reintroduces the problem the architecture removes.

A layered design scales: new sources land in Bronze without disturbing the Gold tables a board depends on.

THE DECISION TEST "Build a pipeline" is not a decision; it is plumbing. The medallion architecture earns its keep at exactly one point, the Gold layer, where a number is finally ready to change something. Bronze and Silver are cost until Gold pays them back. If you cannot name the decision a Gold table changes, you have built three careful layers to move data that nobody acts on.

End of Part 3

Next: Part 4, Quality, Trust & Observability. The architecture has to carry trustworthy data. What that means in practice: the five pillars, the modern quality framework, the observability cycle, and the data contracts that hold producers and consumers to each other.

Part 4 of 8

Quality, Trust & Observability

A data platform that is well-architected but full of unreliable data is worse than no platform at all; it gives leadership the confidence to act on numbers they should be questioning. This part covers what "good" looks like in practice: the five pillars every dataset has to hold up across, the framework that turns those pillars into a measurable system, the observability loop that catches and prevents incidents, and the data contracts that make change safe instead of scary.

4.1 · The five pillars of data quality

Data quality is one of those phrases that means everything and therefore nothing. The five pillars below are the shared vocabulary that makes the phrase actionable. Every dataset the business depends on must hold up across all five, and when it doesn't, the pillar tells you what kind of problem you have and what kind of check would have caught it.

How to use the pillars in a conversation:

When someone says "the data is bad," make them name a pillar. "The numbers don't match PICS" is an accuracy problem. "Half the rows are missing" is completeness. "The dashboard says 24 hours stale" is timeliness. Naming the pillar narrows the fix.

Treat the five as a set, not a ranking. A table can be perfectly accurate and useless because it's two weeks stale. A real-time table is useless if it conforms to a schema that nobody understands.

Each pillar has a typical technical check that catches its failure mode. Those checks are not optional; they are how a programme moves from manual eyeballing to automated trust.

The pitfall"The data is bad" had been the standing complaint for a year. Nobody could say which way it was bad, so every fix was a guess, and none of them stuck.

The practiceName the pillar. "The numbers don't match PICS" is accuracy; "half the rows are missing" is completeness. Naming the failure mode points straight at the fix.

All five pillars rest on the same foundation: trust. The day stakeholders stop trusting a dashboard, every layer underneath becomes wasted work.

4.2 · The modern data quality framework

The pillars tell you what to check. The framework that follows tells you how to organise the checking so it becomes a system. At the centre is data trust; the actual goal. Around it are four quadrants: what the business gets when trust is high, what is checked technically, what is measured as a programme, and what governance capabilities the system depends on.

Reading the four quadrants:

Business value is the outcome you are paying for. High adoption, confident decisions, reduced risk, trusted AI. These are the things leadership signed up for.

Technical audits are what your platform does automatically. Freshness checks, volume anomaly detection, schema drift, null rates. The cost of running them is trivial; the cost of not running them shows up in incidents.

Quality KPIs are how you measure your own programme. Data SLAs, time-to-detect, time-to-resolve, issue volume. Without these, you can't tell whether you're getting better.

Governance stack is what makes all the above possible: catalog, lineage, contracts, stewardship. Without these layers, technical audits are noise and KPIs are guesses.

The arrows in the framework all point inward because the relationship is causal, not incidental. Business value is produced by the other three working together. Removing any one collapses the model.

The pitfallQuality was "owned" by everyone and therefore no one. Issues were caught by whoever happened to notice, fixed in a panic, and forgotten, until the same thing broke again.

The practiceRun quality as a system: automated audits, a measured KPI set, and a governance stack underneath. Trust is produced by the machine, not by vigilance.

Data Pipelines × Governance → Reliability → Trust → Adoption → Value. Each arrow is earned, not assumed.

4.3 · The observability cycle

Every incident is data. Caught early, it is a learning event. Caught late, it is a credibility event. The observability cycle is the loop that turns incidents into permanent improvements instead of recurring fires.

The three steps and what "good" looks like at each:

Detect; the goal is shorter time-to-detect. "We found out when the CEO emailed" is the failure mode. Real-time alerts on freshness, volume, schema, and quality thresholds are the floor, not the ceiling.

Diagnose; the goal is shorter time-to-resolve (MTTR). Trace upstream through lineage, identify which job, source, or contract broke. Without lineage this step takes hours; with lineage it takes minutes.

Prevent; the goal is non-recurrence. After every incident, ask: what test, contract, or guardrail would have caught this earlier? Add it. The same incident happening twice is a process failure, not a data failure.

The metrics that matter, time-to-detect, time-to-resolve, incident recurrence, should appear in your quality KPI set. They are leading indicators of trust. If they're moving in the right direction, trust will follow. If they're flat or worsening, no amount of communication will fix the perception, because the perception is correct.

The pitfallThe team found out the pipeline had been broken for nine days when a programme director emailed asking why their commissioner return still showed last week's figures.

The practiceDetect with real-time alerts, diagnose through lineage, prevent with a test after every incident. A programme director emailing about stale data should never be your monitoring system.

The same incident twice is a process failure, not a data failure. Every fire should leave behind a test or a contract that prevents the next one.

THE DECISION TEST An alert no one acts on is a dashboard that changes no decision, wearing a different hat. Before you add a check, ask what you'd actually do the morning it fires. If the answer is "nothing," you haven't added safety; you've added noise, and noise is what trains people to ignore the alert that finally matters.

4.4 · Data contracts

A data contract is a written agreement between the team that produces a dataset and the teams that consume it. It looks like an API contract because it functions like one. The producer commits to a schema, an SLA, and a meaning. The consumer commits to depending only on what the contract guarantees. When either side wants to change the contract, that conversation happens before the change ships, not after a dashboard breaks.

What a working contract actually contains:

Schema, field names, types, nullability. Strict enough to fail loudly when a producer tries to ship a breaking change.

SLAs, freshness, latency, availability. "Updated by 6am, less than two hours behind source, 99.5% uptime."

Semantics, what each field means in business terms. The field is called customer_id; the contract says which system of record it points to and what "customer" means here.

Quality, allowed null rates, valid value ranges, referential rules. These are the technical audits from section 4.2 codified.

Ownership; the producer team, the on-call rota, the escalation path. Named humans.

Versioning, how breaking changes are signalled, how consumers are notified, how the deprecation window works.

Contracts work because they are enforced in CI/CD, not in PDF files. A schema drift breaks the build, not production. An SLA violation triggers an incident workflow, not a Slack argument. This is the difference between a contract that lives in the platform and a contract that lives in a folder.

The pitfallA source system change in PICS on a Friday meant that by Monday four dashboards were silently wrong, and the analytics team lost a week tracing which change broke what.

The practicePut a contract between producer and consumer, enforced in CI. A breaking change should fail the build, not the board report.

A contract is what turns "who broke the dashboard?" into "the producer's CI caught it before deployment." Enforce in code, not in conversation.

End of Part 4

Next: Part 5, Metadata, Privacy & Lifecycle. The catalog, classification, lineage, and lifecycle controls that turn a trustworthy platform into a governable one. Where the data came from, who is allowed to see it, and what happens to it when nobody needs it any more.

AKG in practice Data product categorisation - Certified, Indicative, Development

Every data product issued by AKG's BI team carries one of three labels. The label tells users how much assurance sits behind the numbers, and what decisions it is safe to use them for. Products that don't carry a label should be treated as Development until confirmed otherwise.

✓ Certified

Formally approved. Tested for accuracy and consistency. Safe for strategic decisions and contractual reporting.

Agreed calculation methods
Controlled, quality-assured sources
Peer-reviewed and formally signed off
Version controlled

~ Indicative

Initial checks done. Suitable for most operational decisions. Not for critical, contractual or external reporting.

May still be refined
Minor calculation adjustments possible
Interpret with appropriate judgement
Not the official source of truth yet

⚠ Development

Work in progress. For exploration and scoping only. Not suitable for any decision-making.

May use draft logic or incomplete data
Not fully tested or validated
May change without notice

Part 5 of 8

Metadata, Privacy & Lifecycle

Data without context is noise. Metadata is the context that makes data findable, understandable, and trustworthy. Privacy is the discipline that ensures context survives contact with regulation. Lifecycle is the recognition that data does not live forever, and that managing what happens at the ends is as important as managing what happens in the middle. This part covers the four anchors: the lifecycle every dataset moves through, the layered metadata that gives data meaning, the classification model that drives privacy controls, and the GDPR principles every privacy programme has to satisfy.

5.1 · The data lifecycle

Every dataset moves through six stages, whether you manage them deliberately or not. Where programmes typically focus their attention is on Create and Use, getting data in, then getting value out. The expensive failures happen at the stages that get less attention: Share (who got what), Archive (where did it go), and Delete (is it actually gone).

Three things worth being explicit about:

Classification happens at Create. Trying to classify retroactively is expensive and unreliable. Build classification into the ingestion process so every new dataset starts with a known tier.

Share is where most privacy incidents originate. Data leaves the boundary of one team and ends up somewhere it shouldn't be. Contracts (Part 4) and audit trails are the controls that matter here.

Delete is the stage where regulators ask hard questions. Right-to-erasure under GDPR is not a feature; it is a process that has to demonstrate what was deleted, when, and from how many systems. If you can't answer those questions in writing, you don't actually have a Delete stage.

Manage all six stages or hope the regulators don't ask about the three you ignored.

5.2 · The metadata stack

Metadata is often discussed as a single thing. It isn't. There are at least four kinds, each valuable for different reasons, and most data catalogs ship the foundational layer and call the job done. The value compounds upward, and the higher layers are what turn a catalog from a directory into a tool people actually use.

Reading the stack from the bottom up:

Technical metadata is the foundation, schemas, types, file formats, lineage edges. This is what most catalogs auto-ingest. Without it, nothing above it is possible.

Business metadata is what makes the data understandable to anyone outside the engineering team. Definitions, glossaries, metric logic, named owners. This is the layer that takes deliberate human effort and that delivers the most value when it's well-maintained.

Operational metadata is the evidence that the platform is alive. Pipeline runs, refresh times, job status, cost per query. Most teams have this data somewhere, surfacing it in the catalog is what turns it into a trust signal.

Social metadata is the most undervalued layer. Who endorsed this dataset? Who reported an issue last week? Who actually uses it? In practice this is the metadata consumers care most about, because it's the closest thing to a recommendation engine.

Most catalogs ship technical metadata and call it a day. The investment that matters is in the business and social layers, that's where adoption lives.

AKG in practice How the AKG Data Dictionary works

AKG maintains a Data Dictionary hosted in SharePoint. It is the authoritative source for all business definitions - it takes precedence over local practice, historic usage, or report-specific interpretations. Every term has a named owner. All changes go through HALO.

Request

Raised via HALO. States who, what, and why.

›

BI Review

Assesses risks to data integrity, impact on reports and cost of change.

›

Owner Approval

Named term owner reviews proposed definition and approves or rejects.

›

Roadmap

BI builds implementation plan covering upstream, models and downstream reports.

›

Publish

Definition added to SharePoint with effective date and next review date.

Conflicting interpretations must not be resolved through local workarounds or undocumented logic. Raise via HALO - the term owner decides.

5.3 · Privacy classification

Privacy controls cannot be uniform. Treating every dataset like it contains PII is expensive and wasteful; treating no datasets like they contain PII is reckless. The fix is classification: every dataset belongs to exactly one tier, and the tier drives the controls.

How classification actually works in practice:

Classify at point of creation. Retroactive classification of a large estate is a thankless multi-quarter project. New data gets tagged correctly from day one; legacy data gets cleaned up opportunistically.

Unclassified data defaults to the most restrictive tier. This protects against the default failure mode of "we hadn't decided yet, so we let everyone see it." At AKG, where participant records carry protected characteristics and contractual obligations, that is not a defensible position.

Classification is a property of the dataset, not the system. Restricted data in a public BI tool is still Restricted. Internal data in an external API is still Internal. The controls follow the tier across boundaries.

Re-classification needs a process. Data that was Internal last year may be Confidential this year because the business has changed. Build a review cadence in, annual minimum for Confidential and above.

Classification is the input to every other privacy control. Get it wrong and every downstream control fires the wrong way.

5.4 · GDPR's seven principles

The European Union's General Data Protection Regulation has set the global default for how personal data should be handled. Even organisations outside the EU find themselves answering to it, through customers, partners, regulators, or successor laws modelled on it. The seven principles below are the foundation. They are not optional, and they are not bureaucratic; they are a coherent design philosophy for how a programme treats people whose data it holds.

Why each principle matters in practice:

Lawfulness, fairness, transparency is the foundation. If a data subject would be surprised to learn you held this data, you have a problem.

Purpose limitation prevents scope creep. Data collected to fulfil an order should not silently become marketing fuel.

Data minimisation is the discipline of asking "do we actually need this field?" before adding it to a form. The cheapest field to protect is the one you didn't collect.

Accuracy is shared with Part 4's quality pillars. Wrong data about a person can cause real-world harm, credit decisions, healthcare, employment.

Storage limitation is the policy hook for the Delete stage in section 5.1. If you cannot delete, you cannot comply.

Integrity and confidentiality is where information security meets data governance. Encryption, access control, breach response.

Accountability is the meta-principle. Demonstrating compliance is itself a deliverable, documentation, audit trails, decisions on the record.

THE DECISION TEST Same question, one field smaller. Before you add a column to the form, ask what someone would do with it. If the answer is nothing, you've just spared yourself a thing to encrypt, retain, and one day explain to a regulator who asks why you were holding it.

Privacy is not about avoiding fines. It is about earning the right to keep operating with data that belongs, ultimately, to other people.

End of Part 5

Next: Part 6, AI & Analytics Enablement. The layer leadership is most excited about, and the one that depends most on everything in the first five parts being in place. The maturity ladder from chaos to AI-ready, the AI model lifecycle, BI and self-service, and the discipline of telling stories with data.

Part 6 of 8

AI & Analytics Enablement

This is the part of the playbook leadership is most excited about, and the part that depends most on everything before it being in place. Generative AI has produced more demand for trustworthy data in the last two years than the previous two decades combined. The companies that can supply it have an advantage that compounds. The companies that cannot are quietly being lapped. This part covers the four anchors: where an organisation sits on the AI maturity curve, the governance landscape every AI programme has to navigate, the lifecycle that turns a model into a product, and the BI journey that turns raw data into a decision worth making.

6.1 · The AI maturity ladder

Most organisations describe themselves as "AI-ready." Very few are. The maturity ladder below is a more honest diagnostic, five levels from firefighting chaos to genuine AI competitiveness, with a sharp threshold in the middle that very few cross.

What each level looks like in the wild:

Level 1, Chaos & Firefighting. Excel-driven analytics. Different teams report different numbers for the same metric. A senior analyst leaves and three months of institutional knowledge leaves with them.

Level 2, Defined Foundations. Standards are documented but enforcement is uneven. Owners are named on paper. Data quality fixes are ongoing, and ongoing, and ongoing.

Level 3, Governed & Trusted. This is the tipping point. Trusted data products exist. Metadata and lineage are live. Stewards are active. The business has stopped treating data as a problem to be managed and started treating it as a capability to be invested in.

Level 4, Scalable & Automated. Self-service analytics is the norm. Data contracts are enforced in CI. New use cases ship in weeks rather than quarters.

Level 5, Strategic & AI-Ready. LLMs and RAG run reliably on production data. Autonomous agents act on trusted information. The company has a measurable advantage that competitors find hard to copy.

The pitfallLeadership wanted an AI strategy, so a pilot was built on top of participant data three teams knew was incomplete. It confidently produced wrong outcome predictions, and the commissioner demo went badly.

The practiceBe honest about your maturity level. Fix trusted data first; the AI that runs on it will be the part that works. Level 5 ambitions on Level 1 data fail expensively.

Don't build Level 5 models on Level 1 data. Most failed AI programmes are not AI failures; they are foundation failures. Fix the foundation first.

AKG in practice Where AKG sits on the AI maturity ladder

AKG's current data stack places the organisation at the boundary between Level 2, Defined Foundations and the early stages of Level 3, Governed & Trusted.

The foundations are now visible. Snowflake provides the storage and transformation layer. Power BI provides the reporting and self service layer. HALO supports workflow, ticketing and operational control. The three layer data model described in Part 2 gives AKG a clearer route from raw data, to governed business logic, to trusted reporting outputs.

That means AKG is not starting from scratch. Ownership is being named. Standards are being defined. The direction is right.

The next step is embedding the controls that make those foundations trusted in practice, not just documented in principle. That means stronger stewardship, clearer data ownership, active quality monitoring, visible lineage, agreed definitions, and stronger controls over how data products are changed, tested and released.

The implication for AI is direct. Reliable AI outputs require Level 3 as a minimum. AI cannot compensate for weak definitions, inconsistent reporting logic, unclear ownership or poor quality source data. It will simply accelerate the consequences.

For AKG, the parts of this playbook that unlock AI readiness are not found in Part 6. They are found in Part 2, Data Architecture, Part 3, Data Governance, and Part 4, Data Quality. Part 6 is the reward for doing those parts well.

In simple terms, AKG has the ambition and the emerging technical foundations to move towards AI enabled analytics. The priority now is to cross the Level 3 threshold properly, by turning defined foundations into trusted, governed and reusable data products.

6.2 · The AI governance landscape

AI governance has stopped being optional. Three regimes, one regulatory, one voluntary framework, one certifiable management system, between them shape how serious AI work has to be governed today. They are not competitors. The EU AI Act says what is required by law. NIST AI RMF says how to organise the work. ISO 42001 provides a way to demonstrate it externally. Most programmes operating at any scale will need to satisfy all three.

Practical notes on each:

The EU AI Act applies extraterritorially. If you serve EU customers, or your model does; the risk classification applies. Unacceptable-risk uses are prohibited outright. High-risk uses come with substantial documentation, conformity assessment, and ongoing monitoring obligations.

NIST AI RMF is voluntary in the US, but its Govern - Map - Measure - Manage structure has become a de facto international vocabulary. Many organisations use it as their internal operating model regardless of jurisdiction.

ISO/IEC 42001 is the management-system standard, Plan-Do-Check-Act for AI. It is certifiable, which matters for customers, regulators, and procurement teams who want assurance beyond a self-attestation.

Other regimes worth knowing about: OECD AI Principles (high-level, influential), UNESCO AI Ethics Recommendation (rights-based), NIST AI 600-1 (generative AI specifically). Most of these reinforce rather than contradict the three above.

The pitfallA model went live scoring loan applications. Nobody had classified its risk, documented the training data, or checked for bias, until a regulator asked, and there were no answers.

The practiceClassify risk before building. Map the use case to the EU AI Act tier and the NIST functions early, so governance is designed in, not reconstructed under audit.

AI governance is not red tape. It is the operating model that lets a company deploy AI without exposing itself to consequences it has not thought through.

6.3 · The AI / model lifecycle

Models are not products you ship and forget. They are systems that drift, decay, and need active management. The lifecycle below applies as much to a fraud-detection model as to a RAG-powered customer service bot. The stages are not optional, what changes between use cases is the rigour applied at each, which is in turn determined by the risk classification set at the Design stage.

What can go wrong if a stage is skipped or rushed:

Skipping Design and the risk classification is wrong from the start. You spend validation effort on the wrong things, and you discover at deployment that the use case was actually High Risk under the EU AI Act.

Rushing Collect leads to bias problems that surface in production. "We trained on the data we had" is the most expensive sentence in machine learning.

Cutting corners on Validate is how models with strong test-set accuracy fail in deployment. Fairness, robustness, and explainability tests are not optional; they are how you discover the problems your accuracy metric doesn't show.

Skipping Monitor is the most common failure. A model performing well on day one is not the same as a model performing well on day 90. Drift is real, and unmonitored drift is the source of most quiet AI failures.

The pitfallA fraud model launched with 95% test accuracy and was never looked at again. A year later it was quietly missing half the fraud, because the patterns had shifted and nobody was watching.

The practiceTreat the model as a system with a full lifecycle. Monitor for drift, retrain on a cadence, validate every redeploy. Shipping is the start, not the finish.

"Just put GPT on it" is not a lifecycle. Treat every model, including LLM-based applications, as a system that needs design, validation, deployment, and monitoring.

6.4 · The BI journey: raw to actionable

Not every analytics workload is AI. Most of the value most organisations get from their data comes from older, less glamorous disciplines, descriptive analytics, dashboards, reporting, and the slow work of helping people make better decisions. The journey below is the path data takes from arrival to action. Each stage is a discipline of its own, and the difference between a high-functioning BI capability and a low-functioning one is whether all six stages are taken seriously.

Where most BI programmes drop the ball:

Stages 1 - 4 are the technical journey. Most BI teams handle these well. The data lands, gets sorted, gets arranged into a model, gets shown on a dashboard.

Stage 5, Story, is where most programmes underinvest. A dashboard without a narrative is a wall of numbers. The analyst's job is not just to show the data but to explain what it means, what changed, and what action follows. Data storytelling is a real skill, and it is teachable.

Stage 6, Actionable, is the measure of whether any of this matters. If the dashboard does not change a decision, the dashboard is decoration. The most useful question a BI team can ask its consumers: "what would you do differently if this number changed by 20%?" If the answer is "nothing," the report is on its way to being retired.

Self-service does not skip these stages; it distributes them. The platform supplies stages 1 - 4 as infrastructure; the consumer supplies stages 5 - 6 themselves. This works only if the consumer has the literacy to use the platform well. That is the subject of Part 7.

The pitfallThe team shipped a 30-tab performance dashboard and called it self-service. Nobody used it; there was no story and no obvious action, just a wall of numbers programme managers quietly went back to ignoring while asking the BI team for the same weekly Excel extract.

The practiceCarry the data all the way to a decision: a clear story and an obvious next action. If a dashboard doesn't change what someone does, it's decoration.

If a dashboard does not change a decision, the dashboard is decoration. Measure your BI programme by decisions affected, not reports produced.

THE DECISION TEST "If this number moved by 20%, what would you do?" is the same question pointed at a dashboard instead of a data request. If the honest answer is "nothing," the dashboard was never for a decision, and someone is still paying to refresh it every morning. Turn it off.

End of Part 6

Next: Part 7, People, Culture & Adoption. None of what's been described in Parts 1 - 6 happens without people who can do the work, a culture that values evidence over intuition, and a deliberate effort to bring the organisation along. The literacy ladder, the change management model, and the communication patterns that turn a data programme into a way of working.

AKG in practice Dashboard UAT process - the gate to Certified

At AKG, a dashboard does not become a Certified data product until it passes User Acceptance Testing. UAT is the formal step between "built and looks right" and "trusted for decisions." Skipping it is how dashboards that are confidently wrong end up in board decks. The nine-step process below is the standard; the Go/No-Go decision at the end is the moment a product earns its Certified label.

Part 7 of 8

People, Culture & Adoption

Everything in the first six parts can be built correctly and still fail, because data work is, in the end, done by people for people. A perfect platform that nobody trusts is worthless. A flawless model that nobody acts on is a science project. This part is about the human side: how literacy is built, where a culture sits on the spectrum from instinct to evidence, the two opposite ways data teams approach their work, and how to communicate so the work actually lands.

7.1 · The data literacy ladder

Data literacy is not a switch that flips. It is a ladder people climb, and the organisation's job is to help them climb it. The four rungs below describe a progression from simply knowing the data exists to being able to answer your own questions without help. Most self-service initiatives fail because they hand Level-2 people Level-4 tools and wonder why adoption stalls.

How to use the ladder:

Assess where the bulk of your organisation actually sits. Be honest; most people in most companies are at Aware or Skilled, not Fluent.

Match the intervention to the rung. Awareness campaigns move people from unaware to Aware. Tool training moves Aware to Skilled. Critical-thinking and interpretation coaching moves Skilled to Fluent. Only Fluent people genuinely self-serve.

Don't confuse tool access with literacy. Giving everyone a BI licence does not make them literate any more than giving everyone a word processor makes them writers.

The test questions on the right are deliberately concrete. "Can a programme manager answer a new question about their caseload without filing a ticket?" is a better measure of literacy than any survey.

Self-serve analytics works only when people are Fluent. Tools don't create literacy, training and culture do.

AKG in practice How the AKG data team operates - our values

The AKG BI team's mission is to turn insight into impact. These four values are not aspirational - they are the standard expected in day-to-day work. They shape how the team prioritises, how it communicates with stakeholders, and what it means to be a good colleague here.

Empowerment

Enable others to do their best work. Focus on insight that helps people act, not just understand. Commit to shared plans and shared accountability - succeed together, fail together.

Integrity

Provide objective, evidence-based insight even when it's uncomfortable. Be honest about what the data does and does not show. Prioritise trust over short-term approval.

Empathy

Put participants, learners and service users at the centre. Test every piece of work against: how will this feel and what difference will it make for the participant? Recognise the pressures stakeholders face.

Connected

Work transparently and collaboratively. Communicate clearly about what you're doing, why it matters, and when it will be delivered. Actively seek learning from wider communities of practice.

7.2 · The data culture scale

Every person relates to evidence in a particular way, and so does every organisation. The scale below runs from the gut-feeler who decides on instinct and uses data only to justify decisions already made, to the scientist who treats their own beliefs as hypotheses to be tested. Most people, and most companies, sit somewhere in the middle.

What the scale is good for:

Diagnosis. Where does your leadership team actually sit? A data programme reporting to a gut-feeler CEO faces a different challenge than one reporting to a scientist.

Realistic targets. The goal is not to turn everyone into a scientist, that's neither achievable nor desirable. The goal is to move the median one step to the right. A company of reporters who become analysts is transformed.

Spotting the trap. The sceptic is not the enemy, healthy scepticism is part of good analysis. The trap is when scepticism becomes a permanent excuse to ignore inconvenient evidence.

Self-awareness. The most useful application is personal. Where do you sit? When was the last time data changed your mind about something you believed?

The goal is not to make everyone a scientist. It is to move the median one step to the right.

7.3 · Two ways to build a data team

Give two leaders the same headcount, the same budget, and the same tools, and you can still get opposite outcomes. The difference is mindset. The comparison below contrasts the "dashboard factory builder", who optimises for output and treats governance as an obstacle, with the "modern data leader" who optimises for impact and treats governance as an enabler.

The pattern underneath the comparison:

The dashboard factory measures itself by what it produces. The modern data leader measures by what changes. A team can ship a hundred dashboards and change zero decisions, and the factory builder will call that a good quarter.

The factory builder treats governance as red tape that slows them down. The modern leader treats governance as the thing that lets users find data, trust it, and act on it without friction. Same word, opposite meaning.

The most telling row is the last one. When friction appears, the factory builder blames the culture, the stakeholders, the lack of resources. The modern leader takes control, delivers value first, earns trust, and uses that trust to scale.

This is not about talent. The factory builder is often more technically skilled. It is about where they point that skill.

Culture is not built by declaring it. It is built by delivering value before demanding literacy. Quick wins first, then the right to ask for more.

Visual Dashboard factory vs modern data leader

	Dashboard factory builder	Modern data leader
Measures success by	Reports produced, dashboards shipped, tickets closed	Decisions changed, problems solved, trust built
Treats governance as	Red tape that slows the team down	The thing that makes data findable, trustworthy and usable
Response to "the data is wrong"	Fixes it quietly, ships a corrected version, moves on	Traces the root cause, adds a contract or test, prevents recurrence
Self-service means	Give everyone a BI licence and a link	Build literacy, certify data, design for the rung people are actually on
When adoption stalls	Blames the culture, the stakeholders, the lack of resources	Delivers value first, earns trust, uses that trust to ask for more
Relationship with the business	Order-taker. Builds what is asked for, whether it is useful or not	Partner. Asks what decision it serves before building anything
Long-term outcome	A catalogue of dashboards nobody opens and a team nobody trusts	A data capability the business could not imagine being without

7.4 · Communication patterns by audience

The same finding needs four different deliveries depending on who is listening. This is the skill that most distinguishes analysts whose work gets used from analysts whose work gets filed. Most analysis dies in translation, not in calculation; the number was right, but the delivery didn't fit the listener.

The principle behind the four patterns:

Executives want the decision. Lead with the recommendation, support it with the one number that matters, and leave the method in the appendix. A single slide beats a dashboard.

Peers and analysts want the method. They will and should challenge the approach, so show the assumptions, make it reproducible, and state the confidence levels honestly.

Business users want to know what to do next. Plain language, tied to their actual workflow, with the next action obvious. Jargon is where you lose them.

The wider organisation wants to know why it matters. This is the domain of narrative, a story with a before and after, carried by visuals, repeated across channels until it sticks.

Notice that the analytical work is identical across all four. What changes is the framing, and the framing is the job. An analyst who can do this well multiplies the value of every other capability in this playbook.

The finding is the same. The framing is the job. Most analysis dies in translation, not in calculation.

End of Part 7

Next: Part 8, Measurement & Maturity, the final part. How to tell whether any of this is working: the maturity model that tracks the programme's progress, the KPIs that matter, and the discipline of proving return on investment. Followed by the appendix.

Visual The four communication patterns

⬡

Executives

What should we decide?

Lead with the recommendation. One number. One action.

Start with the decision, not the data
One slide beats a 20-tab dashboard
Put the method in the appendix
State confidence level and caveats briefly
End with a clear ask

◎

Peers and analysts

How did you get there?

Show the method. Make it reproducible. Invite challenge.

State assumptions explicitly
Show confidence intervals and edge cases
Make the code or logic accessible
Welcome scrutiny - it improves the work
Document limitations honestly

⚡

Business users

What do I do with this?

Plain language. Tied to their workflow. Next action obvious.

No jargon - if they have to ask, you have lost them
Connect directly to their daily decisions
Show the "so what" before the "how"
Keep it short - they have twelve other things open
Make the next step a single click or call

◉

Wider organisation

Why does this matter to me?

Tell a story. Before and after. Repeat until it sticks.

Lead with a human story, then the number
Before and after is the simplest structure
Visuals carry more than text at this scale
Repeat the same story across channels
Celebrate the people who used the data, not the data itself

Part 8 of 8

Measurement & Maturity

The final discipline is knowing whether any of it is working. A data programme that cannot measure itself cannot improve itself, cannot defend its budget, and cannot tell its sponsor a credible story about progress. This part covers the four anchors of measurement: the maturity model that tracks where the programme is, the metrics that fit each level of maturity, the KPI hierarchy that serves different audiences, and the hard discipline of proving return on investment.

8.1 · The maturity pyramid

Maturity models are useful for one reason above all: they let you have an honest, non-personal conversation about where the programme actually is. "We're at Level 2" is a more productive sentence than "the data is bad." The five-level pyramid below runs from a programme that exists on paper to one that delivers measurable strategic value.

Using the model well:

Assess honestly. Most programmes overestimate their level. Having owners named on paper (Level 1 - 2) is not the same as having active, accountable stewards (Level 3+).

Don't skip levels. A programme at Level 2 cannot leap to Level 5 by buying an AI tool. The levels are cumulative; each depends on the one below being solid.

Match metrics to level. This is the most practical use of the pyramid, and the subject of the next section. Measuring strategic-value metrics at a Level-1 programme produces nothing but demoralisation.

Use it for the sponsor conversation. "We were at Level 2 a year ago, we're at Level 3 now, here's what Level 4 requires" is exactly the kind of narrative that keeps a programme funded.

The pitfallA brand-new programme set itself a target of "reduce time-to-insight by 30%." With no stable definitions or agreed ownership, the metric bounced around meaninglessly, and the team felt like failures by month three.

The practiceAssess your maturity honestly and measure what's real for your level. A Level-1 programme counts owners and complaints, and that's exactly the right thing to count.

Maturity is cumulative. You cannot buy your way to Level 5; you have to climb through the levels below it.

8.2 · Metrics by maturity level

The single most common measurement mistake is using metrics that don't fit your maturity. A Level-1 programme measuring "reduction in time-to-insight for analytics teams" will report noise, because there isn't yet a stable enough foundation to move that number. The matrix below matches three kinds of metric, activity, quality output, and business impact, to each maturity level.

How to read the matrix:

Read across a row to get a coherent metric set for your current level. At Level 1, you measure whether assets have owners, count quality complaints, and track manual fixes. That's honest and it's achievable.

Read up a column to see how the bar rises. The activity metric goes from "% of assets with an owner" (L1) to "data product reuse and adoption rate" (L5). Same dimension, vastly more sophisticated question.

Notice the shift in emphasis. Lower levels are dominated by activity metrics, things you do. Upper levels are dominated by impact metrics, outcomes you produce. The transition happens around Level 3, the governance threshold from Part 6.

Set targets one level up, not five. The goal of a Level-2 programme is to hit Level-3 metrics, not Level-5 ones. Ambition is good; measuring against an unreachable bar is not.

The pitfallThe dashboard tracked twenty metrics because twenty looked thorough. Half couldn't be moved at the team's maturity, so nobody acted on any of them, and the dashboard became wallpaper.

The practicePick the row that matches your level. A coherent handful of metrics you can actually move beats a wall of numbers that only looks comprehensive.

Measure what's real for your level. Activity metrics at the bottom, business impact at the top, and a deliberate climb between them.

THE DECISION TEST A metric nobody acts on is the same waste wearing a number. Keep the few that change what someone does; let the rest go before they quietly become wallpaper on a screen no one reads.

8.3 · The KPI hierarchy

Different audiences need different metrics. The board does not want to hear about pipeline success rates; the data team cannot meaningfully report on revenue influenced. The three-tier hierarchy below sorts metrics by the question they answer and the audience they serve.

The three tiers and who they're for:

Operational metrics answer "is the machine running?", pipeline success, freshness, quality pass rates, incident volume. These are for the data team, reviewed daily or weekly.

Tactical metrics answer "is the programme working?", SLA compliance, resolution time, adoption, certified products. These are for data leadership, reviewed monthly.

Strategic metrics answer "why does this programme exist?", revenue influenced, risk reduced, cost avoided, decisions accelerated. These are for the board, reviewed quarterly.

The cardinal sin is reporting the wrong tier to the wrong audience. A board deck full of pipeline success rates signals that the programme doesn't understand its own purpose. A data-team standup focused on revenue attribution wastes everyone's time.

The pitfallThe quarterly update to the senior leadership team opened with pipeline success rates and refresh times. Eyes glazed; someone asked what any of it had to do with participant outcomes or commissioner confidence. The programme looked like it didn't understand its own purpose.

The practiceMatch the metric tier to the audience. Operational for the team, tactical for leadership, strategic for the board. Show directors revenue, risk, and decisions, not refresh times.

Match the tier to the audience. Operational for the team, tactical for leadership, strategic for the board.

8.4 · Proving return on investment

The hardest question a data programme faces is also the most important: was it worth it? The value chain below traces the path from spend to outcome through four links. The uncomfortable truth is that most programmes can prove the first link; they spent the money, and struggle with everything after it.

Working the chain honestly:

Investment is trivially measurable. You know what you spent on platform, people, governance, and tooling.

Capability is measurable with some effort. Trusted data, faster access, working self-service; these show up in the tactical metrics from section 8.3.

Behaviour is where attribution gets hard. Are more decisions actually being made with evidence? This requires instrumenting how decisions get made, which most organisations don't do. It is also where the value actually begins.

Outcome is where the value lands and where attribution is hardest. Revenue went up, but was it the data programme, the new product, the market, or the sales team? The honest answer is usually "some combination," and the discipline is to claim a defensible share rather than the whole thing or nothing.

The practical move is to instrument behaviour change, not just spend. Track decisions that data influenced. Capture the before-and-after of specific use cases. Build a portfolio of documented wins; each one a small, defensible story, rather than reaching for a single grand ROI number that no one believes. A credible collection of "this programme manager intervened earlier because of this insight, and here is what happened to that participant" beats a spurious precise figure every time.

The pitfallAsked to prove ROI, the team reached for one grand number: "£2.4M of efficiency savings." The CFO picked it apart in ten minutes, and the credibility of the whole programme went with it.

The practiceBuild a portfolio of documented wins, specific decisions made differently, with measured before-and-after. A defensible collection of small stories beats one spurious big number.

Attribution gets harder the closer you get to value. Instrument behaviour change, build a portfolio of documented wins, and claim a defensible share, not the whole thing, not nothing.

End of Part 8

That completes the eight parts of the playbook: Strategy, Governance, Architecture, Quality, Metadata & Privacy, AI & Analytics, People & Culture, and Measurement. What remains is the appendix, a consolidated reference of the frameworks, a glossary, and source attributions.

Reference

Glossary

Key terms used throughout the playbook. Where a term is covered in depth in a specific section, the section reference is noted.

CI/CD

Continuous Integration / Continuous Deployment. Automated pipelines that test and deploy code changes. Data contracts enforced here catch breaking changes before they reach production.

Data contract

A written, machine-enforced agreement between a data producer and its consumers. Covers schema, SLA, semantics, quality and ownership. See 4.4.

Data custodian

The role responsible for the technical reliability, security and availability of a dataset. Usually an IT or engineering background. Does not decide what data means. See 2.3.

Data fabric

An architectural approach that provides unified access to data across locations without moving it. Addresses the discovery and integration bottleneck. See 3.4.

Data mesh

An organisational paradigm where domain teams own their own data products end-to-end. Addresses the central-team bottleneck. See 3.4.

Data owner

The business leader accountable for what a dataset means, how it is defined, and how disputes about it are resolved. Must have business authority, not just technical access. See 2.3.

Data product

Any dataset, dashboard or analytical output treated as a product - with clear requirements, testing, versioning and ongoing ownership. At AKG: Certified, Indicative or Development. See 4.4.

Data steward

The person who maintains, monitors and enforces quality standards for a dataset on a day-to-day basis. Escalates decisions to the owner. See 2.3.

Dimensional layer

AKG's middle data layer. Defines how business concepts become measurable: facts, dimensions, valid combinations and standard calculations. Owned by the BI team. See 2.3.

Fact table

In a STAR schema, the central table that holds the numbers you measure (e.g. outcomes, starts). Surrounded by dimension tables that provide context.

GDPR

General Data Protection Regulation. EU law governing how personal data is collected, stored and used. Applies to any organisation handling EU residents' data. See 5.4.

Governance

The system that defines who can make what decisions about data. Not policies and documents - decision rights. If it does not change who decides, it is not governance. See Part 2.

Lakehouse

A storage architecture combining the flexibility of a data lake with the structure of a data warehouse. Addresses the storage cost and BI/ML split bottleneck. See 3.4.

Lineage

The documented chain from a data point back to its source. Enables root-cause analysis when numbers are wrong. Required for trustworthy AI. See 5.2.

LLM / RAG

Large Language Model / Retrieval-Augmented Generation. AI approaches that generate or retrieve answers from text. Require trusted, governed data to produce reliable outputs. See 6.1.

Metadata

Data about data. Includes technical (schema, types), business (definitions, owners), operational (pipeline runs, freshness) and social (endorsements, usage) layers. See 5.2.

MTTR

Mean Time to Resolve. The average time from detecting a data incident to resolving it. A key quality KPI. Shorter MTTR = higher trust. See 4.3.

NIST AI RMF

US National Institute of Standards and Technology AI Risk Management Framework. Govern-Map-Measure-Manage structure for AI governance. Widely adopted internationally. See 6.2.

Observability

The ability to detect, diagnose and prevent data quality incidents using automated monitoring. The three-step loop: detect, diagnose, prevent. See 4.3.

Physical layer

AKG's bottom data layer. The technical implementation in Snowflake: tables, columns, joins, storage. Implements the dimensional layer. Owned by BI and IT. See 2.3.

RACI

Responsible, Accountable, Consulted, Informed. A framework for mapping decision rights across roles. R does the work; A owns the outcome; C is consulted; I is kept informed. See 2.4.

SLA

Service Level Agreement. A commitment on data freshness, availability or quality. "Updated by 6am, 99.5% uptime." Enforced in data contracts, tracked as a quality KPI.

STAR schema

AKG's dimensional layer implementation in Snowflake. A central fact table surrounded by dimension tables. Fast for reporting, readable, and enforces the separation of facts from context. See 2.3.

The Decision Test

One question asked before building anything: "What would we do differently if we had this data?" If no one can answer, do not build it. The filter that prevents data programmes from collecting cost. See Interlude.