C
CDMP Master
Academy

Join CDMP Master Academy

Free to join · No card required

Sign Up Sign In
Home/Blog/What is Data Lineage? Definition, Types & Why It Matters
Data Management

What is Data Lineage? Definition, Types & Why It Matters

Data lineage explained: what it is, why organisations need it, the difference between technical and business lineage, and how it appears in the CDMP exam.

CDMP Master Academy·19 February 2025·9 min read

Choose Your Reading Style

A professional-level summary covering key definitions, frameworks, and exam-relevant points.

DMBOK Context

Data lineage is classified as a form of technical metadata in the DAMA DMBOK v2. It is managed within the Metadata Management knowledge area (11% CDMP weight) and is also relevant to Data Integration and Interoperability (6% weight). Lineage documentation is a core deliverable of any metadata management programme.

Lineage Types

TypeAudienceGranularityFocus
Technical LineageData engineers, IT teamsColumn-level, system-levelETL jobs, transformations, database objects
Business LineageAnalysts, stewards, complianceReport-level, process-levelBusiness processes, KPIs, regulatory reports

Use Cases

The primary use cases for data lineage are: impact analysis (understanding what will break if a source system changes), root cause analysis (tracing a data quality issue back to its source), regulatory compliance (demonstrating to regulators how personal data flows through the organisation), data trust (enabling consumers to verify the origin and transformation history of data), and migration planning (understanding dependencies before moving systems).

Lineage vs Provenance

The CDMP exam distinguishes between lineage (the movement and transformation history of data) and provenance (the origin, ownership, and authority under which data was created). Both are forms of technical metadata, but they answer different questions: lineage answers "where has this data been?"; provenance answers "where did this data come from and who is responsible for it?"

Premium Reading Styles — Locked

You've read this topic 3 ways. Premium unlocks 3 more.

Simple, Analogy, and Overview give you the foundation. Premium members get Memory Hack, Exam Focus, and Deep Dive — the three styles that actually make you score 95%+ on the CDMP exam.

PREMIUM
Memory Hack
Stick it forever

Vivid mnemonics, acronyms, and mental shortcuts that make every DMBOK concept impossible to forget — even under exam pressure.

e.g. "Data Governance is a COUNTRY — Constitution, Operations, Undertakings, Nations, Treaties, Rights, Yields"
Unlock to read →
PREMIUM
Exam Focus
Know what's tested

Exactly which concepts appear in CDMP questions, how they are phrased, the most common wrong-answer traps, and the precise wording examiners use.

e.g. "CDMP tests the DIFFERENCE between data stewardship and data ownership — here's the exact line"
Unlock to read →
PREMIUM
Deep Dive
Master every detail

Exhaustive breakdowns with real-world examples, edge cases, cross-topic connections, and practitioner-level nuance that separates 70% scorers from 95%+ scorers.

e.g. "Why the 6 Data Quality dimensions overlap — and how CDMP questions exploit that overlap"
Unlock to read →

Premium also includes:

Sim Office
50 realistic workplace tasks — apply DMBOK to real data management scenarios
Interview Prep
10 DMBOK interview tracks with STAR-format model answers
12 Simulation Exams
60 questions each, timed, with full explanations for every option
Per-Topic Quizzes
Mastery quizzes after every topic so you know your gaps before exam day
95%+ Score Target
Our methodology is built around elite scores, not just passing
All 21 DMBOK Topics
Complete coverage of every DAMA DMBOK v2 knowledge area
CDMP Score Levels:
Associate60–69%
Practitioner70–79%
Master80–89%
Elite (Our Target)90–100%

Cancel anytime · Instant access · 95%+ score methodology

Share this article