Choose Your Reading Style
A professional-level summary covering key definitions, frameworks, and exam-relevant points.
Metadata Management: Professional Framework
Metadata management is the discipline of creating, maintaining, and governing the metadata that describes an organisation's data assets. It is one of the highest-weighted knowledge areas in the CDMP exam (approximately 11%), reflecting its foundational role in enabling data discovery, data lineage, data governance, and data quality management.
The Three Metadata Types
| Type | Audience | Examples | Primary Use |
|---|---|---|---|
| Business Metadata | Business users, data owners | Definitions, business rules, data ownership, usage policies | Data understanding and governance |
| Technical Metadata | IT, data engineers, architects | Schema, data types, table relationships, system locations | Data integration and system management |
| Operational Metadata | Data operations, monitoring teams | Load timestamps, record counts, error logs, process durations | Data pipeline monitoring and troubleshooting |
Data Lineage
Data lineage is a specific category of metadata that documents the origin of data and all the transformations it has undergone as it moves through systems. It answers the question: "Where did this data come from, and what happened to it along the way?" Lineage is critical for regulatory compliance (particularly BCBS 239 in banking), impact analysis, and root cause analysis of data quality issues.
Data Catalog
A data catalog is the primary tool for implementing metadata management at scale. It provides a searchable inventory of data assets, their associated metadata, data lineage, data quality scores, and collaboration features. Modern data catalogs (Collibra, Alation, Microsoft Purview, Apache Atlas) use automated metadata harvesting to reduce the manual effort of metadata capture.
CDMP Exam Focus
Key exam topics include: the three types of metadata and their distinctions, the role of metadata in data governance and data quality, the concept and components of data lineage, the purpose and features of a data catalog, and the relationship between metadata management and other DMBOK knowledge areas.