What Is Data Management?
Data management is the development, execution, and supervision of plans, policies, programs, and practices that deliver, control, protect, and enhance the value of data and information assets throughout their lifecycles. This definition, from the DAMA Data Management Body of Knowledge (DMBOK v2), captures the essential nature of data management: it is not a single activity but a comprehensive discipline that spans the entire life of data — from creation to archival or deletion.
In practical terms, data management encompasses everything from designing the databases that store data, to governing who can access it, to ensuring its quality, to making it discoverable and understandable across the organisation. It is the discipline that transforms raw data into a trusted, valuable enterprise asset.
Why Is Data Management Important?
The case for structured data management has never been stronger. Organisations today generate and consume data at unprecedented scale — IDC estimates that the global datasphere will reach 175 zettabytes by 2025. But volume alone does not create value. Data only creates value when it is:
- Accurate — correctly representing the real world
- Accessible — available to those who need it, when they need it
- Understandable — with agreed definitions and context
- Trustworthy — with known provenance and quality
- Secure — protected from unauthorised access and misuse
- Compliant — managed in accordance with regulatory requirements
Achieving all of these properties simultaneously requires the systematic, disciplined approach that data management provides.
The DAMA DMBOK v2: The Definitive Data Management Reference
The DAMA Data Management Body of Knowledge (DMBOK v2), published in 2017, is the most comprehensive and widely adopted reference framework for data management. It defines 11 knowledge areas (expanded to 14 in the CDMP exam framework) and provides detailed guidance on the activities, roles, tools, and best practices within each area.
The DMBOK v2 is the primary reference for the CDMP certification exam, making it essential reading for anyone pursuing the credential. But its value extends far beyond exam preparation — it is a practical guide for building and improving data management capabilities in any organisation.
The DAMA Wheel
The DAMA Wheel is the iconic visual representation of the DMBOK framework. It shows Data Governance at the centre — the hub that connects and coordinates all other data management disciplines — surrounded by the 10 knowledge areas as spokes: Data Architecture, Data Modeling, Data Storage and Operations, Data Security, Data Integration and Interoperability, Document and Content Management, Reference and Master Data, Data Warehousing and Business Intelligence, Metadata Management, and Data Quality Management.
The positioning of Data Governance at the centre is deliberate and significant: governance is not just another knowledge area — it is the overarching framework that provides the authority, decision rights, and accountability structures within which all other data management activities operate.
The 14 CDMP Knowledge Areas
The CDMP exam framework expands the DMBOK's 10 knowledge areas to 14 by treating Data Ethics, Big Data and Data Science, Data Lifecycle Management, and the Data Management Process as distinct areas. Here is a brief overview of each:
Data Governance
The framework of authority, decision rights, accountability, and control over data assets. Includes the Data Governance Council, data stewardship, policies, standards, and the CDO role.
Data Quality Management
The processes and practices for measuring, monitoring, and improving data quality across the six dimensions (Accuracy, Completeness, Consistency, Timeliness, Validity, Uniqueness).
Data Modeling and Design
The discipline of creating conceptual, logical, and physical data models that represent the structure and relationships of data. Includes entity-relationship modeling, normalisation, and dimensional modeling.
Metadata Management
The management of data about data — business metadata (definitions, ownership), technical metadata (formats, lineage), and operational metadata (usage statistics, access logs).
Master Data Management
The processes and technologies for creating and maintaining a single, authoritative version of key business entities (customers, products, locations) — the "golden record".
Data Warehousing and Business Intelligence
The design, construction, and operation of data warehouses and BI systems that support analytical decision-making. Includes dimensional modeling, ETL/ELT, and OLAP.
Data Architecture
The design of the overall data landscape — how data is structured, stored, integrated, and used across the enterprise. Includes enterprise data models and architecture frameworks.
Data Security
The policies, procedures, and controls that protect data from unauthorised access, misuse, and loss. Includes access control models, encryption, data masking, and regulatory compliance.
Data Integration and Interoperability
The processes and technologies for moving, transforming, and synchronising data between systems. Includes ETL, CDC, data virtualisation, and API integration.
Data Storage and Operations
The management of physical and virtual data storage — databases, file systems, cloud storage — including backup, recovery, archiving, and performance optimisation.
Reference and Master Data
The management of reference data (code sets, lookup tables, hierarchies) that provides the shared context for understanding and interpreting master and transactional data.
Document and Content Management
The management of unstructured data — documents, emails, images, videos — including content lifecycle management, records management, and retention schedules.
Data Ethics
The ethical principles and frameworks that govern the responsible use of data, including algorithmic bias, privacy ethics, and the distinction between legal compliance and ethical behaviour.
Big Data and Data Science
The management of large-scale, high-velocity, and varied data sets, and the governance of data science and machine learning activities within the organisation.
Data Management Maturity
Organisations do not implement all data management capabilities at once — they mature over time. The CMMI Data Management Maturity (DMM) model provides a framework for assessing and improving data management maturity across five levels:
- Initial: Ad hoc, reactive, no formal processes
- Repeatable: Some processes defined, inconsistently applied
- Defined: Standardised processes documented and consistently applied
- Managed: Processes measured and controlled with quantitative metrics
- Optimising: Continuous improvement driven by data and innovation
Most organisations sit at Level 1 or 2. The goal of a data management programme is to systematically advance maturity, prioritising the capabilities that deliver the most business value.