Choose Your Reading Style
A professional-level summary covering key definitions, frameworks, and exam-relevant points.
The 5 Vs of Big Data
| V | Definition | Management Challenge |
|---|---|---|
| Volume | Massive scale (TB to PB) | Distributed storage; cost-effective infrastructure |
| Velocity | High speed of generation and processing | Real-time and streaming processing capabilities |
| Variety | Diverse formats (structured, semi-structured, unstructured) | Flexible storage; schema-on-read approaches |
| Veracity | Uncertainty and trustworthiness of data | Data quality management; validation at ingestion |
| Value | Business insights derivable from the data | Analytics capabilities; data science; BI |
CDMP Exam Relevance
Big Data & Data Science is covered in the CDMP exam (approximately 6% weight). Key exam topics include: the 5 Vs and their definitions, the difference between batch and streaming processing, the role of data governance in big data environments, the characteristics of NoSQL databases vs relational databases, and the governance challenges specific to big data (quality, lineage, privacy, metadata). Understanding big data is also important for Data Architecture and Data Integration questions.