This document provides an overview of key concepts in data science and big data, including:
- Data science involves extracting knowledge and insights from structured, semi-structured, and unstructured data.
- The data value chain describes the process of acquiring data, analyzing it, curating it for storage, and using it.
- Big data is characterized by its volume, velocity, variety, and veracity. Hadoop is an open-source framework that allows distributed processing of large datasets across computer clusters.