Here are the answers to the assignment questions:
1. Big data refers to huge volumes of both structured and unstructured data that is so large in size and complex that traditional data processing applications are inadequate to deal with it.
2. The three main types of data are:
- Structured data: Data that is organized and has a predefined data model e.g. numbers in a database. Sources include CRM systems, transactions etc.
- Semi-structured data: Data that has some structure but not fully structured e.g. log files, XML files. Sources include sensors, images, audio/video etc.
- Unstructured data: Data with no predefined structure e.g. text, emails. Sources include