An Introduction to Big Data
Concept & Ecosystem
What is Big Data?
Volume
• 100s of TBs
• PB scale
• Too big for traditional transaction processing
Velocity
• Distributed, Parallel Processing
Variety
• Structured & unstructured content
Veracity
• Trustworthiness, Reliability
Drivers for Big Data Adoption
• Commodity Hardware Support
• Open Source Ecosystem
• Web Economy
• Reduced Storage Costs
Sources of Big Data
• Archives
• Documents
• Media
• Business Apps, Data Storage
• Public Web
• Social Media
• Machine/Sensor Data
Where is it used?
• Trends
• Patterns
• Predictions
Usage Scenarios
What we do
• Activities
• Conversations
• Social Media
• Photographs
• Videos
• Transactions
What big data does
• Text Analysis
• Speech Analysis
• Sentiment Analysis
• Spending Analysis
• Geographical Analysis
Working with Big Data
• Data Source / Ingestion
• Data Storage
• Data Processing / Transformation
• Data Analysis & Output
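The four stages above can be sketched as a chain of small functions. This is a minimal single-machine illustration, not how a real cluster does it: the sample records, the function names, and the in-memory list standing in for a distributed store are all assumptions made for the example.

```python
import json
from collections import defaultdict

# Hypothetical raw input, standing in for a data source such as an application log.
RAW_LINES = [
    '{"user": "ann", "amount": "12.50"}',
    '{"user": "bob", "amount": "7.25"}',
    'not valid json',
    '{"user": "ann", "amount": "3.00"}',
]


def ingest(lines):
    """Data source / ingestion: parse raw lines and skip malformed ones."""
    for line in lines:
        try:
            yield json.loads(line)
        except json.JSONDecodeError:
            continue


def store(records):
    """Data storage: an in-memory list stands in for a distributed store."""
    return list(records)


def transform(records):
    """Data processing / transformation: convert string amounts to numbers."""
    return [{"user": r["user"], "amount": float(r["amount"])} for r in records]


def analyze(records):
    """Data analysis & output: total amount per user."""
    totals = defaultdict(float)
    for r in records:
        totals[r["user"]] += r["amount"]
    return dict(totals)


def run_pipeline(lines):
    """Chain the four stages in the order listed on the slide."""
    return analyze(transform(store(ingest(lines))))


if __name__ == "__main__":
    print(run_pipeline(RAW_LINES))
```

Each stage only depends on the output of the one before it, which is what lets real systems swap in a different tool per stage.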
Hadoop
• Combination of MapReduce engine and HDFS
• Shift of responsibilities for availability & distribution
• Brings processing closer to the data
Hadoop Eco-System
• Apache Hadoop: MapReduce, HDFS
• HBase, Cassandra: Database
• Hive, Pig: Structured Queries
• Sqoop: RDBMS Connectivity
• Mahout: Machine Learning / Data Mining
MapReduce
Input
Map
• Key
• Value
Reduce
• Aggregate
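The key/value flow above can be sketched in a few lines of plain Python. This is a single-process illustration under stated assumptions, not Hadoop's API: `map_reduce`, `emit_reading`, `max_temp`, and the temperature readings are hypothetical names and data chosen for the example.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run one map -> group-by-key -> reduce pass in a single process."""
    groups = defaultdict(list)
    for record in records:
        # Map: each input record yields zero or more (key, value) pairs.
        for key, value in mapper(record):
            # Group: collect every value that shares a key.
            groups[key].append(value)
    # Reduce: aggregate each key's values into one result.
    return {key: reducer(key, values) for key, values in groups.items()}


# Hypothetical input: (city, temperature) readings.
readings = [("Oslo", 3), ("Cairo", 31), ("Oslo", 7), ("Cairo", 28)]


def emit_reading(record):
    """Mapper: use the city as the key and the temperature as the value."""
    city, temperature = record
    yield city, temperature


def max_temp(key, values):
    """Reducer: keep the highest temperature seen for a city."""
    return max(values)


hottest = map_reduce(readings, emit_reading, max_temp)
print(hottest)
```

On a cluster, the map and reduce calls would run on many machines and the grouping step would move data between them; the structure of the computation stays the same.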
MapReduce: Word Count Example
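The slide images for this example did not survive extraction, so what follows is a sketch of the usual word count formulation rather than the deck's own walkthrough. It runs in one process; the function names and the sample lines are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_words(line):
    """Map step: emit (word, 1) for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group the emitted counts by word."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_counts(word, counts):
    """Reduce step: sum the counts collected for one word."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce over an iterable of text lines."""
    mapped = chain.from_iterable(map_words(line) for line in lines)
    return dict(reduce_counts(word, counts) for word, counts in shuffle(mapped).items())


# Hypothetical input split into lines, as a file would be split into records.
lines = ["the quick brown fox", "the lazy dog", "The fox"]
counts = word_count(lines)
print(counts)
```

Because each line is mapped independently and each word is reduced independently, both steps can be spread across machines without changing the result.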
Hive
• Started as a sub-project of Hadoop
• Now a top-level Apache project
• Provides an SQL-like abstraction layer over MapReduce
• Has its own HDFS table file format (and it is fully schema-bound)
• Can also work over HBase
• Acts as a bridge to many BI products that expect tabular data
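To show what an SQL-like layer buys, the sketch below expresses a word count as a single declarative GROUP BY query. It uses Python's built-in sqlite3 module purely as a stand-in so the example runs anywhere; it is not Hive, and the `words` table and its contents are hypothetical. HiveQL supports similar GROUP BY aggregations, which Hive then runs as jobs over the cluster.

```python
import sqlite3

# sqlite3 is only a local stand-in for an SQL engine; this is not Hive.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (word TEXT)")

# Hypothetical sample data: one row per word occurrence.
sample = "the quick fox the lazy dog the fox".split()
conn.executemany("INSERT INTO words (word) VALUES (?)", [(w,) for w in sample])

# One declarative query replaces hand-written map and reduce functions.
rows = conn.execute(
    "SELECT word, COUNT(*) AS n FROM words GROUP BY word ORDER BY n DESC, word"
).fetchall()
conn.close()

for word, n in rows:
    print(word, n)
```

The result comes back as rows and columns, which is the tabular shape that BI tools expect.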
Big Data + NoSQL
CAP Theorem
• Consistency
• Availability
• Partition Tolerance
NoSQL
Relational
NoSQL
• Key-Value Stores: Amazon DynamoDB, Redis
• Document Stores: MongoDB
• Graph Databases: Neo4j
• Wide Column / Column Family: HBase
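The four NoSQL families differ mainly in the shape of the data they store. The sketch below mimics each shape with plain Python structures; it does not use the client API of any product named above, and every key, field, and helper name in it is made up for illustration.

```python
# Key-value store: an opaque value looked up by a single key.
key_value = {"session:42": "ann"}

# Document store: self-describing nested records that can differ in shape.
documents = [
    {"_id": 1, "name": "Ann", "tags": ["admin", "dev"]},
    {"_id": 2, "name": "Bob"},
]

# Wide column / column family: row key -> column family -> column -> value.
wide_column = {
    "user:1": {
        "profile": {"name": "Ann"},
        "activity": {"last_login": "2014-01-01"},
    },
}

# Graph database: nodes connected by labeled edges.
edges = [("ann", "FOLLOWS", "bob"), ("cara", "FOLLOWS", "bob")]


def find_by_tag(tag):
    """Document-style query: match on a field that only some records have."""
    return [doc["name"] for doc in documents if tag in doc.get("tags", [])]


def followers_of(person):
    """Graph-style query: walk FOLLOWS edges that point at `person`."""
    return [src for src, rel, dst in edges if rel == "FOLLOWS" and dst == person]


print(key_value["session:42"])
print(find_by_tag("admin"))
print(wide_column["user:1"]["profile"]["name"])
print(followers_of("bob"))
```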
Hadoop Distros
• Cloudera
• Hortonworks
• MapR
• IBM InfoSphere BigInsights
Hadoop In Clouds
• Amazon EMR
• Microsoft HDInsight
• Google Cloud Platform
• Apache Whirr
Additional Information
To learn more about big data and its ecosystem, get in touch with us.
training@forwardsprint.com
www.forwardsprint.com
Thank you!
