BIG DATA
Delivered by
Seema Navaghare B.
Sheetal Mahagade D.
Guided by
Titare S.I.
 Introduction
 Characteristics
 Architecture
 Challenges
 Types of Big Data
 Applications
 Benefits
 Conclusion
 Definition
Data: Beyond the storage capacity and beyond
the processing power.
 Big Data includes
• Social media Data
• Stock exchange Data
• Power grid Data
• Transport Data
• Search engine Data
 Volume
 Variety
 Velocity
 Variability
 Value
Data
ingest
staging
processing
Dataworkflow
management
Access
Insight
Hadoop framework
Physical H/w
Value of Data
Data pipeline
 Data Resource Identification
 Data Ingestion
 Data Staging
 Hadoop framework (Data processing)
 Data pipeline
 Data workflow management
 Physical H/w
Data ingestion
Event ingestion
Batch
Ingestion
Operational
system 1
Operational
system 2
Flat files
Staging Area
Wharehouse
Staging
Database
Metadata Aggregate
data
Raw Data
HDFS
Map Reduce Algorithm
Big Data
Landing
Zone
Keyword
research
Content
classification
User
Segmentation
Pptbig data4
Receive
data
Verify
data
Transform Load
Report
error
Valid?
 Data analysis
 Data curation
 Search
 Storage
 Information privacy
 Sharing
 Structured
 Semi-structured
 Unstructured
 Healthcare
 Public sector
 Education
 Banking
 Industry
 Fully understanding the potential of data-driven
marketing
 Improving customer engagement and increasing
customer loyalty
 Reevaluating risk portfolios quickly
 Personalizing the customer experience
 Adding value to online and offline customer
interactions
The availability of Big Data ,low-cost commodity h/w,
And new information management and analytic s/w
Have produced a unique moment in the history of Data
analysis.
The Convergence of these trends means that we have
The capabilities required to analyze astonishing data
sets quickly and cost effectively for the first time in
history
[1].S. Madden From Databases to Big Data IEEE
Internet Computing, 16 (2012 June), pp.4-6
[2].Apache Software Foundation. Official Website
www.Apache.hadoop.org.
[3].Jeffrey Dean and Sanjay Chemawat,“MapReduce:
Simplified Data Processing On Large Clusters”,
CACM Jan. 2008 (PDF).
Thank you…..

More Related Content

PDF
Consumer Data Management
PPTX
1 PSUT Big Data Class, introduction
PDF
Supporting Data Services Marketplace using Data Virtualization
PDF
A Dynamic Data Catalog for Autonomy and Self-Service
PDF
Data Services Marketplace
ODP
AtlasCHUG
PDF
Cortana Analytics Workshop: Azure Data Catalog
PDF
xGem BigData
Consumer Data Management
1 PSUT Big Data Class, introduction
Supporting Data Services Marketplace using Data Virtualization
A Dynamic Data Catalog for Autonomy and Self-Service
Data Services Marketplace
AtlasCHUG
Cortana Analytics Workshop: Azure Data Catalog
xGem BigData

What's hot (15)

PDF
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
PDF
2. Smart Data Discovery
PPTX
Tamr gartner bi and analytics summit
PDF
Dallas Data Brewery - introduction
PDF
Building A Self Service Analytics Platform on Hadoop
PPTX
Big data analytics - Introduction to Big Data and Hadoop
PDF
Large Scale Data Analytics
PPTX
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
PPTX
You Need a Data Catalog. Do You Know Why?
PDF
Big Data Analytics in Bangladesh | Pridesys IT Ltd
PDF
Big dataservicesatfidel
PDF
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
PPTX
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
PDF
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Data Catalog in Denodo Platform 7.0: Creating a Data Marketplace with Data Vi...
2. Smart Data Discovery
Tamr gartner bi and analytics summit
Dallas Data Brewery - introduction
Building A Self Service Analytics Platform on Hadoop
Big data analytics - Introduction to Big Data and Hadoop
Large Scale Data Analytics
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
You Need a Data Catalog. Do You Know Why?
Big Data Analytics in Bangladesh | Pridesys IT Ltd
Big dataservicesatfidel
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Bio Data World - The promise of FAIR data lakes - The Hyve - 20191204
Ad

Similar to Pptbig data4 (20)

PPTX
Big Data
PPTX
Special issues on big data
PPTX
Big_Data_ppt[1] (1).pptx
PPTX
Big Data ppt
PPTX
Big data
PDF
Bigdatappt 140225061440-phpapp01
PDF
Big data and analytics
PPTX
bigdata.pptx
PPTX
Big Data Analytics with Hadoop
PPTX
big-data-8722-m8RQ3h1.pptx
PPTX
bigdatappt.pptx
PDF
big data analytics introduction chapter 1
PPT
big data
PPTX
data science unit 2 bigdata introduction .pptx
PPTX
ppt final.pptx
PPTX
Big data
PPTX
Big data ppt
PDF
PPTX
Big-Data-Seminar-6-Aug-2014-Koenig
Big Data
Special issues on big data
Big_Data_ppt[1] (1).pptx
Big Data ppt
Big data
Bigdatappt 140225061440-phpapp01
Big data and analytics
bigdata.pptx
Big Data Analytics with Hadoop
big-data-8722-m8RQ3h1.pptx
bigdatappt.pptx
big data analytics introduction chapter 1
big data
data science unit 2 bigdata introduction .pptx
ppt final.pptx
Big data
Big data ppt
Big-Data-Seminar-6-Aug-2014-Koenig
Ad

Recently uploaded (20)

PPTX
Management Information system : MIS-e-Business Systems.pptx
PDF
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PPTX
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
PPT
Total quality management ppt for engineering students
PPTX
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
PDF
Soil Improvement Techniques Note - Rabbi
PDF
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
PDF
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
PPTX
Software Engineering and software moduleing
PPTX
Feature types and data preprocessing steps
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
PDF
August 2025 - Top 10 Read Articles in Network Security & Its Applications
PDF
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PPT
INTRODUCTION -Data Warehousing and Mining-M.Tech- VTU.ppt
PPTX
communication and presentation skills 01
PDF
737-MAX_SRG.pdf student reference guides
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PDF
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
Management Information system : MIS-e-Business Systems.pptx
UNIT no 1 INTRODUCTION TO DBMS NOTES.pdf
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
tack Data Structure with Array and Linked List Implementation, Push and Pop O...
Total quality management ppt for engineering students
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
Soil Improvement Techniques Note - Rabbi
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
EXPLORING LEARNING ENGAGEMENT FACTORS INFLUENCING BEHAVIORAL, COGNITIVE, AND ...
Software Engineering and software moduleing
Feature types and data preprocessing steps
distributed database system" (DDBS) is often used to refer to both the distri...
August 2025 - Top 10 Read Articles in Network Security & Its Applications
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
Fundamentals of safety and accident prevention -final (1).pptx
INTRODUCTION -Data Warehousing and Mining-M.Tech- VTU.ppt
communication and presentation skills 01
737-MAX_SRG.pdf student reference guides
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...

Pptbig data4