SlideShare a Scribd company logo
Understanding
Big Data
[ A Business Perspective ]
by
Shiva Dasharathi
1: Big Data ?
3:
How
different it
is from BI,
HANA etc. ?
2: Benefits ?
4:
Infrastructure?
hadoop?
Nosql?
5:
Challenges in Migrating to big data?
1: Big Data ?
What is big data?
Terminology
Big Data evolution
Anatomy of usecases
Big Data
“Any Data that is worth analysing”
Variety
of data
Real time analysis : for,
- Feedbacks, complaints about your
products & services in social media
-Capturing Customer behaviour
-Predicting things before they
happen
Velocity
&
Volume of data
Big Data
Characteristics of big data:
1. Volume of data
2. Velocity of Data
3. Variety of Data
* Difficult to handle in traditional ways
Complex
Big Data > Terminology
Structured data:
NAME NATURE TAG
Shiva Thinking -- Forgetful Philosophy
Shravanthi Innocent – Sensitive -- Journalist Champion
Subhash Artistic -- Descretive Champion
Shreyas Logical -- Passionate Champion
Adithya Logical -- Articulative Champion
Pallavi Outspeaking -- Friendly Champion
Sonal Dancer -- Sportive Champion
Nikhil Poetic -- Sportive Champion
Gayathri Prude -- Honest Champion
Anitha Blesser -- Gentle Champion
Amandeep Aesthetic -- Independent Champion
Malathi un known Champion
Ankitha Cricketer -- Gentle Champion
Vikram Logical -- Articulative Champion
Ankesh Honest – Passionate Champion
Tejo Managerial – Patience Leading
CJ Quick – Logical Champion
Deba Dedication – Honest Champion
Charu Managerial – Independent Leading
Ashok Social – Helping Leading
Sijesh Analytical – Eager -- Helping Balanced Leading
Tarun Shrewed – Responsive Leading
Pavan Optimization – Shrewed Administrative
Bhargav Balancing – Friendly Champion
Surya Enthusiasm – Learning Versatile
Swarnav Social – Outspeaking Administrative
Bidisha Outspeakiing – Social Administrative
Niranjan Articulative -- Friendly Leading
Understanding big data,  a business perspective
Understanding big data,  a business perspective
Big Data evolution
Commodity/Cheap Hardware
Open source software
Valuable Data
Data mining / Data analysis
using statistical modelling
techniques
Health care
Banking
Energy & Utilities
Telecom
Supply chain
Retail
Realestate
Agriculture
Sustainability
etc..
Social networking
web sites
Search engines
Job portals
News portals
Travel
Recommendation
Apps
Online movie stores
Animation industry
etc..
Space research
Bio research
Image
processing
etc..
Enterprise Analytics
Social media Apps &
Analytics
Research Oriented
Fields
Big Data > Anatomy of usecases?
2: Benefits ?
Advanced Predictions
use of predictions
2: Benefits ?
Advanced Predictions
Source 1
Source 2
Source 3
Knowledge
Models
Predictions:
Revenue / spend
forecasts;
Sentiment
analysis;
Customer
Behaviour analysis
etc..
Data mining techniques
2: Benefits ?
use of predictions
- Helps to understand & optimize the
complex business processes
- predict opportunities / risks
- Understand strengths / weaknesses
- Optimizing resource usages
*At much cheaper costs.
3:
How different
it is from
traditional
DBs?
Big Data Vs RDBMS
Big Data Vs BW, HANA
Big Data Vs RDBMS
. Big Data tools ->
- More for data Analysis
- Lack Transaction system capabilities
- Do not fully comply with A C I D properties
- Doesn’t allow to apply constraints at data level
* Do not compete with OLTP systems
Big Data Vs BW (or) HANA
- Limited by Scalability (vertical scaling)
-Can not deal with unstructured data
-Not fault tolerant
- Not well integrated with open source
analytical / data mining tools
4.
Infrastructure?
hadoop?
Nosql?
What is NOSQL
What is Hadoop
What is Hadoop
A distributed, parallel, data processing system (a type of
NOSQL)
map
map
map
map
reduce
reduce
Storage : HDFS
Programming api: Mapreduce
Well established Eco system:
Hive, Pig, Hbase, Sqoop, oozy,
Flume, Mahout, zookeeper etc…
Input files
Output files
What is NOSQL
Nosql -> Not Only SQL
Highly Available
Distributed
Fault tolerant
Schema less
NOSQL DBs:
Key Value based: Hadoop
Columnar based: Hbase, Cassandra
Graph based: Neo4j, Orient DB
Document based: Mango DB, Couch DB
5:
Challenges ? What to consider ?
What data your organization has? Size & Sources?
What Analytical usecases to be implemented ?
What infrastructure you have?
Which hardware to buy? What configuration?
What NOSQL DBs you need ? Which vendor to approach?
Migration Plan?
1. What is Big Data?
2. How will I benefit?
3. I don’t have huge data, should I still consider big
data?
4. I already have BI or HANA etc. setup, Can I leverage
them?
5. What challenges may I face?
6. How much can I Save 
(Questions Covered in the session)
Thank You 
@shivadasharathi

More Related Content

PPTX
Big Data Hadoop
ODP
Big Data Analytics - Introduction
PDF
Lecture3 business intelligence
PPTX
Big Data Analytics
PDF
01. Introduction to Data Mining and BI
PPTX
Combining Data Mining and Machine Learning for Effective User Profiling
PPT
Data analytics & its Trends
PDF
Lecture1 introduction to big data
Big Data Hadoop
Big Data Analytics - Introduction
Lecture3 business intelligence
Big Data Analytics
01. Introduction to Data Mining and BI
Combining Data Mining and Machine Learning for Effective User Profiling
Data analytics & its Trends
Lecture1 introduction to big data

What's hot (20)

PPTX
Unit i big data introduction
PDF
Introduction to BigData
PPTX
BIG DATA and USE CASES
PPT
Future of Data - Big Data
PPT
introduction to data mining tutorial
PDF
Data Mining and Business Intelligence Tools
PPTX
Introduction to BIG DATA
PDF
Lect 1 introduction
PDF
Big data analytics, research report
PPTX
Introduction to Big Data
PPT
Introduction to Data Mining
PPTX
Lect 1 introduction
PPTX
Data mining
PPTX
introduction to data warehousing and mining
PPT
Data mining 1
PDF
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
PPTX
Bp presentation business intelligence and advanced data analytics september ...
PPTX
Information Technology Data Mining
Unit i big data introduction
Introduction to BigData
BIG DATA and USE CASES
Future of Data - Big Data
introduction to data mining tutorial
Data Mining and Business Intelligence Tools
Introduction to BIG DATA
Lect 1 introduction
Big data analytics, research report
Introduction to Big Data
Introduction to Data Mining
Lect 1 introduction
Data mining
introduction to data warehousing and mining
Data mining 1
Big Data vs Data Science vs Data Analytics | Demystifying The Difference | Ed...
Bp presentation business intelligence and advanced data analytics september ...
Information Technology Data Mining
Ad

Similar to Understanding big data, a business perspective (20)

PPTX
BigDataFinal.pptx
PPTX
Presentation on Big Data
PPT
big data
PPTX
big-data-8722-m8RQ3h1.pptx
PPTX
BigData.pptx
PPTX
Big_Data_ppt[1] (1).pptx
PPTX
Kartikey tripathi
PDF
Bigdatappt 140225061440-phpapp01
PPTX
Big data analytics - Introduction to Big Data and Hadoop
PPTX
Big Data : a 360° Overview
PPTX
Big-Data-Seminar-6-Aug-2014-Koenig
PPTX
Introduction to Big Data
PPTX
Big Data
PPTX
Special issues on big data
PPTX
Big Data_Big Data_Big Data-Big Data_Big Data
PPTX
Big data Analytics
PPTX
Big Data ppt
PDF
All About Big Data
PPTX
Big data
BigDataFinal.pptx
Presentation on Big Data
big data
big-data-8722-m8RQ3h1.pptx
BigData.pptx
Big_Data_ppt[1] (1).pptx
Kartikey tripathi
Bigdatappt 140225061440-phpapp01
Big data analytics - Introduction to Big Data and Hadoop
Big Data : a 360° Overview
Big-Data-Seminar-6-Aug-2014-Koenig
Introduction to Big Data
Big Data
Special issues on big data
Big Data_Big Data_Big Data-Big Data_Big Data
Big data Analytics
Big Data ppt
All About Big Data
Big data
Ad

Recently uploaded (20)

PDF
Reconciliation AND MEMORANDUM RECONCILATION
PDF
NISM Series V-A MFD Workbook v December 2024.khhhjtgvwevoypdnew one must use ...
DOCX
unit 2 cost accounting- Tender and Quotation & Reconciliation Statement
PPTX
2025 Product Deck V1.0.pptxCATALOGTCLCIA
PPTX
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
PDF
Tata consultancy services case study shri Sharda college, basrur
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PPT
Chapter four Project-Preparation material
PDF
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PPTX
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
PDF
COST SHEET- Tender and Quotation unit 2.pdf
PDF
Roadmap Map-digital Banking feature MB,IB,AB
PPTX
Principles of Marketing, Industrial, Consumers,
PDF
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
PPTX
job Avenue by vinith.pptxvnbvnvnvbnvbnbmnbmbh
PDF
How to Get Business Funding for Small Business Fast
PDF
Digital Marketing & E-commerce Certificate Glossary.pdf.................
PDF
Deliverable file - Regulatory guideline analysis.pdf
PPTX
ICG2025_ICG 6th steering committee 30-8-24.pptx
Reconciliation AND MEMORANDUM RECONCILATION
NISM Series V-A MFD Workbook v December 2024.khhhjtgvwevoypdnew one must use ...
unit 2 cost accounting- Tender and Quotation & Reconciliation Statement
2025 Product Deck V1.0.pptxCATALOGTCLCIA
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
Tata consultancy services case study shri Sharda college, basrur
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
Chapter four Project-Preparation material
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
COST SHEET- Tender and Quotation unit 2.pdf
Roadmap Map-digital Banking feature MB,IB,AB
Principles of Marketing, Industrial, Consumers,
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
job Avenue by vinith.pptxvnbvnvnvbnvbnbmnbmbh
How to Get Business Funding for Small Business Fast
Digital Marketing & E-commerce Certificate Glossary.pdf.................
Deliverable file - Regulatory guideline analysis.pdf
ICG2025_ICG 6th steering committee 30-8-24.pptx

Understanding big data, a business perspective

  • 1. Understanding Big Data [ A Business Perspective ] by Shiva Dasharathi
  • 2. 1: Big Data ? 3: How different it is from BI, HANA etc. ? 2: Benefits ? 4: Infrastructure? hadoop? Nosql? 5: Challenges in Migrating to big data?
  • 3. 1: Big Data ? What is big data? Terminology Big Data evolution Anatomy of usecases
  • 4. Big Data “Any Data that is worth analysing” Variety of data
  • 5. Real time analysis : for, - Feedbacks, complaints about your products & services in social media -Capturing Customer behaviour -Predicting things before they happen Velocity & Volume of data
  • 6. Big Data Characteristics of big data: 1. Volume of data 2. Velocity of Data 3. Variety of Data * Difficult to handle in traditional ways Complex
  • 7. Big Data > Terminology Structured data: NAME NATURE TAG Shiva Thinking -- Forgetful Philosophy Shravanthi Innocent – Sensitive -- Journalist Champion Subhash Artistic -- Descretive Champion Shreyas Logical -- Passionate Champion Adithya Logical -- Articulative Champion Pallavi Outspeaking -- Friendly Champion Sonal Dancer -- Sportive Champion Nikhil Poetic -- Sportive Champion Gayathri Prude -- Honest Champion Anitha Blesser -- Gentle Champion Amandeep Aesthetic -- Independent Champion Malathi un known Champion Ankitha Cricketer -- Gentle Champion Vikram Logical -- Articulative Champion Ankesh Honest – Passionate Champion Tejo Managerial – Patience Leading CJ Quick – Logical Champion Deba Dedication – Honest Champion Charu Managerial – Independent Leading Ashok Social – Helping Leading Sijesh Analytical – Eager -- Helping Balanced Leading Tarun Shrewed – Responsive Leading Pavan Optimization – Shrewed Administrative Bhargav Balancing – Friendly Champion Surya Enthusiasm – Learning Versatile Swarnav Social – Outspeaking Administrative Bidisha Outspeakiing – Social Administrative Niranjan Articulative -- Friendly Leading
  • 10. Big Data evolution Commodity/Cheap Hardware Open source software Valuable Data Data mining / Data analysis using statistical modelling techniques
  • 11. Health care Banking Energy & Utilities Telecom Supply chain Retail Realestate Agriculture Sustainability etc.. Social networking web sites Search engines Job portals News portals Travel Recommendation Apps Online movie stores Animation industry etc.. Space research Bio research Image processing etc.. Enterprise Analytics Social media Apps & Analytics Research Oriented Fields Big Data > Anatomy of usecases?
  • 12. 2: Benefits ? Advanced Predictions use of predictions
  • 13. 2: Benefits ? Advanced Predictions Source 1 Source 2 Source 3 Knowledge Models Predictions: Revenue / spend forecasts; Sentiment analysis; Customer Behaviour analysis etc.. Data mining techniques
  • 14. 2: Benefits ? use of predictions - Helps to understand & optimize the complex business processes - predict opportunities / risks - Understand strengths / weaknesses - Optimizing resource usages *At much cheaper costs.
  • 15. 3: How different it is from traditional DBs? Big Data Vs RDBMS Big Data Vs BW, HANA
  • 16. Big Data Vs RDBMS . Big Data tools -> - More for data Analysis - Lack Transaction system capabilities - Do not fully comply with A C I D properties - Doesn’t allow to apply constraints at data level * Do not compete with OLTP systems
  • 17. Big Data Vs BW (or) HANA - Limited by Scalability (vertical scaling) -Can not deal with unstructured data -Not fault tolerant - Not well integrated with open source analytical / data mining tools
  • 19. What is Hadoop A distributed, parallel, data processing system (a type of NOSQL) map map map map reduce reduce Storage : HDFS Programming api: Mapreduce Well established Eco system: Hive, Pig, Hbase, Sqoop, oozy, Flume, Mahout, zookeeper etc… Input files Output files
  • 20. What is NOSQL Nosql -> Not Only SQL Highly Available Distributed Fault tolerant Schema less NOSQL DBs: Key Value based: Hadoop Columnar based: Hbase, Cassandra Graph based: Neo4j, Orient DB Document based: Mango DB, Couch DB
  • 21. 5: Challenges ? What to consider ? What data your organization has? Size & Sources? What Analytical usecases to be implemented ? What infrastructure you have? Which hardware to buy? What configuration? What NOSQL DBs you need ? Which vendor to approach? Migration Plan?
  • 22. 1. What is Big Data? 2. How will I benefit? 3. I don’t have huge data, should I still consider big data? 4. I already have BI or HANA etc. setup, Can I leverage them? 5. What challenges may I face? 6. How much can I Save  (Questions Covered in the session)