SlideShare a Scribd company logo
Introduction
to
Big Data Analytics
1
Dr. Amitabh Mishra
Meaning of Big Data
“Big data is a high volume, high velocity, high variety
information asset that demands cost-effective and
innovative forms of information processing for
enhanced business insight and decision making”.
Dr. Amitabh Mishra 2
• Big data involves homogeneous voluminous data that could
be:
• Structured (as in RDBMS) or
• Unstructured (as in blogs, tweets, Facebook comments, emails)
• The content may be in different varieties as-
– Audio
– Picture
– Large text
Dr. Amitabh Mishra 3
• Handling Big data need newer and innovative technologies for
-capturing, storing, searching, integrating, analysing and
presenting newly found insights.
Dr. Amitabh Mishra 4
Benefits Big Data Analytics
• Here is a list of advantages that can be achieved by using Big Data
analytics:
– Understanding and Targeting Customers
– Understanding and Optimizing Business Processes
– Re-develop your products
– Personal Quantification and Performance Optimization
– Helps in Fraud Detection & improving Security
– Perform Risk Analysis
– Customize your website in real time
– Optimizing Machine and Device Performance
Dr. Amitabh Mishra 5
Characteristics of Big Data
Dr. Amitabh Mishra 6
Characteristics
Volume
Variety
Velocity
Variability
Characteristics: Volume
• Volume is obviously the most common trait of Big Data.
• Many factors contributed to the exponential increase in data volume, such
as:
– Transaction-based data stored through the years,
– Text data constantly streaming in from social media,
– Increasing amounts of sensor data being collected,
– Automatically generated GPS data, and so on.
• With the staggering increase in data volume, even the naming of the next Big Data echelon has been a challenge. The highest mass of data that used
to be called peta bytes (PB) has left its place to zeta bytes (ZB), which is a terabytes (TB).
(1 Terabyte can hold 200,000 songs or 17,000 hours of music / 500 hours of movies)
Dr. Amitabh Mishra 7
Characteristics: Variety
• Data today comes in all types of formats formats ranging from traditional
databases to:
– To hierarchical data stores created by the end users and OLAP systems (Online
Analytical Processing)
– To text documents, e-mail, XML, meter-collected, and sensor-captured data
– To video, audio, and stock ticker data
• By some estimates, 80 to 85 percent of all organizations’ data is in some
sort of unstructured or semi - structured format (a format that is not
suitable for traditional databases schemas).
Dr. Amitabh Mishra 8
Characteristics: Velocity
• Velocity means the speed of something in a given direction.
• According to Gartner, velocity means both
– How fast data is being produced and
– How fast the data must be processed (i.e., captured, stored, and
analysed) to meet the need or demand.
• Velocity is perhaps the most overlooked characteristic of
Big Data. Reacting quickly enough to deal with velocity is a
challenge to most organizations.
Dr. Amitabh Mishra 9
Characteristics: Variability
• In addition to the increasing velocities and
varieties of data, data flows can be highly
inconsistent with periodic peaks.
• Daily, seasonal, and event triggered peak data
loads can be challenging to manage—
especially with social media involved.
Dr. Amitabh Mishra 10

More Related Content

ODP
Introduction To Analytics
PPTX
PPT
Data mining Introduction
PPTX
Basic analtyics & advanced analtyics
PPTX
intro_to_business_analytics_and_data_science_ver 1.0
PPTX
Business intelligence concepts & application
PPTX
Business intelligence
PPTX
MIS: Business Intelligence
Introduction To Analytics
Data mining Introduction
Basic analtyics & advanced analtyics
intro_to_business_analytics_and_data_science_ver 1.0
Business intelligence concepts & application
Business intelligence
MIS: Business Intelligence

What's hot (20)

PPTX
Business Intelligence
PDF
Business Analytics
PPTX
Data analytics vs. Data analysis
PPTX
Business intelligence vs business analytics
PPTX
Business analytics and data mining
PPTX
Importance of data analytics for business
PPTX
Business Intelligence Module 1
PDF
An Introduction to Advanced analytics and data mining
PPTX
Introduction to business intelligence
PPTX
Introduction to Big Data & Analytics
PDF
PPT
Data mining techniques unit 1
PPTX
Data Mining
PPTX
Data mining
PPTX
Introduction to Business Data Analytics
PDF
Introduction to data analytics
PPTX
kinds of analytics
PDF
Business Intelligence Presentation (1/2)
PPSX
Data Analytics Business Intelligence
PPTX
Data Analytics
Business Intelligence
Business Analytics
Data analytics vs. Data analysis
Business intelligence vs business analytics
Business analytics and data mining
Importance of data analytics for business
Business Intelligence Module 1
An Introduction to Advanced analytics and data mining
Introduction to business intelligence
Introduction to Big Data & Analytics
Data mining techniques unit 1
Data Mining
Data mining
Introduction to Business Data Analytics
Introduction to data analytics
kinds of analytics
Business Intelligence Presentation (1/2)
Data Analytics Business Intelligence
Data Analytics
Ad

Similar to Introduction to Big Data (20)

PPTX
Evolution & Introduction to Big data-2.pptx
PPTX
Unit – 1 introduction to big datannj.pptx
PPTX
Introduction to big data
PPTX
Bigdata Hadoop introduction
PDF
Analysis of Big Data
PPTX
Chapter 1 big data
PDF
Know The What, Why, and How of Big Data_.pdf
PDF
bda-unit-bda-unit-materail big data1.pdf
PDF
Understanding big data and data analytics big data
PPTX
Big data Presentation
PDF
Module-1.BDA lecture notes fully easy and study material
PDF
Introduction to visualizing Big Data
PPTX
Lecture #03
PPTX
Big data Ppt
PPTX
Big Data tells about the data is how denser or discrete.pptx
PPTX
Foundations of Big Data: Concepts, Techniques, and Applications
DOCX
Data and Information.docx
PDF
beyond the hype 2015 concepts methods.pdf
PDF
BIG DATA.pdf
Evolution & Introduction to Big data-2.pptx
Unit – 1 introduction to big datannj.pptx
Introduction to big data
Bigdata Hadoop introduction
Analysis of Big Data
Chapter 1 big data
Know The What, Why, and How of Big Data_.pdf
bda-unit-bda-unit-materail big data1.pdf
Understanding big data and data analytics big data
Big data Presentation
Module-1.BDA lecture notes fully easy and study material
Introduction to visualizing Big Data
Lecture #03
Big data Ppt
Big Data tells about the data is how denser or discrete.pptx
Foundations of Big Data: Concepts, Techniques, and Applications
Data and Information.docx
beyond the hype 2015 concepts methods.pdf
BIG DATA.pdf
Ad

More from Amitabh Mishra (20)

PDF
Sales Quotas & Sales Territory
PDF
Sales Promotion
PDF
Transportation Management
PDF
Sales Forecasting
PDF
Sales Organisation
PDF
Selling Process
PDF
Objectives and Nature of Sales Management
PDF
Sales and Sales Management: Meaning and Definition
PDF
Packaging and Labaling
PDF
Product Life Cycle (PLC)
PDF
Targeting, Differentiation and Positioning
PDF
Marketing Environment by Dr. Amitabh Mishra
PDF
Marketing Mix
PDF
Marketing Philosophies or Concepts
PDF
Scope of Marketing
PDF
Introduction to Marketing Management
PDF
Service Scape
PDF
Service Product and Service Flower
PDF
Distribution of Services
PDF
Service Blueprinting
Sales Quotas & Sales Territory
Sales Promotion
Transportation Management
Sales Forecasting
Sales Organisation
Selling Process
Objectives and Nature of Sales Management
Sales and Sales Management: Meaning and Definition
Packaging and Labaling
Product Life Cycle (PLC)
Targeting, Differentiation and Positioning
Marketing Environment by Dr. Amitabh Mishra
Marketing Mix
Marketing Philosophies or Concepts
Scope of Marketing
Introduction to Marketing Management
Service Scape
Service Product and Service Flower
Distribution of Services
Service Blueprinting

Recently uploaded (20)

PPTX
CTG - Business Update 2Q2025 & 6M2025.pptx
PDF
THE COMPLETE GUIDE TO BUILDING PASSIVE INCOME ONLINE
DOCX
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx
PDF
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
PDF
NEW - FEES STRUCTURES (01-july-2024).pdf
PDF
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
PDF
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
PDF
Family Law: The Role of Communication in Mediation (www.kiu.ac.ug)
PPTX
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
PDF
Blood Collected straight from the donor into a blood bag and mixed with an an...
PDF
Booking.com The Global AI Sentiment Report 2025
PDF
Digital Marketing & E-commerce Certificate Glossary.pdf.................
PPTX
Board-Reporting-Package-by-Umbrex-5-23-23.pptx
PDF
Cours de Système d'information about ERP.pdf
PPTX
2025 Product Deck V1.0.pptxCATALOGTCLCIA
PPTX
Slide gioi thieu VietinBank Quy 2 - 2025
PDF
Nante Industrial Plug Factory: Engineering Quality for Modern Power Applications
PDF
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
PDF
Introduction to Generative Engine Optimization (GEO)
PDF
Keppel_Proposed Divestment of M1 Limited
CTG - Business Update 2Q2025 & 6M2025.pptx
THE COMPLETE GUIDE TO BUILDING PASSIVE INCOME ONLINE
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
NEW - FEES STRUCTURES (01-july-2024).pdf
NewBase 12 August 2025 Energy News issue - 1812 by Khaled Al Awadi_compresse...
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
Family Law: The Role of Communication in Mediation (www.kiu.ac.ug)
svnfcksanfskjcsnvvjknsnvsdscnsncxasxa saccacxsax
Blood Collected straight from the donor into a blood bag and mixed with an an...
Booking.com The Global AI Sentiment Report 2025
Digital Marketing & E-commerce Certificate Glossary.pdf.................
Board-Reporting-Package-by-Umbrex-5-23-23.pptx
Cours de Système d'information about ERP.pdf
2025 Product Deck V1.0.pptxCATALOGTCLCIA
Slide gioi thieu VietinBank Quy 2 - 2025
Nante Industrial Plug Factory: Engineering Quality for Modern Power Applications
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
Introduction to Generative Engine Optimization (GEO)
Keppel_Proposed Divestment of M1 Limited

Introduction to Big Data

  • 2. Meaning of Big Data “Big data is a high volume, high velocity, high variety information asset that demands cost-effective and innovative forms of information processing for enhanced business insight and decision making”. Dr. Amitabh Mishra 2
  • 3. • Big data involves homogeneous voluminous data that could be: • Structured (as in RDBMS) or • Unstructured (as in blogs, tweets, Facebook comments, emails) • The content may be in different varieties as- – Audio – Picture – Large text Dr. Amitabh Mishra 3
  • 4. • Handling Big data need newer and innovative technologies for -capturing, storing, searching, integrating, analysing and presenting newly found insights. Dr. Amitabh Mishra 4
  • 5. Benefits Big Data Analytics • Here is a list of advantages that can be achieved by using Big Data analytics: – Understanding and Targeting Customers – Understanding and Optimizing Business Processes – Re-develop your products – Personal Quantification and Performance Optimization – Helps in Fraud Detection & improving Security – Perform Risk Analysis – Customize your website in real time – Optimizing Machine and Device Performance Dr. Amitabh Mishra 5
  • 6. Characteristics of Big Data Dr. Amitabh Mishra 6 Characteristics Volume Variety Velocity Variability
  • 7. Characteristics: Volume • Volume is obviously the most common trait of Big Data. • Many factors contributed to the exponential increase in data volume, such as: – Transaction-based data stored through the years, – Text data constantly streaming in from social media, – Increasing amounts of sensor data being collected, – Automatically generated GPS data, and so on. • With the staggering increase in data volume, even the naming of the next Big Data echelon has been a challenge. The highest mass of data that used to be called peta bytes (PB) has left its place to zeta bytes (ZB), which is a terabytes (TB). (1 Terabyte can hold 200,000 songs or 17,000 hours of music / 500 hours of movies) Dr. Amitabh Mishra 7
  • 8. Characteristics: Variety • Data today comes in all types of formats formats ranging from traditional databases to: – To hierarchical data stores created by the end users and OLAP systems (Online Analytical Processing) – To text documents, e-mail, XML, meter-collected, and sensor-captured data – To video, audio, and stock ticker data • By some estimates, 80 to 85 percent of all organizations’ data is in some sort of unstructured or semi - structured format (a format that is not suitable for traditional databases schemas). Dr. Amitabh Mishra 8
  • 9. Characteristics: Velocity • Velocity means the speed of something in a given direction. • According to Gartner, velocity means both – How fast data is being produced and – How fast the data must be processed (i.e., captured, stored, and analysed) to meet the need or demand. • Velocity is perhaps the most overlooked characteristic of Big Data. Reacting quickly enough to deal with velocity is a challenge to most organizations. Dr. Amitabh Mishra 9
  • 10. Characteristics: Variability • In addition to the increasing velocities and varieties of data, data flows can be highly inconsistent with periodic peaks. • Daily, seasonal, and event triggered peak data loads can be challenging to manage— especially with social media involved. Dr. Amitabh Mishra 10