SlideShare a Scribd company logo
What is Big Data , 5'v of BIG DATA and Challenges
What is Big Data , 5'v of BIG DATA and Challenges
What is Big Data , 5'v of BIG DATA and Challenges
We produce a massive amount of data each day, whether we know
about it or not.
Every click on the internet,
every bank transaction,
 every video we watch on YouTube,
every email we send,
every like on our Instagram post makes up data for tech
companies.
With such a massive amount of data being collected, it only makes
sense for companies to use this data to understand their customers
and their behavior better.
This is the reason why the popularity of Data Science has grown
manifold over the last few years. Let’s try to understand what is big
data and its benefits and uses!
What is Big Data?
Big data is exactly what the name suggests, a “big” amount of
data. Big Data means a data set that is large in terms of
volume and is more complex.
Big data refers to extremely large and diverse collections of
structured, unstructured, and semi-structured data that
continues to grow exponentially over time.
These datasets are so huge and complex in volume, velocity,
and variety, that traditional data management systems
cannot store, process, and analyze them.
Big data is used in machine learning, predictive modeling,
and other advanced analytics to solve business problems
and make informed decisions.
The amount and availability of data is growing
rapidly, spurred on by digital technology
advancements, such as connectivity, mobility, the
Internet of Things (IoT), and artificial intelligence (AI).
What is Big Data , 5'v of BIG DATA and Challenges
 Big Data allows companies to address issues they are facing in
their business,
 and solve these problems effectively using Big Data Analytics.
 Companies try to identify patterns and draw insights from this
sea of data so that it can be acted upon to solve the problem(s)
at hand.
What is Big Data , 5'v of BIG DATA and Challenges
What is Big Data , 5'v of BIG DATA and Challenges
What is Big Data , 5'v of BIG DATA and Challenges
How Does Big Data Work?
Big data involves collecting, processing, and analyzing vast amounts of
data from multiple sources to uncover patterns, relationships, and
insights that can inform decision-making.
The process involves several steps:
What is Big Data , 5'v of BIG DATA and Challenges
How to Store and Process Big Data?
The volume and velocity of Big Data can be huge, which makes it
almost impossible to store it in traditional data warehouses.
Although some and sensitive information can be stored on
company premises, for most of the data, companies have to opt
for cloud storage or Hadoop.
Cloud storage allows businesses to store their data on the internet with
the help of a cloud service provider (like Amazon Web Services,
Microsoft Azure, or Google Cloud Platform) who takes the responsibility
of managing and storing the data. The data can be accessed easily and
quickly with an API.
Hadoop also does the same thing, by giving you the ability to store and
process large amounts of data at once. Hadoop is an open-source
software framework and is free. It allows users to process large
datasets across clusters of computers.
What are the main challenges?
For all its benefits, there are still some challenges to overcome
with Big Data.
1. Data Growth
Managing datasets having terabytes of information can be a big
challenge for companies.
As datasets grow in size, storing them not only becomes a challenge but
also becomes an expensive affair for companies.
2. Data Security
Data security is often prioritized quite low in the Big Data workflow,
which can backfire at times. With such a large amount of data being
collected, security challenges are bound to come up sooner or later.
Mining of sensitive information, fake data generation, and lack of
cryptographic protection (encryption) are some of the challenges
businesses face when trying to adopt Big Data techniques.
3. Data Integration
Data is coming in from a lot of different sources (social media
applications, emails, customer verification documents, survey forms,
etc.). It often becomes a very big operational challenge for
companies to combine and reconcile all of this data.
There are several Big Data solution vendors that offer ETL (Extract,
Transform, Load) and data integration solutions to companies that
are trying to overcome data integration problems.

More Related Content

PDF
Big Data at a Glance
PDF
Big Data Fundamentals
PPTX
basic of data science and big data......
PDF
Unit III.pdf
PDF
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
PPTX
Chapter 4 : Introduction to BigData.pptx
PDF
Data foundation for analytics excellence
PDF
An Encyclopedic Overview Of Big Data Analytics
Big Data at a Glance
Big Data Fundamentals
basic of data science and big data......
Unit III.pdf
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Chapter 4 : Introduction to BigData.pptx
Data foundation for analytics excellence
An Encyclopedic Overview Of Big Data Analytics

Similar to What is Big Data , 5'v of BIG DATA and Challenges (20)

PPTX
DOCX
Bidata
PDF
The book of elephant tattoo
PPTX
Big Data
PDF
Bda assignment can also be used for BDA notes and concept understanding.
PPTX
Big data
PPTX
Big data
DOCX
Introduction to big data – convergences.
PPTX
PRESTAdASFDGFHGHKJLKKHGFDSsadsfdgfhfgghjA.pptx
PPTX
Security issues in big data
PPTX
How to tackle big data from a security
PDF
Mastering Big Data: Tools, Techniques, and Applications
PDF
Unit No2 Introduction to big data.pdf
PDF
Converting Big Data To Smart Data | The Step-By-Step Guide!
PDF
Ab cs of big data
PDF
Big data's impact on online marketing
PDF
Snowball Group Whitepaper - Spotlight on Big Data
Bidata
The book of elephant tattoo
Big Data
Bda assignment can also be used for BDA notes and concept understanding.
Big data
Big data
Introduction to big data – convergences.
PRESTAdASFDGFHGHKJLKKHGFDSsadsfdgfhfgghjA.pptx
Security issues in big data
How to tackle big data from a security
Mastering Big Data: Tools, Techniques, and Applications
Unit No2 Introduction to big data.pdf
Converting Big Data To Smart Data | The Step-By-Step Guide!
Ab cs of big data
Big data's impact on online marketing
Snowball Group Whitepaper - Spotlight on Big Data
Ad

More from anjanasharma77573 (20)

PPTX
In- Built Math function in java script..
PPTX
In Built Math functions in java script..
PPTX
What is tidyverse in R languages and different packages
PPTX
What is big data and 5'v of big data....
PPTX
Basic of data and different type of data
PPTX
Basic of data science, and type of data.
PPTX
Role of Infogram, power bi and google charts
PPTX
DATA VISUALIZATION TOOLS e.g Power bi..
PPTX
type of vector data in vectors and geometries
PPTX
Introduction to vectors and geometry - ..
PPTX
type of vector data in vectors and geometry
PPTX
Introduction to vectors and geometry -....
PPTX
basic of SQL constraints in database management system
PPTX
SQL subqueries in database management system
PPTX
practices of C programming function concepts
PPTX
Practice of c PROGRAMMING logics and concepts
PPTX
programming concepts with c ++..........
PPTX
basic of c programming practicals.......
PPTX
Detailed concept of function in c programming
PPTX
Implemintation of looping programs......
In- Built Math function in java script..
In Built Math functions in java script..
What is tidyverse in R languages and different packages
What is big data and 5'v of big data....
Basic of data and different type of data
Basic of data science, and type of data.
Role of Infogram, power bi and google charts
DATA VISUALIZATION TOOLS e.g Power bi..
type of vector data in vectors and geometries
Introduction to vectors and geometry - ..
type of vector data in vectors and geometry
Introduction to vectors and geometry -....
basic of SQL constraints in database management system
SQL subqueries in database management system
practices of C programming function concepts
Practice of c PROGRAMMING logics and concepts
programming concepts with c ++..........
basic of c programming practicals.......
Detailed concept of function in c programming
Implemintation of looping programs......
Ad

Recently uploaded (20)

PDF
Lecture1 pattern recognition............
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Business Analytics and business intelligence.pdf
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Foundation of Data Science unit number two notes
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Database Infoormation System (DBIS).pptx
Lecture1 pattern recognition............
STUDY DESIGN details- Lt Col Maksud (21).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Supervised vs unsupervised machine learning algorithms
Business Ppt On Nestle.pptx huunnnhhgfvu
Miokarditis (Inflamasi pada Otot Jantung)
Business Analytics and business intelligence.pdf
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
.pdf is not working space design for the following data for the following dat...
Foundation of Data Science unit number two notes
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Quality review (1)_presentation of this 21
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
ISS -ESG Data flows What is ESG and HowHow
Introduction to Knowledge Engineering Part 1
Database Infoormation System (DBIS).pptx

What is Big Data , 5'v of BIG DATA and Challenges

  • 4. We produce a massive amount of data each day, whether we know about it or not. Every click on the internet, every bank transaction,  every video we watch on YouTube, every email we send, every like on our Instagram post makes up data for tech companies. With such a massive amount of data being collected, it only makes sense for companies to use this data to understand their customers and their behavior better. This is the reason why the popularity of Data Science has grown manifold over the last few years. Let’s try to understand what is big data and its benefits and uses!
  • 5. What is Big Data? Big data is exactly what the name suggests, a “big” amount of data. Big Data means a data set that is large in terms of volume and is more complex. Big data refers to extremely large and diverse collections of structured, unstructured, and semi-structured data that continues to grow exponentially over time. These datasets are so huge and complex in volume, velocity, and variety, that traditional data management systems cannot store, process, and analyze them. Big data is used in machine learning, predictive modeling, and other advanced analytics to solve business problems and make informed decisions.
  • 6. The amount and availability of data is growing rapidly, spurred on by digital technology advancements, such as connectivity, mobility, the Internet of Things (IoT), and artificial intelligence (AI).
  • 8.  Big Data allows companies to address issues they are facing in their business,  and solve these problems effectively using Big Data Analytics.  Companies try to identify patterns and draw insights from this sea of data so that it can be acted upon to solve the problem(s) at hand.
  • 12. How Does Big Data Work? Big data involves collecting, processing, and analyzing vast amounts of data from multiple sources to uncover patterns, relationships, and insights that can inform decision-making. The process involves several steps:
  • 14. How to Store and Process Big Data? The volume and velocity of Big Data can be huge, which makes it almost impossible to store it in traditional data warehouses. Although some and sensitive information can be stored on company premises, for most of the data, companies have to opt for cloud storage or Hadoop.
  • 15. Cloud storage allows businesses to store their data on the internet with the help of a cloud service provider (like Amazon Web Services, Microsoft Azure, or Google Cloud Platform) who takes the responsibility of managing and storing the data. The data can be accessed easily and quickly with an API. Hadoop also does the same thing, by giving you the ability to store and process large amounts of data at once. Hadoop is an open-source software framework and is free. It allows users to process large datasets across clusters of computers.
  • 16. What are the main challenges? For all its benefits, there are still some challenges to overcome with Big Data. 1. Data Growth Managing datasets having terabytes of information can be a big challenge for companies. As datasets grow in size, storing them not only becomes a challenge but also becomes an expensive affair for companies.
  • 17. 2. Data Security Data security is often prioritized quite low in the Big Data workflow, which can backfire at times. With such a large amount of data being collected, security challenges are bound to come up sooner or later. Mining of sensitive information, fake data generation, and lack of cryptographic protection (encryption) are some of the challenges businesses face when trying to adopt Big Data techniques.
  • 18. 3. Data Integration Data is coming in from a lot of different sources (social media applications, emails, customer verification documents, survey forms, etc.). It often becomes a very big operational challenge for companies to combine and reconcile all of this data. There are several Big Data solution vendors that offer ETL (Extract, Transform, Load) and data integration solutions to companies that are trying to overcome data integration problems.