SlideShare a Scribd company logo
25-07-2019Big Data Analytics by Vikram Neerugatti1
Big Data Analytics
Vikram Neerugatti
Sri Venkateswara University, Tirupati.
vikramneerugatti@gmail.com
vikram@smartnutsandbolts.com
www.vikramneerugatti.com
www.smartnutsandbolts.com
Vikram Neerugatti
Vikram Nandini
Content
 What is Big Data
 Varieties of Data
 Unstructured Data
 Trends in Data Storage
 Industry Examples of Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti3
What is Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti4
Two men operating a mainframe computer, circa 1960. It’s amazing how
today’s smartphone holds so much more data than this huge 1960’s
relic. (Photo by Pictorial Parade/Archive Photos)
What is Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti5
 Big Data is the next generation of data
warehousing.
 twenty-first century, when the Age of Big Data
Analytics was in its infancy.
 It ’s not an overnight phenomenon.
 Reasons for now
 Computing perfect storm
 Data perfect storm
 Convergence perfect storm
Computing perfect storm
25-07-2019Big Data Analytics by Vikram Neerugatti6
 Big Data analytics are the natural result of four
major global trends:
1. Moore ’s Law (which basically says that
technology always gets cheaper),
2. mobile computing (that smart phone or mobile
tablet in your hand),
3. social networking (Facebook, Foursquare,
Pinterest, etc.),
4. and cloud computing (you don ’t even have to own
hardware or software anymore; you can rent or
lease someone else ’s).
Data Perfect Storm
25-07-2019Big Data Analytics by Vikram Neerugatti7
 Volumes of transactional data have been around
for decades for most big firms, but the flood gates
have now opened
 with more volume , and the velocity and variety—
the three Vs—of data that has arrived in
unprecedented ways.
 This perfect storm of the three Vs makes it
extremely complex and cumbersome
 with the current data management and
 analytics technology and practices.
Convergence perfect storm
25-07-2019Big Data Analytics by Vikram Neerugatti8
 Traditional data management and analytics
software and hardware
 technologies, open-source technology, and
commodity hardware
 are merging to create new alternatives for IT and
business executives
 to address Big Data analytics.
What is Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti9
 People are able to store that much data now and
more than they ever before.
 We have reached this tipping point where they
don ’t have to make decisions about which half to
keep or how much history to keep.
 It ’s now economically feasible to keep all of your
history and all of your variables and go back later
when you have a new question and start looking
for an answer.
 That hadn ’t been practical up until just recently.
What is Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti10
 Certainly the advances in blade technology and
the idea
 that Google brought to market of you take lots
and lots of small Intel
 servers and you gang them together and use
their potential in aggregate.
 That is the super computer of the future.
Evolution of data systems
25-07-2019Big Data Analytics by Vikram Neerugatti11
 Dependent (Early Days).
 Data systems were fairly new and users didn't
know quite know what they wanted. IT assumed
that “Build it and they shall come.”
 Independent (Recent Years).
 Users understood what an analytical platform was
and worked together with IT to define the business
needs and approach for deriving insights for their fi
rm.
 Interdependent (Big Data Era).
 Interactional stage between various companies,
creating more social collaboration beyond your
firm’s walls.
Big data
25-07-2019Big Data Analytics by Vikram Neerugatti12
 Here is how the McKinsey study defi nes Big
Data:
 Big data refers to datasets whose size is beyond
the ability of typical
 database software tools to capture, store,
manage, and analyze.
 big data in many sectors today will range from a
few
 dozen terabytes to multiple petabytes (thousands
of terabytes). 2
Big Data Analytics
25-07-2019Big Data Analytics by Vikram Neerugatti13
 The real challenge is identifying or developing
most cost-effective and reliable methods for
extracting value from all the terabytes and
petabytes of data now available.
 That ’s where Big Data analytics become
necessary.
 Comparing traditional analytics to Big Data
analytics is like comparing a cart to a tractor
 The differences in speed, scale, and complexity
are tremendous
Why now?
25-07-2019Big Data Analytics by Vikram Neerugatti14
Timeline of Recent Technology Developments
Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti15
 The industry has an evolving definition around
Big Data that is currently defined by three
dimensions:
 1. Volume
 2. Variety
 3. Velocity
Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti16
 Volume
 Data variety is the assortment of data.
 Traditionally data, especially operational data, is
“structured” as it is put into a database based on the type
of data (i.e., character, numeric, floating point, etc.).
 Over the past couple of decades, data has increasingly
become “unstructured” as the sources of data have
proliferated beyond operational applications.
 Oftentimes, text, audio, video, image, geospatial, and
Internet data (including click streams and log files) are
considered unstructured data .
 Semi-structured data
 customer name + date of call + complaint
 Velocity
Varieties of Data
25-07-2019Big Data Analytics by Vikram Neerugatti17
 The variety of data sources continues to increase.
 Internet data (i.e., clickstream, social media, social
networking links)
 Primary research (i.e., surveys, experiments,
observations)
 Secondary research (i.e., competitive and
marketplace data, industry reports, consumer data,
business data)
 Location data (i.e., mobile device data, geospatial
data)
 Image data (i.e., video, satellite image, surveillance)
 Supply chain data (i.e., EDI, vendor catalogs and
pricing, quality information)
 Device data (i.e., sensors, PLCs, RF devices, LIMs,
Varieties of Data
25-07-2019Big Data Analytics by Vikram Neerugatti18
 The wide variety of data leads to complexities in
ingesting the data into data storage.
 The variety of data also complicates the
transformation (or the changing of data into a
form
 that can be used in analytics processing) and
analytic computation of the processing of the
data.
Unstructured Data
25-07-2019Big Data Analytics by Vikram Neerugatti19
 structured data (the kind that is easy to define,
store, and analyze)
 Unstructured data (the kind that tends to defy
easy definition, takes up lots of storage capacity,
and is typically more difficult to analyze).
 Unstructured data is basically information that
either does not have a
 predefined data model and/or does not fi t well
into a relational database.
 Unstructured information is typically text heavy,
but may contain data such as dates, numbers,
and facts as well.
Unstructured Data
25-07-2019Big Data Analytics by Vikram Neerugatti20
 The term semi-structured data is used to describe
structured data that doesn't ’t fit into a formal
structure of data models.
 However, semi-structured data does contain tags
that separate semantic elements, which includes
the capability to enforce hierarchies within the
data.
Unstructured Data
25-07-2019Big Data Analytics by Vikram Neerugatti21
 but here are the main takeaways that we would like to
share with you:
 The amount of data (all data, everywhere) is doubling every
two years.
 Our world is becoming more transparent. We, in turn, are
beginning to accept this as we become more comfortable with
parting with data that we used to consider sacred and private.
 Most new data is unstructured. Specifically, unstructured data
represents almost 95 percent of new data, while structured
data represents only 5 percent.
 Unstructured data tends to grow exponentially, unlike
structured data, which tends to grow in a more linear fashion.
 Unstructured data is vastly underutilized. Imagine huge
deposits of oil or other natural resources that are just sitting
there, waiting to be used. That ’s the current state of
unstructured data as of today. Tomorrow will be a different
story because there ’s a lot of money to be made for smart
individuals and companies that can mine unstructured data
Is Big Data analytics worth the effort?
25-07-2019Big Data Analytics by Vikram Neerugatti22
 Yes, without a doubt Big Data analytics is worth
the effort.
 It will be a competitive advantage, and it ’s likely
to play a key role in sorting winners from losers in
our ultracompetitive global economy.
From a business perspective, you
’ll need to learn how to:
25-07-2019Big Data Analytics by Vikram Neerugatti23
 Use Big Data analytics to drive value for your
enterprise that aligns with your core competencies
and creates a competitive advantage for your
enterprise
 Capitalize on new technology capabilities and
leverage your existing technology assets
 Enable the appropriate organizational change to
move towards fact based decisions, adoption of new
technologies, and uniting people from multiple
disciplines into a single multidisciplinary team
 Deliver faster and superior results by embracing and
capitalizing on the ever-increasing rate of change that
is occurring in the global market place
Big Data analytics uses a wide
variety of advanced analytics
25-07-2019Big Data Analytics by Vikram Neerugatti24
Advanced Analytics to provide:
25-07-2019Big Data Analytics by Vikram Neerugatti25
Big Data Business Models
25-07-2019Big Data Analytics by Vikram Neerugatti26
Enabling Big Data Analytic
Applications
25-07-2019Big Data Analytics by Vikram Neerugatti27
The key to success for organizations seeking to
take advantage of this opportunity is:
25-07-2019Big Data Analytics by Vikram Neerugatti28
 Leverage all your current data and enrich it with
new data sources
 Enforce data quality policies and leverage today’
s best technology and people to support the
policies
 Relentlessly seek opportunities to imbue your
enterprise with fact based decision making
 Embed your analytic insights throughout your
organization
Trends in Data Storage
25-07-2019Big Data Analytics by Vikram Neerugatti29
 Following are differing types of storage systems:
 Distributed File Systems
 NoSQL Databases
 NewSQL Databases
 Big Data Querying Platforms
Trends in Data Storage
25-07-2019Big Data Analytics by Vikram Neerugatti30
Trends in Data Storage
25-07-2019Big Data Analytics by Vikram Neerugatti31
 Big Data Querying Platforms:
 Technologies that provide query facades infront of
big data stores such as distributed file systems or
NoSQL databases.
 The main concern is providing a high-level
interface, e.g. via SQL3 like query languages and
achieving low query latencies.
Industry Examples of Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti32
 collected from thought leaders in subjects and
industries such as
 Digital Marketing
 Financial Services
 Advertising
 and Healthcare.
Digital Marketing and the Non-line
World
25-07-2019Big Data Analytics by Vikram Neerugatti33
 Don ’t Abdicate Relationships
 Is IT Losing Control of Web Analytics?
Database Marketers, Pioneers of
Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti34
 Database marketing is really concerned with
building databases containing information about
individuals, using that information to better
understand those individuals, and communicating
effectively with some of those individuals to drive
business value.
Big Data and the New School of
Marketing
25-07-2019Big Data Analytics by Vikram Neerugatti35
 New School marketers deliver what today ’s
consumers want: relevant interactive
communication across the digital power channels:
email, mobile, social, display and the web.
 Consumers Have Changed. So Must
Marketers
 The Right Approach: Cross-Channel Lifecycle
Marketing
 It really starts with the capture of customer
permission, contact information, and preferences for
multiple channels.
Lifecycle Marketing approach: conversion, repurchase,
stickiness, win-back, and re-permission
25-07-2019Big Data Analytics by Vikram Neerugatti36
Marketing
25-07-2019Big Data Analytics by Vikram Neerugatti37
 Social and Affiliate Marketing
 Empowering Marketing with Social
Intelligence
Fraud and Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti38
 One of the most common forms of fraudulent
activity is credit card fraud.
 The credit card fraud rate in United States and
other countries is increasing.
 In order to prevent the fraud, credit card
transactions are monitored and checked in near
real time.
 If the checks identify pattern inconsistencies and
suspicious activity, the transaction is identified for
review and scalation.
Fraud and Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti39
 Big Data technologies provide an optimal
technology solution based on the following three
Vs:
 High volume. Years of customer records and
transactions (150 billion1 records per year)
 High velocity. Dynamic transactions and social
media information
 High variety. Social media plus other unstructured
data such as customer emails, call centre
conversations, as well as transactional structured
data
25-07-2019Big Data Analytics by Vikram Neerugatti40
Risk and Big Data
25-07-2019Big Data Analytics by Vikram Neerugatti41
 The two most common types of risk management
are credit risk management and market risk
management.
 A third type of risk, operational risk management,
isn’t as common as credit and market risk.
 Credit risk analytics focus on past credit
behaviors to predict the likelihood that a borrower
will.
 Market risk analytics focus on understanding the
likelihood that the value of a portfolio will
decrease due to the change in stock prices,
interest rates, foreign exchange rates, and
commodity prices.
Credit Risk Management
25-07-2019Big Data Analytics by Vikram Neerugatti42
Big Data and Advances in Health
Care
25-07-2019Big Data Analytics by Vikram Neerugatti43
Any questions
25-07-2019Big Data Analytics by Vikram Neerugatti44
Big Data Analytics
Vikram Neerugatti
Sri Venkateswara University, Tirupati.
vikramneerugatti@gmail.com
vikram@smartnutsandbolts.com
www.vikramneerugatti.com
www.smartnutsandbolts.com
Vikram Neerugatti
Vikram Nandini
25-07-2019Big Data Analytics by Vikram Neerugatti46

More Related Content

PPTX
Big Data Analytics
PPTX
Big Data Analytics
PPTX
1. Data Analytics-introduction
PDF
Java exception handling ppt
PPTX
Presentation [Full] Effective Communication Skills
PPTX
Pandas Data Cleaning and Preprocessing PPT.pptx
PPTX
introdution to SQL and SQL functions
PPSX
Applications of Big Data Analytics in Businesses
Big Data Analytics
Big Data Analytics
1. Data Analytics-introduction
Java exception handling ppt
Presentation [Full] Effective Communication Skills
Pandas Data Cleaning and Preprocessing PPT.pptx
introdution to SQL and SQL functions
Applications of Big Data Analytics in Businesses

What's hot (20)

PPTX
Introduction to data science
PPTX
Big Data - The 5 Vs Everyone Must Know
PDF
Data science presentation
PPTX
Big data by Mithlesh sadh
PPTX
Chapter 1 big data
PDF
Introduction to Data Science
PPTX
What is big data?
PPTX
Presentation on Big Data
PPTX
Data quality and data profiling
PPTX
Introduction to Big Data
PDF
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
PPTX
Big Data Analytics
PPTX
Data science
PPT
Big data analysis using map/reduce
PPTX
Big Data ppt
PDF
Big Data
PPTX
Big data
PPTX
Big data-ppt
PPTX
Big data
PPTX
Introduction to data science
Introduction to data science
Big Data - The 5 Vs Everyone Must Know
Data science presentation
Big data by Mithlesh sadh
Chapter 1 big data
Introduction to Data Science
What is big data?
Presentation on Big Data
Data quality and data profiling
Introduction to Big Data
Big Data Applications | Big Data Analytics Use-Cases | Big Data Tutorial for ...
Big Data Analytics
Data science
Big data analysis using map/reduce
Big Data ppt
Big Data
Big data
Big data-ppt
Big data
Introduction to data science
Ad

Similar to Big data analytics (20)

PDF
Analysis of Big Data
PDF
IRJET- Scope of Big Data Analytics in Industrial Domain
PPTX
Big data seminar at Broadridge
PPTX
Big Data seminar BR-new
PPTX
Big Data Seminar At Broadridge
PDF
Big Data idea implementation in organizations: potential, roadblocks
PDF
What is AI without Data?
PPTX
Introduction to Big Data
PDF
CAN DATA SCIENCE COMMAND THE FUTURE OF BUSINESSES IN 2025.pdf
PDF
Data - A Big Disrupter!
PDF
Bigdata final(이지은)
PDF
Know The What, Why, and How of Big Data_.pdf
PDF
Presentation by Luc Delanglez (DataLumen) at the Data Vault Modelling and Dat...
PPTX
How Big Data Shaping The Supply Chain
PDF
Top Trends & Predictions That Will Drive Data Science in 2022.pdf
PDF
Gartner eBook on Big Data
PDF
The boom in Xaas and the knowledge graph
PPTX
2019 Enterprise IT Predictions
PDF
Data foundation for analytics excellence
PDF
Big Data: Where is the Real Opportunity?
Analysis of Big Data
IRJET- Scope of Big Data Analytics in Industrial Domain
Big data seminar at Broadridge
Big Data seminar BR-new
Big Data Seminar At Broadridge
Big Data idea implementation in organizations: potential, roadblocks
What is AI without Data?
Introduction to Big Data
CAN DATA SCIENCE COMMAND THE FUTURE OF BUSINESSES IN 2025.pdf
Data - A Big Disrupter!
Bigdata final(이지은)
Know The What, Why, and How of Big Data_.pdf
Presentation by Luc Delanglez (DataLumen) at the Data Vault Modelling and Dat...
How Big Data Shaping The Supply Chain
Top Trends & Predictions That Will Drive Data Science in 2022.pdf
Gartner eBook on Big Data
The boom in Xaas and the knowledge graph
2019 Enterprise IT Predictions
Data foundation for analytics excellence
Big Data: Where is the Real Opportunity?
Ad

More from Vikram Nandini (20)

PDF
IoT: From Copper strip to Gold Bar
PDF
Design Patterns
PDF
Linux File Trees and Commands
PDF
Introduction to Linux & Basic Commands
PDF
INTRODUCTION to OOAD
PDF
PDF
Manufacturing - II Part
PDF
Manufacturing
PDF
Business Models
PDF
Prototyping Online Components
PDF
Artificial Neural Networks
PDF
IoT-Prototyping
PDF
Design Principles for Connected Devices
PDF
Introduction to IoT
PDF
Embedded decices
PDF
Communication in the IoT
PDF
Introduction to Cyber Security
PDF
cloud computing UNIT-2.pdf
PDF
Introduction to Web Technologies
PDF
Cascading Style Sheets
IoT: From Copper strip to Gold Bar
Design Patterns
Linux File Trees and Commands
Introduction to Linux & Basic Commands
INTRODUCTION to OOAD
Manufacturing - II Part
Manufacturing
Business Models
Prototyping Online Components
Artificial Neural Networks
IoT-Prototyping
Design Principles for Connected Devices
Introduction to IoT
Embedded decices
Communication in the IoT
Introduction to Cyber Security
cloud computing UNIT-2.pdf
Introduction to Web Technologies
Cascading Style Sheets

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Classroom Observation Tools for Teachers
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Institutional Correction lecture only . . .
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Business Ethics Teaching Materials for college
PDF
RMMM.pdf make it easy to upload and study
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
Cell Types and Its function , kingdom of life
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Abdominal Access Techniques with Prof. Dr. R K Mishra
Classroom Observation Tools for Teachers
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PPH.pptx obstetrics and gynecology in nursing
Renaissance Architecture: A Journey from Faith to Humanism
102 student loan defaulters named and shamed – Is someone you know on the list?
FourierSeries-QuestionsWithAnswers(Part-A).pdf
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Microbial diseases, their pathogenesis and prophylaxis
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Institutional Correction lecture only . . .
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Business Ethics Teaching Materials for college
RMMM.pdf make it easy to upload and study
Microbial disease of the cardiovascular and lymphatic systems
TR - Agricultural Crops Production NC III.pdf
Cell Types and Its function , kingdom of life

Big data analytics

  • 1. 25-07-2019Big Data Analytics by Vikram Neerugatti1
  • 2. Big Data Analytics Vikram Neerugatti Sri Venkateswara University, Tirupati. vikramneerugatti@gmail.com vikram@smartnutsandbolts.com www.vikramneerugatti.com www.smartnutsandbolts.com Vikram Neerugatti Vikram Nandini
  • 3. Content  What is Big Data  Varieties of Data  Unstructured Data  Trends in Data Storage  Industry Examples of Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti3
  • 4. What is Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti4 Two men operating a mainframe computer, circa 1960. It’s amazing how today’s smartphone holds so much more data than this huge 1960’s relic. (Photo by Pictorial Parade/Archive Photos)
  • 5. What is Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti5  Big Data is the next generation of data warehousing.  twenty-first century, when the Age of Big Data Analytics was in its infancy.  It ’s not an overnight phenomenon.  Reasons for now  Computing perfect storm  Data perfect storm  Convergence perfect storm
  • 6. Computing perfect storm 25-07-2019Big Data Analytics by Vikram Neerugatti6  Big Data analytics are the natural result of four major global trends: 1. Moore ’s Law (which basically says that technology always gets cheaper), 2. mobile computing (that smart phone or mobile tablet in your hand), 3. social networking (Facebook, Foursquare, Pinterest, etc.), 4. and cloud computing (you don ’t even have to own hardware or software anymore; you can rent or lease someone else ’s).
  • 7. Data Perfect Storm 25-07-2019Big Data Analytics by Vikram Neerugatti7  Volumes of transactional data have been around for decades for most big firms, but the flood gates have now opened  with more volume , and the velocity and variety— the three Vs—of data that has arrived in unprecedented ways.  This perfect storm of the three Vs makes it extremely complex and cumbersome  with the current data management and  analytics technology and practices.
  • 8. Convergence perfect storm 25-07-2019Big Data Analytics by Vikram Neerugatti8  Traditional data management and analytics software and hardware  technologies, open-source technology, and commodity hardware  are merging to create new alternatives for IT and business executives  to address Big Data analytics.
  • 9. What is Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti9  People are able to store that much data now and more than they ever before.  We have reached this tipping point where they don ’t have to make decisions about which half to keep or how much history to keep.  It ’s now economically feasible to keep all of your history and all of your variables and go back later when you have a new question and start looking for an answer.  That hadn ’t been practical up until just recently.
  • 10. What is Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti10  Certainly the advances in blade technology and the idea  that Google brought to market of you take lots and lots of small Intel  servers and you gang them together and use their potential in aggregate.  That is the super computer of the future.
  • 11. Evolution of data systems 25-07-2019Big Data Analytics by Vikram Neerugatti11  Dependent (Early Days).  Data systems were fairly new and users didn't know quite know what they wanted. IT assumed that “Build it and they shall come.”  Independent (Recent Years).  Users understood what an analytical platform was and worked together with IT to define the business needs and approach for deriving insights for their fi rm.  Interdependent (Big Data Era).  Interactional stage between various companies, creating more social collaboration beyond your firm’s walls.
  • 12. Big data 25-07-2019Big Data Analytics by Vikram Neerugatti12  Here is how the McKinsey study defi nes Big Data:  Big data refers to datasets whose size is beyond the ability of typical  database software tools to capture, store, manage, and analyze.  big data in many sectors today will range from a few  dozen terabytes to multiple petabytes (thousands of terabytes). 2
  • 13. Big Data Analytics 25-07-2019Big Data Analytics by Vikram Neerugatti13  The real challenge is identifying or developing most cost-effective and reliable methods for extracting value from all the terabytes and petabytes of data now available.  That ’s where Big Data analytics become necessary.  Comparing traditional analytics to Big Data analytics is like comparing a cart to a tractor  The differences in speed, scale, and complexity are tremendous
  • 14. Why now? 25-07-2019Big Data Analytics by Vikram Neerugatti14 Timeline of Recent Technology Developments
  • 15. Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti15  The industry has an evolving definition around Big Data that is currently defined by three dimensions:  1. Volume  2. Variety  3. Velocity
  • 16. Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti16  Volume  Data variety is the assortment of data.  Traditionally data, especially operational data, is “structured” as it is put into a database based on the type of data (i.e., character, numeric, floating point, etc.).  Over the past couple of decades, data has increasingly become “unstructured” as the sources of data have proliferated beyond operational applications.  Oftentimes, text, audio, video, image, geospatial, and Internet data (including click streams and log files) are considered unstructured data .  Semi-structured data  customer name + date of call + complaint  Velocity
  • 17. Varieties of Data 25-07-2019Big Data Analytics by Vikram Neerugatti17  The variety of data sources continues to increase.  Internet data (i.e., clickstream, social media, social networking links)  Primary research (i.e., surveys, experiments, observations)  Secondary research (i.e., competitive and marketplace data, industry reports, consumer data, business data)  Location data (i.e., mobile device data, geospatial data)  Image data (i.e., video, satellite image, surveillance)  Supply chain data (i.e., EDI, vendor catalogs and pricing, quality information)  Device data (i.e., sensors, PLCs, RF devices, LIMs,
  • 18. Varieties of Data 25-07-2019Big Data Analytics by Vikram Neerugatti18  The wide variety of data leads to complexities in ingesting the data into data storage.  The variety of data also complicates the transformation (or the changing of data into a form  that can be used in analytics processing) and analytic computation of the processing of the data.
  • 19. Unstructured Data 25-07-2019Big Data Analytics by Vikram Neerugatti19  structured data (the kind that is easy to define, store, and analyze)  Unstructured data (the kind that tends to defy easy definition, takes up lots of storage capacity, and is typically more difficult to analyze).  Unstructured data is basically information that either does not have a  predefined data model and/or does not fi t well into a relational database.  Unstructured information is typically text heavy, but may contain data such as dates, numbers, and facts as well.
  • 20. Unstructured Data 25-07-2019Big Data Analytics by Vikram Neerugatti20  The term semi-structured data is used to describe structured data that doesn't ’t fit into a formal structure of data models.  However, semi-structured data does contain tags that separate semantic elements, which includes the capability to enforce hierarchies within the data.
  • 21. Unstructured Data 25-07-2019Big Data Analytics by Vikram Neerugatti21  but here are the main takeaways that we would like to share with you:  The amount of data (all data, everywhere) is doubling every two years.  Our world is becoming more transparent. We, in turn, are beginning to accept this as we become more comfortable with parting with data that we used to consider sacred and private.  Most new data is unstructured. Specifically, unstructured data represents almost 95 percent of new data, while structured data represents only 5 percent.  Unstructured data tends to grow exponentially, unlike structured data, which tends to grow in a more linear fashion.  Unstructured data is vastly underutilized. Imagine huge deposits of oil or other natural resources that are just sitting there, waiting to be used. That ’s the current state of unstructured data as of today. Tomorrow will be a different story because there ’s a lot of money to be made for smart individuals and companies that can mine unstructured data
  • 22. Is Big Data analytics worth the effort? 25-07-2019Big Data Analytics by Vikram Neerugatti22  Yes, without a doubt Big Data analytics is worth the effort.  It will be a competitive advantage, and it ’s likely to play a key role in sorting winners from losers in our ultracompetitive global economy.
  • 23. From a business perspective, you ’ll need to learn how to: 25-07-2019Big Data Analytics by Vikram Neerugatti23  Use Big Data analytics to drive value for your enterprise that aligns with your core competencies and creates a competitive advantage for your enterprise  Capitalize on new technology capabilities and leverage your existing technology assets  Enable the appropriate organizational change to move towards fact based decisions, adoption of new technologies, and uniting people from multiple disciplines into a single multidisciplinary team  Deliver faster and superior results by embracing and capitalizing on the ever-increasing rate of change that is occurring in the global market place
  • 24. Big Data analytics uses a wide variety of advanced analytics 25-07-2019Big Data Analytics by Vikram Neerugatti24
  • 25. Advanced Analytics to provide: 25-07-2019Big Data Analytics by Vikram Neerugatti25
  • 26. Big Data Business Models 25-07-2019Big Data Analytics by Vikram Neerugatti26
  • 27. Enabling Big Data Analytic Applications 25-07-2019Big Data Analytics by Vikram Neerugatti27
  • 28. The key to success for organizations seeking to take advantage of this opportunity is: 25-07-2019Big Data Analytics by Vikram Neerugatti28  Leverage all your current data and enrich it with new data sources  Enforce data quality policies and leverage today’ s best technology and people to support the policies  Relentlessly seek opportunities to imbue your enterprise with fact based decision making  Embed your analytic insights throughout your organization
  • 29. Trends in Data Storage 25-07-2019Big Data Analytics by Vikram Neerugatti29  Following are differing types of storage systems:  Distributed File Systems  NoSQL Databases  NewSQL Databases  Big Data Querying Platforms
  • 30. Trends in Data Storage 25-07-2019Big Data Analytics by Vikram Neerugatti30
  • 31. Trends in Data Storage 25-07-2019Big Data Analytics by Vikram Neerugatti31  Big Data Querying Platforms:  Technologies that provide query facades infront of big data stores such as distributed file systems or NoSQL databases.  The main concern is providing a high-level interface, e.g. via SQL3 like query languages and achieving low query latencies.
  • 32. Industry Examples of Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti32  collected from thought leaders in subjects and industries such as  Digital Marketing  Financial Services  Advertising  and Healthcare.
  • 33. Digital Marketing and the Non-line World 25-07-2019Big Data Analytics by Vikram Neerugatti33  Don ’t Abdicate Relationships  Is IT Losing Control of Web Analytics?
  • 34. Database Marketers, Pioneers of Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti34  Database marketing is really concerned with building databases containing information about individuals, using that information to better understand those individuals, and communicating effectively with some of those individuals to drive business value.
  • 35. Big Data and the New School of Marketing 25-07-2019Big Data Analytics by Vikram Neerugatti35  New School marketers deliver what today ’s consumers want: relevant interactive communication across the digital power channels: email, mobile, social, display and the web.  Consumers Have Changed. So Must Marketers  The Right Approach: Cross-Channel Lifecycle Marketing  It really starts with the capture of customer permission, contact information, and preferences for multiple channels.
  • 36. Lifecycle Marketing approach: conversion, repurchase, stickiness, win-back, and re-permission 25-07-2019Big Data Analytics by Vikram Neerugatti36
  • 37. Marketing 25-07-2019Big Data Analytics by Vikram Neerugatti37  Social and Affiliate Marketing  Empowering Marketing with Social Intelligence
  • 38. Fraud and Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti38  One of the most common forms of fraudulent activity is credit card fraud.  The credit card fraud rate in United States and other countries is increasing.  In order to prevent the fraud, credit card transactions are monitored and checked in near real time.  If the checks identify pattern inconsistencies and suspicious activity, the transaction is identified for review and scalation.
  • 39. Fraud and Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti39  Big Data technologies provide an optimal technology solution based on the following three Vs:  High volume. Years of customer records and transactions (150 billion1 records per year)  High velocity. Dynamic transactions and social media information  High variety. Social media plus other unstructured data such as customer emails, call centre conversations, as well as transactional structured data
  • 40. 25-07-2019Big Data Analytics by Vikram Neerugatti40
  • 41. Risk and Big Data 25-07-2019Big Data Analytics by Vikram Neerugatti41  The two most common types of risk management are credit risk management and market risk management.  A third type of risk, operational risk management, isn’t as common as credit and market risk.  Credit risk analytics focus on past credit behaviors to predict the likelihood that a borrower will.  Market risk analytics focus on understanding the likelihood that the value of a portfolio will decrease due to the change in stock prices, interest rates, foreign exchange rates, and commodity prices.
  • 42. Credit Risk Management 25-07-2019Big Data Analytics by Vikram Neerugatti42
  • 43. Big Data and Advances in Health Care 25-07-2019Big Data Analytics by Vikram Neerugatti43
  • 44. Any questions 25-07-2019Big Data Analytics by Vikram Neerugatti44
  • 45. Big Data Analytics Vikram Neerugatti Sri Venkateswara University, Tirupati. vikramneerugatti@gmail.com vikram@smartnutsandbolts.com www.vikramneerugatti.com www.smartnutsandbolts.com Vikram Neerugatti Vikram Nandini
  • 46. 25-07-2019Big Data Analytics by Vikram Neerugatti46