Applications of Hadoop
Applications of Hadoop Framework
• Advertising, Media and Entertainment
• Click Stream Data Analysis
• Analysis of Server Log Data
• Analysis of Geolocation Data (Predictive Analysis)
• Fraud Detection and Prevention
Advertising, Media and Entertainment
The advertising, media and entertainment industries use Hadoop in many ways.
Benefits of using Hadoop in this field include:
• Improved ad targeting, analysis, forecasting and optimization.
• Personalized recommendations.
• Enhanced game player engagement.
Big Data Click Stream Analysis
Click stream analysis is the process of collecting, analyzing and reporting aggregate data
about which pages visitors visit. In other words, click stream analysis is the tracking and
analysis of visits to websites.
Web-Based Browsing and Buying Experience
A customer visits a shopping website to buy products.
• He clicks to read the specification of a product.
• After going through the items that interest him, he adds them to the shopping cart.
• He then proceeds to checkout.
• He looks at an item's shipping cost, decides not to buy it and closes the browser window.
Every click he made, and the point at which he stopped clicking, can offer valuable insight to
the company behind the website.
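The abandoned-checkout scenario above can be sketched as a small drop-off analysis. This is a minimal illustration in plain Python, not a Hadoop job: the event tuples, the action names and the `drop_off_counts` helper are all hypothetical, and grouping by session only mimics what a MapReduce key would do on a real cluster.

```python
from collections import defaultdict

# Hypothetical click events: (session_id, timestamp, action).
EVENTS = [
    ("s1", 1, "view_product"),
    ("s1", 2, "add_to_cart"),
    ("s1", 3, "checkout"),
    ("s1", 4, "view_shipping_cost"),  # s1 leaves after seeing shipping cost
    ("s2", 1, "view_product"),
    ("s2", 2, "add_to_cart"),
    ("s2", 3, "checkout"),
    ("s2", 4, "view_shipping_cost"),
    ("s2", 5, "purchase"),            # s2 completes the purchase
    ("s3", 1, "view_product"),        # s3 leaves after one page
]

def last_action_per_session(events):
    """Group events by session and keep the action with the latest timestamp."""
    sessions = defaultdict(list)
    for session_id, ts, action in events:
        sessions[session_id].append((ts, action))
    return {sid: max(clicks)[1] for sid, clicks in sessions.items()}

def drop_off_counts(events):
    """Count sessions that ended without a purchase, keyed by their last action."""
    counts = defaultdict(int)
    for action in last_action_per_session(events).values():
        if action != "purchase":
            counts[action] += 1
    return dict(counts)

print(drop_off_counts(EVENTS))
```

In this toy data, one session stops at the shipping-cost step, which is the kind of signal the slide describes.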
Analysis of Server Log Data
Hadoop can be used to analyze server log data. Log processing is used to extract a variety
of information. Its most common uses are extracting errors and counting the
occurrences of particular events within a system, such as login failures.
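Counting error events in logs fits the map and reduce pattern naturally. The sketch below imitates that pattern in plain Python, in the spirit of a Hadoop Streaming mapper and reducer; the log format, the sample lines and the function names are assumptions made for illustration, and the in-memory sort stands in for Hadoop's shuffle phase.

```python
from itertools import groupby

# Hypothetical log format: "<timestamp> <LEVEL> <event> user=<name>"
LOG_LINES = [
    "2024-01-01T10:00:00 INFO login_success user=alice",
    "2024-01-01T10:00:05 ERROR login_failure user=bob",
    "2024-01-01T10:00:09 ERROR login_failure user=bob",
    "2024-01-01T10:01:00 ERROR disk_full user=system",
    "malformed line",
]

def mapper(lines):
    """Emit (event, 1) for each ERROR line; skip lines that do not parse."""
    for line in lines:
        parts = line.split()
        if len(parts) >= 3 and parts[1] == "ERROR":
            yield parts[2], 1

def reducer(pairs):
    """Sum the counts for each event key after sorting the pairs by key."""
    for key, group in groupby(sorted(pairs), key=lambda kv: kv[0]):
        yield key, sum(count for _, count in group)

error_counts = dict(reducer(mapper(LOG_LINES)))
print(error_counts)
```

On a real cluster the mapper and reducer would run as separate processes reading from standard input, with Hadoop handling the sort between them.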
Enterprises mine server log data to gain a good
understanding of:
• Customer habits
• Social media use
• Web advertisement effectiveness
• Other metrics that inform business decisions
Analysis of Geolocation Data
Predictive analysis is the area of data mining concerned with forecasting
probabilities and trends; here it is applied to geolocation data.
Geolocation data is the information associated with an electronic device that
can be used to identify its physical location.
Hadoop helps reduce data storage costs while providing value-driven
intelligence, from asset tracking to predicting behavior.
Fraud Detection and Prevention
Fraud is a deliberate misrepresentation that causes another person to suffer damages.
Why use Hadoop to build a fraud detection model:
• Hadoop processes large datasets in full, which matters because data sampling
does not work well for rare events.
• Hadoop can solve much harder problems by leveraging multiple
cores across thousands of machines, searching through much
larger problem domains.
• Hadoop maintains an agile environment: it allows different
kinds of analysis and changes to existing models.
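The claim that sampling does not work for rare events can be made concrete with a little arithmetic. The sketch below assumes a fraud rate of 1 in 10,000 and independent random sampling; both are hypothetical figures chosen for illustration, not numbers from the slides.

```python
def expected_fraud_in_sample(fraud_rate, sample_size):
    """Expected number of fraud records in a random sample."""
    return fraud_rate * sample_size

def prob_sample_has_no_fraud(fraud_rate, sample_size):
    """Probability that an independent random sample contains no fraud at all."""
    return (1.0 - fraud_rate) ** sample_size

FRAUD_RATE = 1 / 10_000   # assumed: one fraudulent record per 10,000
FULL_DATASET = 10_000_000 # assumed dataset size
SAMPLE = 1_000            # assumed small sample

print("fraud cases in full data:", expected_fraud_in_sample(FRAUD_RATE, FULL_DATASET))
print("fraud cases in sample:   ", expected_fraud_in_sample(FRAUD_RATE, SAMPLE))
print("P(sample has no fraud):  ", prob_sample_has_no_fraud(FRAUD_RATE, SAMPLE))
```

Under these assumptions the full dataset holds about a thousand fraud cases, while a 1,000-record sample most likely holds none, so a model trained on the sample has almost nothing to learn from.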

Applications of Big Data & Hadoop
