SlideShare a Scribd company logo
Data Mining & Business Analysis
1
Presented By
Kazi Md Zuber
1.Click Stream Analysis
2. Hadoop Framework
 Click Stream Analysis
 What is Click Stream Analysis ?
“The electronic path a user takes while navigating
from site to site, and from page to page within a
..site”.
 In simple word we can say ,
“A user's activities on the World Wide Web as
represented by the sequence of links they click on”.
2
 Why need Click stream
analysis ?
 Area of Interests
 Personal Information
 Which Devices Use More (PC/Mobaile)
 Country wise click stream data
 Visitor’s data such as Male/Female or Adult/Child
 History (User’s Navigation History)
 Etc..
3
To Know User’s…
Data sources of Click Stream
analysis
 E-Commerce website
 Share Market
 YouTube
 Clouds
 Servers
 Telecom
 Etc.
4
 Who Use Click Stream Analyzed
Data
5
Amazon analyze
user’s behavior
Such as,
- Clicked items,
- User location,
- More selling
..Products
- Visited pages
- Review
On particular
website or pages
More user can
target for show
specific types of
Advertise.
-visitors online
-Comments
-Time
-Etc..
-Searched
.Videos
-Daily Viewer
-Etc.
Business companies are use this data
To Convert customers, Sell Product,
Increase business growth and for So many
Reasons they use Click Stream Analyzed data.
So it’s call Data Mining & Business Intelligence.
6
0
50000
100000
150000
200000
250000
300000
350000
400000
450000
500000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
US
UK
Singapore
NZ
NULL
India
Europe
Australia
Asia Pacific
 Country wise viewers
Accessed Pages By Users
7
 From these analyzed data Google
AdSense set Advertised on most of
visited page.
Number of clicks by hour of
day
8
0
50000
100000
150000
200000
250000
300000
350000
400000
450000
500000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Number of clicks by hour of day
Number of clicks
 At which time target
more visitors for
advertise or some other
event.
 This idea can get from
these analyzed.
 A Full Month of Browsing
9
 Example 10
Example 11
Click streaming data sources
 E-Commerce website Data
 Share Market Data
 YouTube Data
 Clouds Data
 Servers Data
 Telecom Data
 Satellite Data
 Business Data
12
Big Data
But Exactly What is
Big data ?
?
Just one second 13
Big Data & HADOOP
 Extremely large data sets that may be analysed computationally to reveal
patterns, trends, and associations, especially relating to human behaviour
and interactions.
 Big Data Handled By HADOOP
 Hadoop is java based platform
 FDS, HDFS, Name node (xml data), Job tracker(meta data)
 Hadoop Cluster
14
 Applications of Hadoop
Framework
 Advertising , Media and Entertainment
 Click Stream Data Analysis
 Analysis of Server Log Data
 Analysis of Geolocation Data
 Fraud Detection and Prevention
15
 Which Companies are using
HADOOP
16
References
 NASA NEX (www.aws.amazon.com)
(https://aws.amazon.com/public-datasets/nasa-nex/)
 Data Mill North (www.datamillnorth.org)
 Applications of Big Data & Hadoop (www.slidshare.com)
 Google AdSense (www.google.ads.com)
 36 Mind Blowing YouTube Facts, Figures and Statistics – 2017 | Digital
Marketing Education (fortunelords.com)
 www.nasa.gov.com
17
Question ???
18
What is Hadoop Cluster?
19

More Related Content

PDF
Sources of data collection for business applications
PPTX
Big Data
PPTX
Web log & clickstream
PDF
Clickstream Data Warehouse - Turning clicks into customers
PPTX
Clickstream ppt copy
PDF
Clickstream Analysis
PDF
Point of View on Cambridge Analytica Scandal
PDF
Entity Aware Click Graph
Sources of data collection for business applications
Big Data
Web log & clickstream
Clickstream Data Warehouse - Turning clicks into customers
Clickstream ppt copy
Clickstream Analysis
Point of View on Cambridge Analytica Scandal
Entity Aware Click Graph

What's hot (10)

PPT
The Search for Opportunities in the Internet Using Internet Marketing Mechanisms
PDF
FuhSen
PDF
whitehurst
PDF
What You Need to Know About Big Data
PPT
Fishbowl Model - Team enClair
PDF
Utilising Data In Your Digital & PR Strategy - 3XE Digital
PDF
Dow Jones: Reimagining the News as a Knowledge Graph
PPS
PDF
Linkurious SDK: Build enterprise-ready graph applications faster
PPTX
Ten New Media Opportunities for Traditional Media
The Search for Opportunities in the Internet Using Internet Marketing Mechanisms
FuhSen
whitehurst
What You Need to Know About Big Data
Fishbowl Model - Team enClair
Utilising Data In Your Digital & PR Strategy - 3XE Digital
Dow Jones: Reimagining the News as a Knowledge Graph
Linkurious SDK: Build enterprise-ready graph applications faster
Ten New Media Opportunities for Traditional Media
Ad

Similar to Click stream analysis and hadoop framwork (20)

PPTX
Clickstream data with spark
PPTX
How Startups can leverage big data?
PDF
Hortonworks and HP Vertica Webinar
PPTX
Big Data Analytics with Hadoop
PDF
Applications of Big Data & Hadoop
PDF
Pivotal - Advanced Analytics for Telecommunications
PDF
Hadoop Data Reservoir Webinar
PPTX
Big datapresentation
PDF
7 ‘Hidden’ Sources of Big Data That You Have
PPTX
Introduction to Harnessing Big Data
PPT
Web analytics
PPTX
Big data4businessusers
PDF
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
PPTX
bigdata.pptx
PDF
Rapid Data Exploration With Hadoop
PDF
EDF2013: Big Data Tutorial: Marko Grobelnik
PDF
Customer value analysis of big data products
PPTX
big-data-8722-m8RQ3h1.pptx
PDF
Big data analytics with Apache Hadoop
PPTX
BDA UNIT 1big data – web analytics – big data applications– big data technolo...
Clickstream data with spark
How Startups can leverage big data?
Hortonworks and HP Vertica Webinar
Big Data Analytics with Hadoop
Applications of Big Data & Hadoop
Pivotal - Advanced Analytics for Telecommunications
Hadoop Data Reservoir Webinar
Big datapresentation
7 ‘Hidden’ Sources of Big Data That You Have
Introduction to Harnessing Big Data
Web analytics
Big data4businessusers
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
bigdata.pptx
Rapid Data Exploration With Hadoop
EDF2013: Big Data Tutorial: Marko Grobelnik
Customer value analysis of big data products
big-data-8722-m8RQ3h1.pptx
Big data analytics with Apache Hadoop
BDA UNIT 1big data – web analytics – big data applications– big data technolo...
Ad

Recently uploaded (20)

PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Lesson notes of climatology university.
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
RMMM.pdf make it easy to upload and study
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
Classroom Observation Tools for Teachers
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Microbial disease of the cardiovascular and lymphatic systems
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPH.pptx obstetrics and gynecology in nursing
VCE English Exam - Section C Student Revision Booklet
102 student loan defaulters named and shamed – Is someone you know on the list?
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
GDM (1) (1).pptx small presentation for students
Renaissance Architecture: A Journey from Faith to Humanism
O5-L3 Freight Transport Ops (International) V1.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
Lesson notes of climatology university.
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Complications of Minimal Access Surgery at WLH
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
RMMM.pdf make it easy to upload and study
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Pre independence Education in Inndia.pdf
Classroom Observation Tools for Teachers
3rd Neelam Sanjeevareddy Memorial Lecture.pdf

Click stream analysis and hadoop framwork

  • 1. Data Mining & Business Analysis 1 Presented By Kazi Md Zuber 1.Click Stream Analysis 2. Hadoop Framework
  • 2.  Click Stream Analysis  What is Click Stream Analysis ? “The electronic path a user takes while navigating from site to site, and from page to page within a ..site”.  In simple word we can say , “A user's activities on the World Wide Web as represented by the sequence of links they click on”. 2
  • 3.  Why need Click stream analysis ?  Area of Interests  Personal Information  Which Devices Use More (PC/Mobaile)  Country wise click stream data  Visitor’s data such as Male/Female or Adult/Child  History (User’s Navigation History)  Etc.. 3 To Know User’s…
  • 4. Data sources of Click Stream analysis  E-Commerce website  Share Market  YouTube  Clouds  Servers  Telecom  Etc. 4
  • 5.  Who Use Click Stream Analyzed Data 5 Amazon analyze user’s behavior Such as, - Clicked items, - User location, - More selling ..Products - Visited pages - Review On particular website or pages More user can target for show specific types of Advertise. -visitors online -Comments -Time -Etc.. -Searched .Videos -Daily Viewer -Etc. Business companies are use this data To Convert customers, Sell Product, Increase business growth and for So many Reasons they use Click Stream Analyzed data. So it’s call Data Mining & Business Intelligence.
  • 6. 6 0 50000 100000 150000 200000 250000 300000 350000 400000 450000 500000 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 US UK Singapore NZ NULL India Europe Australia Asia Pacific  Country wise viewers
  • 7. Accessed Pages By Users 7  From these analyzed data Google AdSense set Advertised on most of visited page.
  • 8. Number of clicks by hour of day 8 0 50000 100000 150000 200000 250000 300000 350000 400000 450000 500000 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Number of clicks by hour of day Number of clicks  At which time target more visitors for advertise or some other event.  This idea can get from these analyzed.
  • 9.  A Full Month of Browsing 9
  • 12. Click streaming data sources  E-Commerce website Data  Share Market Data  YouTube Data  Clouds Data  Servers Data  Telecom Data  Satellite Data  Business Data 12 Big Data But Exactly What is Big data ? ?
  • 14. Big Data & HADOOP  Extremely large data sets that may be analysed computationally to reveal patterns, trends, and associations, especially relating to human behaviour and interactions.  Big Data Handled By HADOOP  Hadoop is java based platform  FDS, HDFS, Name node (xml data), Job tracker(meta data)  Hadoop Cluster 14
  • 15.  Applications of Hadoop Framework  Advertising , Media and Entertainment  Click Stream Data Analysis  Analysis of Server Log Data  Analysis of Geolocation Data  Fraud Detection and Prevention 15
  • 16.  Which Companies are using HADOOP 16
  • 17. References  NASA NEX (www.aws.amazon.com) (https://aws.amazon.com/public-datasets/nasa-nex/)  Data Mill North (www.datamillnorth.org)  Applications of Big Data & Hadoop (www.slidshare.com)  Google AdSense (www.google.ads.com)  36 Mind Blowing YouTube Facts, Figures and Statistics – 2017 | Digital Marketing Education (fortunelords.com)  www.nasa.gov.com 17
  • 18. Question ??? 18 What is Hadoop Cluster?
  • 19. 19