SlideShare a Scribd company logo
DATA ANALYTICS WITH DRUID
YOU SUN JEONG
DATA ANALYTICS WITH DRUID
WHO AM I ?
Senior Software Engineer of SK Telecom
Commercial Products
Big Data Discovery Solution (~’16)
Hadoop DW (~’15)
PaaS(CloudFoundry) (~’13)
Iaas (OpenStack) (~’13)
Mail to : jerryjung@apache.org
2
DATA ANALYTICS WITH DRUID
FOOTPRINTS
2014
2015 

- Hadoop DW 

- Realtime NW Analytics
2016 

- Big Data Discovery

- Streaming Processing
3
DATA ANALYTICS WITH DRUID
AGENDA
‣ History
‣ What is Druid?
‣ Druid Architecture
‣ Real-Time Ingestion Demo (15m)
‣ Cohort Analysis (15m)
4
DATA ANALYTICS WITH DRUID
HISTORY
▸ Development started at Meta markets in 2011
▸ Apache V2 in early 2015
▸ 150+ contributors today
▸ https://github.com/druid-io
5
DATA ANALYTICS WITH DRUID
DATA LAKE
6
https://www.linkedin.com/pulse/more-analytics-than-just-fishing-data-lake-john-poppelaars
DATA ANALYTICS WITH DRUID
DW VS DATA LAKE
http://www.kdnuggets.com/2015/09/data-lake-vs-data-warehouse-key-differences.html
7
DATA ANALYTICS WITH DRUID
WHAT IS DRUID
Distributed, 

In-memory Multi-dimensional
OLAP store
8
DATA ANALYTICS WITH DRUID
PROBLEMS
timestamp domain user gender clicked
2011-01-01T00:01:35Z bieber.com 4312345532 Female 1
2011-01-01T00:03:03Z bieber.com 3484920241 Female 0
2011-01-01T00:04:51Z ultra.com 9530174728 Male 1
2011-01-01T00:05:33Z ultra.com 4098310573 Male 1
2011-01-01T00:05:53Z ultra.com 5832057930 Female 0
2011-01-01T00:06:17Z ultra.com 5789283478 Female 1
2011-01-01T00:23:15Z bieber.com 4730093842 Female 0
2011-01-01T00:38:51Z ultra.com 3909846810 Male 1
2011-01-01T00:49:33Z bieber.com 4930097162 Female 1
2011-01-01T00:49:53Z ultra.com 0381837193 Female 0
timestamp impressions clicks
2011-01-01T00:00:00Z 10 6
timestamp domain user gender clicked
2011-01-01T00:01:35Z bieber.com 4312345532 Female 1
2011-01-01T00:03:03Z bieber.com 3484920241 Female 0
2011-01-01T00:04:51Z ultra.com 9530174728 Male 1
2011-01-01T00:05:33Z ultra.com 4098310573 Male 1
2011-01-01T00:05:53Z ultra.com 5832057930 Female 0
2011-01-01T00:06:17Z ultra.com 5789283478 Female 1
2011-01-01T00:23:15Z bieber.com 4730093842 Female 0
2011-01-01T00:38:51Z ultra.com 9530174728 Male 1
2011-01-01T00:49:33Z bieber.com 4930097162 Female 1
2011-01-01T00:49:53Z ultra.com 0381837193 Female 0
timestamp domain gender impressions clicks
2011-01-01T00:00:00Z bieber.com Female 4 2
2011-01-01T00:00:00Z ultra.com Female 3 1
2011-01-01T00:00:00Z ultra.com Male 3 2
9
DATA ANALYTICS WITH DRUID
BIG DATA DISCOVERY
▸ Roll-up
▸ Summarizing over a dimension
▸ Drill-down
▸ Focusing (zooming in)
▸ Slicing and dicing
▸ Reducing dimensions (slice)
▸ Picking values of specific dimensions (dice)
▸ Pivoting
▸ Rotating multi-dimensional cube
10
DATA ANALYTICS WITH DRUID
OLAP CUBE
▸ Slice and Dice
11
DATA ANALYTICS WITH DRUID
IN-MEMORY
12
DATA ANALYTICS WITH DRUID
COLUMNAR STORAGE
13
DATA ANALYTICS WITH DRUID
DRUID TERMS
▸ Data
▸ Timestamp
▸ Dimension
▸ Metric
▸ Datasource
▸ Segment
▸ Granularity
14
DATA ANALYTICS WITH DRUID
DRUID ARCHITECTURE
REALTIME
BROKER HISTORICAL
15
DATA ANALYTICS WITH DRUID
ARCHITECTURE - BATCH INGESTION
HDFS
HISTORICAL
NODE
HISTORICAL
NODE
HISTORICAL
NODE
BROKER
NODE
Segments
Queries
16
DATA ANALYTICS WITH DRUID
ARCHITECTURE - STREAMING INGESTION
REALTIME
NODE
HISTORICAL
NODE
HISTORICAL
NODE
HISTORICAL
NODE
BROKER
NODE
Segments
Queries
Streaming
17
DATA ANALYTICS WITH DRUID
ARCHITECTURE - LAMBDA
REALTIME
NODE
HISTORICAL
NODE
HISTORICAL
NODE
HISTORICAL
NODE
BROKER
NODE
Segments
Queries
Streaming
HDFS
18
DATA ANALYTICS WITH DRUID
GLUE ARCHITECTURE
REAL TIME
TASK
HISTORICAL
NODE
HISTORICAL
NODE
HISTORICAL
NODE
BROKER
NODE
Segments
Queries
Streaming
STREAM
PROCESSOR

(TRANQUILITY)
Kafka Indexing Service
19
DATA ANALYTICS WITH DRUID
REAL WORLD ARCHITECTURE
DATA 

NODE #1
DATA 

NODE #N
OVERLORD
MIDDLE
MANAGE

#1
COORDI

NATOR
MYSQL
HA 

PROXY
MEMCACHED

#2
BROKER
NODE

#1
BROKER
NODE

#1
MEMCACHED

#3
MEMCACHED

#1
HISTORICAL
NODE #1
HISTORICAL
NODE #N
MIDDLE
MANAGE

#N
ZK1
ZK2
ZK3
20
DATA ANALYTICS WITH DRUID
DRUID MONITORING
21
http://www.slideshare.net/CharlesAllen9/programmatic-bidding-data-streams-druid
DATA ANALYTICS WITH DRUID
DRUID DATASOURCE
22
RDRUID
DATA ANALYTICS WITH DRUID
https://github.com/druid-io/RDruid
23
DATA ANALYTICS WITH DRUID
PYDROID
24
https://github.com/druid-io/pydruid
DATA ANALYTICS WITH DRUID
DEMO
▸ Jupyter Notebook(PyDruid)
▸ Mobile App User Events for 1 week 

: 2 billion events
▸ Scenario 

: Unique users

Cohort Analysis
25
DEMO
DATA ANALYTICS WITH DRUID
MAY THE FORCE BE WITH YOU
27
DATA ANALYTICS WITH DRUID
REFERENCES
▸ Druid

: http://www.popit.kr/tag/druid/ 

(https://www.facebook.com/popitkr/)

: http://druid.io/
▸ Cohort Analysis

: http://www.gregreda.com/2015/08/23/cohort-analysis-
with-python/
▸ Druid Meetup@Seoul

: http://www.meetup.com/Druid-Seoul/
28
DATA ANALYTICS WITH DRUID
POPIT
29
https://www.facebook.com/popitkr/
Q&A
THANK YOU
DATA ANALYTICS WITH DRUID 30

More Related Content

PDF
Druid @ branch
PDF
Real-time Analytics with Apache Flink and Druid
PDF
PDF
Real-time analytics with Druid at Appsflyer
PPTX
Programmatic Bidding Data Streams & Druid
PDF
Aggregated queries with Druid on terrabytes and petabytes of data
PDF
PPTX
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid
Druid @ branch
Real-time Analytics with Apache Flink and Druid
Real-time analytics with Druid at Appsflyer
Programmatic Bidding Data Streams & Druid
Aggregated queries with Druid on terrabytes and petabytes of data
July 2014 HUG : Pushing the limits of Realtime Analytics using Druid

What's hot (20)

PPTX
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset
PDF
Lambda Architectures in Practice
PPTX
Apache Druid Design and Future prospect
PDF
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
PPTX
Druid realtime indexing
PDF
ClickHouse Analytical DBMS: Introduction and Case Studies, by Alexander Zaitsev
PDF
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
PPTX
Druid at Hadoop Ecosystem
PDF
Druid meetup 4th_sql_on_druid
PDF
Benchmarking Apache Druid
PDF
Using ClickHouse for Experimentation
PDF
Webinar: Managing Real Time Risk Analytics with MongoDB
PDF
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
PPTX
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
PDF
Imply at Apache Druid Meetup in London 1-15-20
PPTX
Data Modeling IoT and Time Series data in NoSQL
PPTX
ClickHouse Paris Meetup. ClickHouse Analytical DBMS, Introduction. By Alexand...
PDF
ClickHouse Data Warehouse 101: The First Billion Rows, by Alexander Zaitsev a...
PDF
ClickHouse on Plug-n-Play Cloud, by Som Sikdar, Kodiak Data
PPTX
Lightning Talk: MongoDB Sharding
Interactive Realtime Dashboards on Data Streams using Kafka, Druid and Superset
Lambda Architectures in Practice
Apache Druid Design and Future prospect
Data Analytics and Processing at Snap - Druid Meetup LA - September 2018
Druid realtime indexing
ClickHouse Analytical DBMS: Introduction and Case Studies, by Alexander Zaitsev
Gregorry Letribot - Druid at Criteo - NoSQL matters 2015
Druid at Hadoop Ecosystem
Druid meetup 4th_sql_on_druid
Benchmarking Apache Druid
Using ClickHouse for Experimentation
Webinar: Managing Real Time Risk Analytics with MongoDB
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Pulsar: Real-time Analytics at Scale with Kafka, Kylin and Druid
Imply at Apache Druid Meetup in London 1-15-20
Data Modeling IoT and Time Series data in NoSQL
ClickHouse Paris Meetup. ClickHouse Analytical DBMS, Introduction. By Alexand...
ClickHouse Data Warehouse 101: The First Billion Rows, by Alexander Zaitsev a...
ClickHouse on Plug-n-Play Cloud, by Som Sikdar, Kodiak Data
Lightning Talk: MongoDB Sharding
Ad

Viewers also liked (16)

PPTX
Scalable Real-time analytics using Druid
PPT
Case Study: Realtime Analytics with Druid
PDF
Interactive analytics at scale with druid
PDF
Druid at SF Big Analytics 2015-12-01
PPTX
Big Data Day LA 2016/ Big Data Track - Real Time Analytics with Druid - Guill...
PDF
Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid
PDF
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
PPTX
Monitoring @ scale over diverse data sources @ PayPal - Druid, TSDB, Hadoop
PPTX
Using druid for interactive count distinct queries at scale @ nmc
PPTX
PayPal Real Time Analytics
PPTX
Dmp hadoop getting_start
PDF
Creating a Next-Generation Big Data Architecture
PDF
Presto, Zeppelin을 이용한 초간단 BI 구축 사례
PDF
Apache Zeppelin으로 데이터 분석하기
PDF
Zeppelin notebook 만들기
PDF
Zeppelin(Spark)으로 데이터 분석하기
Scalable Real-time analytics using Druid
Case Study: Realtime Analytics with Druid
Interactive analytics at scale with druid
Druid at SF Big Analytics 2015-12-01
Big Data Day LA 2016/ Big Data Track - Real Time Analytics with Druid - Guill...
Open Source Lambda Architecture with Hadoop, Kafka, Samza and Druid
OLAP for Big Data (Druid vs Apache Kylin vs Apache Lens)
Monitoring @ scale over diverse data sources @ PayPal - Druid, TSDB, Hadoop
Using druid for interactive count distinct queries at scale @ nmc
PayPal Real Time Analytics
Dmp hadoop getting_start
Creating a Next-Generation Big Data Architecture
Presto, Zeppelin을 이용한 초간단 BI 구축 사례
Apache Zeppelin으로 데이터 분석하기
Zeppelin notebook 만들기
Zeppelin(Spark)으로 데이터 분석하기
Ad

Similar to Data Analytics with Druid (20)

PPTX
Understanding apache-druid
PDF
Apache Druid 101
PPTX
The of Operational Analytics Data Store
PPTX
Druid Overview by Rachel Pedreschi
PPTX
Scalable olap with druid
PPTX
Our journey with druid - from initial research to full production scale
PPTX
Introduction to Druid and Druidry
PDF
Fast analytics kudu to druid
PDF
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
PPTX
Apache Drill
PPTX
Apache drill
PPTX
Using druid for interactive count distinct queries at scale
PDF
Web analytics at scale with Druid at naver.com
PPTX
Druid Scaling Realtime Analytics
PPTX
No sql and sql - open analytics summit
PPTX
Time-series data analysis and persistence with Druid
PPTX
Big data processing engines, Atlanta Meetup 4/30
PDF
Premier Inside-Out: Apache Druid
PDF
NoSQL no more: SQL on Druid with Apache Calcite
PPTX
Interactive Analytics at Scale in Apache Hive Using Druid
Understanding apache-druid
Apache Druid 101
The of Operational Analytics Data Store
Druid Overview by Rachel Pedreschi
Scalable olap with druid
Our journey with druid - from initial research to full production scale
Introduction to Druid and Druidry
Fast analytics kudu to druid
Building an Enterprise-Scale Dashboarding/Analytics Platform Powered by the C...
Apache Drill
Apache drill
Using druid for interactive count distinct queries at scale
Web analytics at scale with Druid at naver.com
Druid Scaling Realtime Analytics
No sql and sql - open analytics summit
Time-series data analysis and persistence with Druid
Big data processing engines, Atlanta Meetup 4/30
Premier Inside-Out: Apache Druid
NoSQL no more: SQL on Druid with Apache Calcite
Interactive Analytics at Scale in Apache Hive Using Druid

More from Yousun Jeong (9)

PDF
Stsg17 speaker yousunjeong
PDF
Spark day 2017 - Spark on Kubernetes
PDF
Kubernetes on aws
PDF
Kafka for begginer
PDF
IEEE International Conference on Data Engineering 2015
PDF
Spark streaming , Spark SQL
PDF
Big Telco Real-Time Network Analytics
PDF
Enterprise 환경에서의 오픈소스 기반 아키텍처 적용 사례
PDF
2012 07 28_cloud_reference_architecture_openplatform
Stsg17 speaker yousunjeong
Spark day 2017 - Spark on Kubernetes
Kubernetes on aws
Kafka for begginer
IEEE International Conference on Data Engineering 2015
Spark streaming , Spark SQL
Big Telco Real-Time Network Analytics
Enterprise 환경에서의 오픈소스 기반 아키텍처 적용 사례
2012 07 28_cloud_reference_architecture_openplatform

Recently uploaded (20)

PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Lecture1 pattern recognition............
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
annual-report-2024-2025 original latest.
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Lecture1 pattern recognition............
climate analysis of Dhaka ,Banglades.pptx
Supervised vs unsupervised machine learning algorithms
Clinical guidelines as a resource for EBP(1).pdf
.pdf is not working space design for the following data for the following dat...
Business Acumen Training GuidePresentation.pptx
Database Infoormation System (DBIS).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Miokarditis (Inflamasi pada Otot Jantung)
Fluorescence-microscope_Botany_detailed content
Business Ppt On Nestle.pptx huunnnhhgfvu
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
The THESIS FINAL-DEFENSE-PRESENTATION.pptx

Data Analytics with Druid

  • 1. DATA ANALYTICS WITH DRUID YOU SUN JEONG
  • 2. DATA ANALYTICS WITH DRUID WHO AM I ? Senior Software Engineer of SK Telecom Commercial Products Big Data Discovery Solution (~’16) Hadoop DW (~’15) PaaS(CloudFoundry) (~’13) Iaas (OpenStack) (~’13) Mail to : jerryjung@apache.org 2
  • 3. DATA ANALYTICS WITH DRUID FOOTPRINTS 2014 2015 
 - Hadoop DW 
 - Realtime NW Analytics 2016 
 - Big Data Discovery
 - Streaming Processing 3
  • 4. DATA ANALYTICS WITH DRUID AGENDA ‣ History ‣ What is Druid? ‣ Druid Architecture ‣ Real-Time Ingestion Demo (15m) ‣ Cohort Analysis (15m) 4
  • 5. DATA ANALYTICS WITH DRUID HISTORY ▸ Development started at Meta markets in 2011 ▸ Apache V2 in early 2015 ▸ 150+ contributors today ▸ https://github.com/druid-io 5
  • 6. DATA ANALYTICS WITH DRUID DATA LAKE 6 https://www.linkedin.com/pulse/more-analytics-than-just-fishing-data-lake-john-poppelaars
  • 7. DATA ANALYTICS WITH DRUID DW VS DATA LAKE http://www.kdnuggets.com/2015/09/data-lake-vs-data-warehouse-key-differences.html 7
  • 8. DATA ANALYTICS WITH DRUID WHAT IS DRUID Distributed, 
 In-memory Multi-dimensional OLAP store 8
  • 9. DATA ANALYTICS WITH DRUID PROBLEMS timestamp domain user gender clicked 2011-01-01T00:01:35Z bieber.com 4312345532 Female 1 2011-01-01T00:03:03Z bieber.com 3484920241 Female 0 2011-01-01T00:04:51Z ultra.com 9530174728 Male 1 2011-01-01T00:05:33Z ultra.com 4098310573 Male 1 2011-01-01T00:05:53Z ultra.com 5832057930 Female 0 2011-01-01T00:06:17Z ultra.com 5789283478 Female 1 2011-01-01T00:23:15Z bieber.com 4730093842 Female 0 2011-01-01T00:38:51Z ultra.com 3909846810 Male 1 2011-01-01T00:49:33Z bieber.com 4930097162 Female 1 2011-01-01T00:49:53Z ultra.com 0381837193 Female 0 timestamp impressions clicks 2011-01-01T00:00:00Z 10 6 timestamp domain user gender clicked 2011-01-01T00:01:35Z bieber.com 4312345532 Female 1 2011-01-01T00:03:03Z bieber.com 3484920241 Female 0 2011-01-01T00:04:51Z ultra.com 9530174728 Male 1 2011-01-01T00:05:33Z ultra.com 4098310573 Male 1 2011-01-01T00:05:53Z ultra.com 5832057930 Female 0 2011-01-01T00:06:17Z ultra.com 5789283478 Female 1 2011-01-01T00:23:15Z bieber.com 4730093842 Female 0 2011-01-01T00:38:51Z ultra.com 9530174728 Male 1 2011-01-01T00:49:33Z bieber.com 4930097162 Female 1 2011-01-01T00:49:53Z ultra.com 0381837193 Female 0 timestamp domain gender impressions clicks 2011-01-01T00:00:00Z bieber.com Female 4 2 2011-01-01T00:00:00Z ultra.com Female 3 1 2011-01-01T00:00:00Z ultra.com Male 3 2 9
  • 10. DATA ANALYTICS WITH DRUID BIG DATA DISCOVERY ▸ Roll-up ▸ Summarizing over a dimension ▸ Drill-down ▸ Focusing (zooming in) ▸ Slicing and dicing ▸ Reducing dimensions (slice) ▸ Picking values of specific dimensions (dice) ▸ Pivoting ▸ Rotating multi-dimensional cube 10
  • 11. DATA ANALYTICS WITH DRUID OLAP CUBE ▸ Slice and Dice 11
  • 12. DATA ANALYTICS WITH DRUID IN-MEMORY 12
  • 13. DATA ANALYTICS WITH DRUID COLUMNAR STORAGE 13
  • 14. DATA ANALYTICS WITH DRUID DRUID TERMS ▸ Data ▸ Timestamp ▸ Dimension ▸ Metric ▸ Datasource ▸ Segment ▸ Granularity 14
  • 15. DATA ANALYTICS WITH DRUID DRUID ARCHITECTURE REALTIME BROKER HISTORICAL 15
  • 16. DATA ANALYTICS WITH DRUID ARCHITECTURE - BATCH INGESTION HDFS HISTORICAL NODE HISTORICAL NODE HISTORICAL NODE BROKER NODE Segments Queries 16
  • 17. DATA ANALYTICS WITH DRUID ARCHITECTURE - STREAMING INGESTION REALTIME NODE HISTORICAL NODE HISTORICAL NODE HISTORICAL NODE BROKER NODE Segments Queries Streaming 17
  • 18. DATA ANALYTICS WITH DRUID ARCHITECTURE - LAMBDA REALTIME NODE HISTORICAL NODE HISTORICAL NODE HISTORICAL NODE BROKER NODE Segments Queries Streaming HDFS 18
  • 19. DATA ANALYTICS WITH DRUID GLUE ARCHITECTURE REAL TIME TASK HISTORICAL NODE HISTORICAL NODE HISTORICAL NODE BROKER NODE Segments Queries Streaming STREAM PROCESSOR
 (TRANQUILITY) Kafka Indexing Service 19
  • 20. DATA ANALYTICS WITH DRUID REAL WORLD ARCHITECTURE DATA 
 NODE #1 DATA 
 NODE #N OVERLORD MIDDLE MANAGE
 #1 COORDI
 NATOR MYSQL HA 
 PROXY MEMCACHED
 #2 BROKER NODE
 #1 BROKER NODE
 #1 MEMCACHED
 #3 MEMCACHED
 #1 HISTORICAL NODE #1 HISTORICAL NODE #N MIDDLE MANAGE
 #N ZK1 ZK2 ZK3 20
  • 21. DATA ANALYTICS WITH DRUID DRUID MONITORING 21 http://www.slideshare.net/CharlesAllen9/programmatic-bidding-data-streams-druid
  • 22. DATA ANALYTICS WITH DRUID DRUID DATASOURCE 22
  • 23. RDRUID DATA ANALYTICS WITH DRUID https://github.com/druid-io/RDruid 23
  • 24. DATA ANALYTICS WITH DRUID PYDROID 24 https://github.com/druid-io/pydruid
  • 25. DATA ANALYTICS WITH DRUID DEMO ▸ Jupyter Notebook(PyDruid) ▸ Mobile App User Events for 1 week 
 : 2 billion events ▸ Scenario 
 : Unique users
 Cohort Analysis 25
  • 26. DEMO
  • 27. DATA ANALYTICS WITH DRUID MAY THE FORCE BE WITH YOU 27
  • 28. DATA ANALYTICS WITH DRUID REFERENCES ▸ Druid
 : http://www.popit.kr/tag/druid/ 
 (https://www.facebook.com/popitkr/)
 : http://druid.io/ ▸ Cohort Analysis
 : http://www.gregreda.com/2015/08/23/cohort-analysis- with-python/ ▸ Druid Meetup@Seoul
 : http://www.meetup.com/Druid-Seoul/ 28
  • 29. DATA ANALYTICS WITH DRUID POPIT 29 https://www.facebook.com/popitkr/