Data-driven culture & infrastructure
from the ground up
January 2017
Agenda
● R&D and IT consultancy firm
● Clients - Silicon Valley Startups from Round A to Round D
● Hardcore Big Data Analytics team in Kazan
People driven
Excel driven
BI driven
Data Driven Team
● Product Manager
● Owners
● Engineer
● UI
● Data Scientist
● Sales/Marketing
● DevOps
● Customer
Data Driven Team
● Product - Metrics, Ad-hoc reports, Reports
● Customer - Dashboards, Personalization, Insights, Reports
● Owners - Revenue trends, Prescriptions, Insights
● Engineer - Alerts, Errors, Metrics
● DevOps - Alerts, Errors, Metrics, Monitoring
● Data Scientist - Scripts, Models
● Marketing - BI tools, A/B tests
● UX - Usage Patterns, A/B tests
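A/B tests appear under both Marketing and UX on this slide. As a minimal sketch of how such a test might be read (the deck does not say which statistical method the team uses, so the two-proportion z-test and all the numbers below are assumptions for illustration):

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for the difference in conversion rates.

    conv_a / n_a: conversions and visitors in variant A (control).
    conv_b / n_b: conversions and visitors in variant B (treatment).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both variants convert equally
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical experiment: 4% vs 5% conversion on 5000 visitors each
z, p = two_proportion_z_test(conv_a=200, n_a=5000, conv_b=250, n_b=5000)
print(f"z = {z:.2f}, p = {p:.4f}")
```

A small p-value suggests the difference between variants is unlikely to be noise; the threshold to act on is a team decision, not something the slides specify.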
New Cross Skills - Evangelism
● Product Managers - SQL
● Engineers - Production, Product, Metrics, SOFT SKILLS
● UX - Marketing, Analytics
● Sales Engineer
● Business - BI, Reports, Metrics
Infrastructure
Myths
● You cannot just BUY it
● It’s not a Data Warehouse
● It’s not easy at all
High Level
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Production database, Logs, Storage, BI, Prescriptions
Micro
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Production database, Log agg/Papertrail, Keen.io, Clicdata, Datadog, Prometheus, GA
Small
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Production database, Log agg/Papertrail, S3, Clicdata, Interana, Looker, Datadog, Prometheus, Slave replica
Medium
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Prod, Log stream, S3, Clicdata, Interana, Looker, Datadog, Prometheus, Slave replica, ETL, RedShift, Ad-hoc SQL
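The Medium layout above chains a log stream, S3, an ETL step, RedShift and ad-hoc SQL. A minimal sketch of that flow, under loud assumptions: the log lines, the event fields (`ts`, `user_id`, `event`) and the `daily_events` table are all hypothetical, and an in-memory SQLite database stands in for RedShift so the example runs anywhere.

```python
import json
import sqlite3
from collections import Counter

# Hypothetical raw log lines, as they might arrive from a log stream or an S3 object
RAW_LOG_LINES = [
    '{"ts": "2017-01-10T09:15:00", "user_id": 1, "event": "signup"}',
    '{"ts": "2017-01-10T09:20:00", "user_id": 2, "event": "signup"}',
    '{"ts": "2017-01-10T10:05:00", "user_id": 1, "event": "purchase"}',
    'not json, e.g. a stack trace line',
    '{"ts": "2017-01-11T08:00:00", "user_id": 3, "event": "signup"}',
]


def extract(lines):
    """Parse JSON log lines, skipping anything malformed."""
    for line in lines:
        try:
            yield json.loads(line)
        except json.JSONDecodeError:
            continue


def transform(events):
    """Aggregate events into (day, event type, count) rows."""
    counts = Counter((e["ts"][:10], e["event"]) for e in events)
    return [(day, event, n) for (day, event), n in sorted(counts.items())]


def load(conn, rows):
    """Write aggregated rows into the warehouse table (SQLite stand-in)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS daily_events (day TEXT, event TEXT, n INTEGER)"
    )
    conn.executemany("INSERT INTO daily_events VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(RAW_LOG_LINES)))

# The kind of ad-hoc SQL a product manager might run against the warehouse
signups_per_day = conn.execute(
    "SELECT day, n FROM daily_events WHERE event = 'signup' ORDER BY day"
).fetchall()
print(signups_per_day)
```

Keeping extract, transform and load as separate functions mirrors the boxes on the slide: each one could later be swapped for a managed service without touching the others.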
Medium
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Prod, AWS Kinesis, S3, Clicdata, Interana, Looker, Datadog, Slave replica, AWS Lambda, RedShift, Ad-hoc SQL
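In this variant AWS Kinesis and AWS Lambda replace the generic log stream and ETL step. The sketch below shows roughly what the Lambda side could look like: Kinesis delivers records to Lambda with base64-encoded payloads under `Records[i].kinesis.data`. The JSON payload shape and the `fake_record` helper are assumptions for illustration, and the write to S3 or RedShift is deliberately left out.

```python
import base64
import json


def handler(event, context):
    """Decode a batch of Kinesis records into parsed application events."""
    parsed, skipped = [], 0
    for record in event.get("Records", []):
        # Kinesis payloads arrive base64-encoded
        payload = base64.b64decode(record["kinesis"]["data"])
        try:
            parsed.append(json.loads(payload))
        except ValueError:
            # Not valid JSON (or not valid UTF-8): count it and move on
            skipped += 1
    # A real function would now write `parsed` to S3 or stage it for a RedShift load
    return {"parsed": parsed, "skipped": skipped}


def fake_record(raw_bytes):
    """Hypothetical helper: wrap raw bytes the way Kinesis would deliver them."""
    return {"kinesis": {"data": base64.b64encode(raw_bytes).decode("ascii")}}


# Local dry run with a hand-built event; no AWS account needed
sample_event = {
    "Records": [
        fake_record(json.dumps({"event": "signup", "user_id": 1}).encode("utf-8")),
        fake_record(b"garbled line"),
    ]
}
result = handler(sample_event, None)
print(result)
```

Counting skipped records instead of raising keeps one bad log line from failing the whole batch, which is one reasonable choice among several.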
Medium
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Prod, Kafka, S3, Clicdata, Interana, Looker, Datadog, Prometheus, Slave replica, Spark Streaming, RedShift, Ad-hoc SQL
Big
● Layers - Application, Data Infrastructure, Visualisation
● Components - App Servers, Prod, Tableau, Clicdata, Interana, Looker, Slave replica, ETL/Spark/Streaming, RedShift, Ad-hoc SQL, Hadoop/Parquet, Impala, Spark SQL, Data Science
www.squadex.com
125 University Avenue, Suite 290, Palo Alto, California, 94301
Questions, details?
We would be happy to answer!
