SlideShare a Scribd company logo
Enabling
Full Stack
Data Scientists
at Stitch Fix
Juliet Hougland
@j_houg
April 2018
Agenda
@j_houg 2
WHAT WE DO
HOW WE OPERATE
TRANSFORMING THE WAY
DATA SCIENTISTS
DO WHAT THEY LOVE
@j_houg
The Business
The Customer Experience
What We Do
@j_houg
@j_houg
Transforming The
Way People Find
What They Love
@j_houg
@j_houg
The Algorithms Org
Operating Principles
Data Platform
How We Operate
@j_houg
Algorithms
Data Platform Data Science
ClientStyling Operations Horizontal
@j_houg
Autonomy &
Ownership
@j_houg
Typical Data Science Department
Data
Engineers
Data
Scientists
Software
Engineers
Data Infrastructure Engineers
provide
resources &
support
@j_houg
Typical Data Science Department
Data Driven Capability
Data Engineers
Data Scientists
Software Engineers
Infrastructure Engineers
@j_houg
Another View
Capability A Capability B Capability C Capability D
Data Engineers
Data Scientists
Software Engineers
Infrastructure Engineers
specialized by function
@j_houg
High Coordination Cost
Capability A Capability B Capability C Capability D
Data Engineers
Data Scientists
Software Engineers
Infrastructure Engineers
specialization by function
tightly
coupled,
loosely
aligned
@j_houg
A Different Way to Organize
Data Driven
Capability
DataScientists
@j_houg
Organized by Capability
specialized by capability
Capability A Capability B Capability C Capability D
DataScientists
DataScientists
DataScientists
DataScientists
@j_houg
Focus on Capabilities
Data Driven
Capability
DataScientists
Full Stack Data
Scientists:
" Full Stack Data Scientists Own:
" Implementation
" Coordination with the business
" Production support (sort of)
@j_houg
Where did the
engineers go?
@j_houg
Focus on Capabilities
Data Platform Engineers
loosely
coupled,
tightly
aligned
specialized by function
Capability A Capability B Capability C Capability D
DataScientists
specialized by capability
DataScientists
DataScientists
DataScientists
@j_houg
Capabilities
Data Platform Engineers
specialized by function
Capability A Capability B Capability C Capability D
DataScientists
specialized by capability
DataScientists
DataScientists
DataScientists
@j_houg
Infrastructure
@j_houg
Black Box Data Scientists
@j_houg
Data Transport
@j_houg
The Black Box
@j_houg
Analysis &
Model Building
@j_houg
Analysis &
Model Building
@j_houg
Analysis &
Model Building
Hard:
•Reproducible
•Accurate
•Collaborative
Opinionated Analysis Development —
Hilary Parker
https://peerj.com/preprints/3210/
@j_houg
Analysis &
Model Building
@j_houg
Analysis &
Model Building
@j_houg
Dashboards
@j_houg
Services
Model
Application
Code
@j_houg
Services
@j_houg
What works
well?
@j_houg
What are the
challenges?
@j_houg
Thanks!
@j_houg
Interested in
what you heard?
We are hiring!
@j_houg
Questions?

More Related Content

DOCX
A Simple Approach for Screening Profile for Recruitment
PPTX
Data science using r multisoft systems
PDF
Data Discoverability at SpotHero
PDF
Production model deployment
PDF
Production Model Deployment - StitchFix - 2018
PPTX
DataScienceConnect Atlanta 2019 - Building Data & Analytics Teams
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
PDF
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
A Simple Approach for Screening Profile for Recruitment
Data science using r multisoft systems
Data Discoverability at SpotHero
Production model deployment
Production Model Deployment - StitchFix - 2018
DataScienceConnect Atlanta 2019 - Building Data & Analytics Teams
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t
Adi Wijaya - Scrum in Data Science, What Works and What Doesn’t

Similar to Enabling full stack data scientists (20)

PPTX
The Best Data Analyst Jobs in the USA Data Analyst
PDF
[Brighton SEO] Audience Intelligence & SEO: How to integrate data sources to ...
PDF
Data Discoverability with DataHub
PDF
The Convergence of Data Science and Software Development
PPTX
final_presentation[1]2333.pptx for acharya ploytechnic
PDF
Big data careers
PPTX
Data Science
PDF
Getting Content Out The Door Quickly with Scraping, Outsourcing and Team Work...
PPTX
Nonprofits + Data: Pathway to Innovation
PPTX
Analytics Organizations & The New Emerging Roles
PDF
IMGS 2015 - Ordnance Survey Ireland - Hugh Mangan
PDF
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
PDF
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
PDF
AI Deep Dive_ A Journey through Heroku_OpenAI Integration.pdf
PDF
How to Use Data for Good
PDF
Big Data LDN 2018: FIGHTING DATA CHAOS: CONNECTING USERS TO DATA AT SCALE
PDF
Strata Conference NYC 2013
PDF
Data Science Pipelines in Python using Luigi
PPTX
Building Data Science Pipelines in Python using Luigi
PDF
Come diventare data scientist - Paolo Pellegrini
The Best Data Analyst Jobs in the USA Data Analyst
[Brighton SEO] Audience Intelligence & SEO: How to integrate data sources to ...
Data Discoverability with DataHub
The Convergence of Data Science and Software Development
final_presentation[1]2333.pptx for acharya ploytechnic
Big data careers
Data Science
Getting Content Out The Door Quickly with Scraping, Outsourcing and Team Work...
Nonprofits + Data: Pathway to Innovation
Analytics Organizations & The New Emerging Roles
IMGS 2015 - Ordnance Survey Ireland - Hugh Mangan
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
AI Deep Dive_ A Journey through Heroku_OpenAI Integration.pdf
How to Use Data for Good
Big Data LDN 2018: FIGHTING DATA CHAOS: CONNECTING USERS TO DATA AT SCALE
Strata Conference NYC 2013
Data Science Pipelines in Python using Luigi
Building Data Science Pipelines in Python using Luigi
Come diventare data scientist - Paolo Pellegrini
Ad

More from Stitch Fix Algorithms (10)

PPTX
Progression by Regression: How to increase your A/B Test Velocity
PPTX
Deep recommendations in PyTorch
PDF
Tracking data lineage at Stitch Fix
PDF
Improving ad hoc and production workflows at Stitch Fix
PDF
A compute infrastructure for data scientists
PPTX
Moment-based estimation for hierarchical models in Apache Spark
PPTX
Optimizing Spark
PPTX
When We Spark and When We Don’t: Developing Data and ML Pipelines
PPTX
Incrementality
PDF
Apache Spark & ML Workflows
Progression by Regression: How to increase your A/B Test Velocity
Deep recommendations in PyTorch
Tracking data lineage at Stitch Fix
Improving ad hoc and production workflows at Stitch Fix
A compute infrastructure for data scientists
Moment-based estimation for hierarchical models in Apache Spark
Optimizing Spark
When We Spark and When We Don’t: Developing Data and ML Pipelines
Incrementality
Apache Spark & ML Workflows
Ad

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Approach and Philosophy of On baking technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
Teaching material agriculture food technology
PPTX
Big Data Technologies - Introduction.pptx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
KodekX | Application Modernization Development
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
NewMind AI Weekly Chronicles - August'25 Week I
Advanced methodologies resolving dimensionality complications for autism neur...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Encapsulation_ Review paper, used for researhc scholars
Approach and Philosophy of On baking technology
Programs and apps: productivity, graphics, security and other tools
Reach Out and Touch Someone: Haptics and Empathic Computing
Understanding_Digital_Forensics_Presentation.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Empathic Computing: Creating Shared Understanding
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Dropbox Q2 2025 Financial Results & Investor Presentation
Teaching material agriculture food technology
Big Data Technologies - Introduction.pptx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
KodekX | Application Modernization Development
Chapter 3 Spatial Domain Image Processing.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Spectral efficient network and resource selection model in 5G networks
NewMind AI Weekly Chronicles - August'25 Week I

Enabling full stack data scientists