How to Become a Data Scientist
Ryan Orban
Co-Founder & CEO
ryan@zipfianacademy.com
@ryanorban
Why are we talking about data science?
Data Analyst Shortage
Source: http://www.delphianalytics.net/wp-content/uploads/2013/04/GrowthOfDataVsDataAnalysts.png
What is data science?
Perfect Storm
Technology
Source: http://www.jcmit.com/diskprice.htm
[Chart: disk capacity (GB) and cost per GB (USD), 1992 to 2012]
Unprecedented Data Growth
Enter the Data Scientist
What is Data Science?
+ Communication
What do people look for in a data scientist?
T-Shaped Skillset
Broad-range generalist
Deep expertise
Machine Learning, Statistics, Domain Knowledge
Software Engineering
Business Acumen
Distributed Computing
Communication
Data Science Roles
How do I become a data scientist?
Data scientists need to know how to code.
Learn to Code
Python, R, Julia
Java, C++/Go, Scala/Clojure
High-level
Lower-level
Data scientists need to be comfortable with mathematics & statistics.
Mathematics & Statistics
Statistical Analysis
• Distributions (Binomial, Poisson, etc.)
• Summary Statistics (Mean, Variance, etc.)
• Hypothesis Testing
• Bayesian Analysis
Mathematics
• Linear Algebra (Matrix Factorization)
• Calculus (Integrals, Derivatives, etc.)
• Graph Theory
• Probability/Combinatorics
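To make a few of the statistics topics above concrete, here is a minimal sketch using only the Python standard library. The sample values and the coin-flip scenario are made up for illustration and are not from the talk; the helper names `binomial_pmf` and `poisson_pmf` are hypothetical.

```python
import math
import statistics


def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)


def poisson_pmf(k: int, lam: float) -> float:
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)


# Summary statistics on a small, made-up sample.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
sample_mean = statistics.mean(sample)
population_variance = statistics.pvariance(sample)  # divides by n
sample_variance = statistics.variance(sample)       # divides by n - 1

# Hypothesis testing: one-sided exact binomial test.
# Null hypothesis: the coin is fair (p = 0.5).
# Observation (hypothetical): 8 heads in 10 flips.
# p-value = probability of seeing 8 or more heads under the null.
p_value = sum(binomial_pmf(k, 10, 0.5) for k in range(8, 11))

if __name__ == "__main__":
    print("mean:", sample_mean)
    print("population variance:", population_variance)
    print("sample variance:", round(sample_variance, 3))
    print("p-value for 8+ heads in 10 flips:", round(p_value, 4))
```

With a conventional 0.05 threshold, a p-value slightly above 0.05 would not reject the fair-coin hypothesis, which is a useful reminder that "8 out of 10" is weaker evidence than it looks.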
Data scientists need to know machine learning & software engineering.
Machine Learning & Software Engineering
Machine Learning
• Supervised (SVM, Random Forest)
• Unsupervised (K-means, LDA)
• Validation, Model Comparison
• NLP / Information Retrieval
Software Engineering
• Algorithms & Data Structures
• Data Visualization
• Data Munging
• Distributed Computing
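As one illustration of the unsupervised methods named above, the following is a minimal sketch of K-means (Lloyd's algorithm) in plain Python. It is not code from the talk: the function name `kmeans`, the deterministic "first k points" initialisation, and the toy data are all assumptions chosen to keep the example short and reproducible. In practice a library implementation with better initialisation would normally be used.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


def kmeans(points, k, iterations=20):
    """Cluster `points` (a list of equal-length tuples) into k groups.

    Assumption for reproducibility: the first k points seed the centroids.
    Returns (centroids, clusters), where clusters[i] holds the points
    assigned to centroids[i].
    """
    centroids = list(points[:k])
    clusters = [[] for _ in range(k)]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster
            else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:  # converged
            break
        centroids = new_centroids
    return centroids, clusters


# Toy data: two visually obvious groups.
data = [(1.0, 1.0), (9.0, 9.0), (1.0, 2.0), (2.0, 1.0), (9.0, 8.0), (8.0, 9.0)]
centroids, clusters = kmeans(data, k=2)

if __name__ == "__main__":
    for centroid, members in zip(centroids, clusters):
        print(centroid, members)
```

The two alternating steps (assign, then update) are the whole algorithm; the result depends on the initial centroids, which is why real implementations usually run several random restarts.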
Open-Source Data Science Masters
SlideRule
DataTau
Learning data science can be really hard.
≠ Data Science
Context is King
It’s about putting the pieces together
Pathways:
MS/PhD in Data Science
Internship
Immersive Programs
Self-study
You don’t need a PhD to do data science.
Backgrounds
Educational Background (bar chart): BS, MS, PhD
Disciplines (bar chart): Software Engineering, Analysts, Finance/Economics, Engineering, Physics, Physical Sciences, Mathematics, Statistics, Astronomy, Linguistics, Professional Poker
94% Placement Rate
91% Placement
$115k avg. salary
The Program
• 12-week immersive bootcamp in San Francisco
• Project-based curriculum with real datasets, solving actual problems
• Guest lectures from leaders in the field
• Personal mentorship to help students grow
Program Timeline
Phases: STRUCTURED CURRICULUM, HIRING DAY, CAPSTONE PROJECT, GRADUATION, INTERVIEW PREP
Weeks marked: 1, 8, 11, 12
Learning Techniques
Hiring Partners
What We Look For
• Working knowledge of programming
• Background in a quantitative discipline
• Comfortable with mathematics and statistics
• Child-like curiosity
Zipfian Academy
Data Science Immersive
Data Fellowship
Data Engineering Immersive
Weekend Workshops
Zipfian Academy
@ZipfianAcademy
Data Science Immersive: 12 weeks (Sep 8th)
http://zipfianacademy.com/apply
Weekend Workshops
http://zipfianacademy.com/workshops
Next: Interactive Visualizations w/ d3.js (July 19)
The best way to learn data science is by doing data science.
https://github.com/ipython/ipython/wiki/A-gallery-of-interesting-IPython-Notebooks
Checklist:
Learn the fundamentals
Build out a project portfolio
Apply!
Blog about your experience
A Practical Intro to Data Science
http://bit.ly/learndatascience
Thank You!
Ryan Orban
Co-Founder
ryan@zipfianacademy.com
@ryanorban
