How to Hire and Test for Data Skills:
A One-Size-Fits-All Interview Kit
09/28/2017
Tanya Cashorali
@TanyaCash21
2004 201220072005 2006 20142013 2015
SPEAKER PROFILE
TANYA CASHORALI
@TANYACASH21
2
FIRST THINGS FIRST:
WHAT IS A DATA SCIENTIST?
2005 – R IS SCARY
4
Source: https://www.youtube.com/watch?v=2SQ0O_oPpe4
DATA SCIENTIST JOB POSTING 2008
Be challenged at LinkedIn. We’re looking for
superb analytical minds of all levels to expand
our small team that will build some of the
most innovative products at LinkedIn
No specific technical skills are
required(we’ll help you learn SQL, Python,
and R). You should be extremely intelligent,
have quantitative background, and be able to
learn quickly and work independently. This is
the perfect job for someone who’s really
smart, driven, and extremely skilled at
creatively solving problems. You’ll learn
statistics, data mining, programming, and
product design, but you’ve gotta start with
what we can’t teach –intellectual sharpness
and creativity.
5
#BLAMEDREWCONWAY
6
7
8
9
10
11
12
Source: https://ironholds.org/arbitrary-things/
Strata 2017 NYC - How to Hire and Test for Data Skills: A One-Size-Fits-All Interview Kit
JOB DESCRIPTIONS NOW
STAHP, PLZ
Recommend Reading: http://www.espn.com/nba/story/_/id/17678246/basic-concepts-not-math-heart-sports-analytics
UNREALISTIC REQUIREMENTS
39% of Data Scientist
postings require an
advanced degree
Source: https://www-01.ibm.com/common/ssi/cgi-bin/ssialias?htmlfid=IML14576USEN&
DEMAND > SUPPLY
Maybe your hiring
process is flawed?
DATA MATURITY PROGRESSION
19Source: https://www.americaninno.com/boston/startup-institute-and-stattleship-sports-data-science-and-analytics-class/
Huge
Leap
WHAT SKILLS REALLY MATTER?
0% 10% 20% 30% 40% 50% 60%
Knowledge of Algorithms
Strong Statistical / Mathematical Skills
Coding Efficiency (R/Python)
Other - please describe
Communication Skills
Problem Solving / Curiosity
What is the Most Important Attribute You Look for
When Hiring a Data Scientist?
Source: https://www.youtube.com/watch?time_continue=1287&v=I7IW9Z3h20Y 20
BUILD A TEAM THAT MEETS YOUR NEEDS
Business/
Data Analyst
Engineer Statistician
 Be realistic with your
expectations
 Hire a great team instead of
holding out for a unicorn
 Clearly define goals for your
data team
 Understand the measures of
success for the roles
21
INTERVIEW TECHNIQUES THAT
WON’T CHASE AWAY
YOUR CANDIDATES
WHITEBOARDING – THE WATERBOARDING OF TECH INTERVIEWS
23
Recommended Reading:
https://medium.freecodecamp.org/why-is-hiring-broken-it-starts-at-the-whiteboard-34b088e5a5db
https://medium.com/@evnowandforever/f-you-i-quit-hiring-is-broken-bb8f3a48d324
THE ON-SITE ALL DAY GAUNTLET
24Source: https://www.forbes.com/sites/susanadams/2014/04/16/how-to-survive-a-marathon-job-interview/#716e1178517b
Don’t expect to eat at lunch.
Though a company like Lending Club claims that lunch is a time for candidates to
take a breather and relax, don’t. Your interviewers care about whether you are
socially skilled and easy to be around. This is a good opportunity to ask questions.
Query your dining companion about their career and how they like their employer.
Remember that you are still being evaluated. You may not manage more than a
few bites of food. Lierman recommends that you pack a small water bottle and
snack in your bag which you can nibble when you excuse yourself to go to the
restroom.
Jot down notes when you take a
bathroom break.
Don’t take notes during a meal or in interviews. When you go to the rest room, jot
down some points. These will come in handy when you follow up with thank-you
notes. Pay particular attention to descriptions of the company’s challenges. You
want to come off as a problem solver.
25
CLEAR AND SIMPLE QUESTIONS
26
1.Which brewery produces the strongest beers by ABV%?
2.If you had to pick 3 beers to recommend using only this data,
which would you pick?
3.Which of the factors (aroma, taste, appearance, palette) are most
important in determining the overall quality of a beer?
4.Lastly, if I typically enjoy a beer due to its aroma and appearance,
which beer style should I try?
Dataset: https://s3.amazonaws.com/demo-datasets/beer_reviews.tar.gz
27
ELEMENTS OF A SUCCESSFUL RESPONSE
28
Visualize &
Communicate
Summarize &
Explore
Aggregate &
Manipulate
Reception to
Feedback &
Criticism
Problem Solving &
Writing Skills
Communicate
Findings
COMPONENTS OF A SUCCESSFUL PRESENTATION
29
Aggregations and summary statistics should be expected, visualize
and communicate results to the business
More complex beer recommendation system, talk through a
Bayesian approach.1 Communicate results to technical peers.
Load the data into MongoDB or on Postgres RDS in the cloud,
provide instructions to an analyst on how they can query the data.
Communicate results to analyst team.
1. http://nbviewer.jupyter.org/github/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for-
Hackers/blob/master/Chapter4_TheGreatestTheoremNeverTold/Ch4_LawOfLargeNumbers_PyMC2.ipynb
TEST FOR THE ROLE (EXAMPLES)
30
Business/
Data Analyst
Engineer
Statistician
KEY TAKEAWAYS
31
 Let’s be an inclusive and approachable field
 Let’s be more realistic about expectations
 Clearly define what you need
 Give this take-home test or something similar using your own
data
 Have candidate present to the right team (interview only lasts 1
hour!)
SPECIAL THANKS TO…
32
Mara Averick
@dataandme
Oliver Keyes
@kopshtik
Renee Teate
@BecomingDataSci
Tom Neyarapally Jiggy Parikh

More Related Content

PPTX
When and Why to Use Shiny for Commercial Applications
PPTX
Popular Industry Applications of R
PDF
How To Structure Your Search Team for Success
PDF
How to Best Develop Requirements for SharePoint Projects @ #SPSATL
PPTX
Enterprise SEO
PPTX
Rapid Prototyping Data Products in Shiny - ODSC 2017
PPTX
Big Data on Azure Tutorial
PDF
Crafting a Compelling Data Science Resume
When and Why to Use Shiny for Commercial Applications
Popular Industry Applications of R
How To Structure Your Search Team for Success
How to Best Develop Requirements for SharePoint Projects @ #SPSATL
Enterprise SEO
Rapid Prototyping Data Products in Shiny - ODSC 2017
Big Data on Azure Tutorial
Crafting a Compelling Data Science Resume

Similar to Strata 2017 NYC - How to Hire and Test for Data Skills: A One-Size-Fits-All Interview Kit (20)

PDF
Lessons Learned from Hiring and Retaining Data Practitioners
PDF
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
PDF
The Research Blueprint: Excelling in Data science, Data Analysis and AI
PPTX
How to start your data career
PDF
Story of hiring great data scientists - Srinidhi Rao (Senior Partner, TheMath...
PDF
Building Data Science Teams
 
PDF
Math in data
PPTX
Are you ready for Data science? A 12 point test
PDF
Building successful data science teams
PPTX
Analysis of "Data Scientist: the sexiest job of the 21st century" by Thomas H...
PDF
3Ds of digital
PDF
DataScholar.io
PDF
Data Science Career Paths at N2 Conference
PDF
PPTX
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
PDF
From Rocket Science to Data Science
PDF
How to become a data scientist
PDF
Making the Cut: A Review of Open Talent Analytics Job Postings
PPTX
Building a Career in Data Science -WiCDS meetup
PDF
How to Build Data Science Teams
Lessons Learned from Hiring and Retaining Data Practitioners
Smart Data Slides: Data Science and Business Analysis - A Look at Best Practi...
The Research Blueprint: Excelling in Data science, Data Analysis and AI
How to start your data career
Story of hiring great data scientists - Srinidhi Rao (Senior Partner, TheMath...
Building Data Science Teams
 
Math in data
Are you ready for Data science? A 12 point test
Building successful data science teams
Analysis of "Data Scientist: the sexiest job of the 21st century" by Thomas H...
3Ds of digital
DataScholar.io
Data Science Career Paths at N2 Conference
Licensed to Analyze? Strata Data NY 2019 IADSS Session - Usama Fayyad, Hamit ...
From Rocket Science to Data Science
How to become a data scientist
Making the Cut: A Review of Open Talent Analytics Job Postings
Building a Career in Data Science -WiCDS meetup
How to Build Data Science Teams
Ad

More from Tanya Cashorali (7)

PPTX
Rapid Prototyping Data Products in Shiny - RStudio::Conf 2018
PPTX
SportsDataViz using Plotly, Shiny and Flexdashboard - PlotCon 2016
PPTX
Solving Real Business Problems with Big Data: Measuring Customer Loyalty in t...
PPTX
DataCon Talk
PPTX
Big data meetup_10_9_2013
PPTX
Front endrequirements 09_25_2013
PPTX
Microsoft NERD Talk - R and Tableau - 2-4-2013
Rapid Prototyping Data Products in Shiny - RStudio::Conf 2018
SportsDataViz using Plotly, Shiny and Flexdashboard - PlotCon 2016
Solving Real Business Problems with Big Data: Measuring Customer Loyalty in t...
DataCon Talk
Big data meetup_10_9_2013
Front endrequirements 09_25_2013
Microsoft NERD Talk - R and Tableau - 2-4-2013
Ad

Recently uploaded (20)

PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPT
Geologic Time for studying geology for geologist
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Hindi spoken digit analysis for native and non-native speakers
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PPT
What is a Computer? Input Devices /output devices
PDF
Five Habits of High-Impact Board Members
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
CloudStack 4.21: First Look Webinar slides
PDF
Getting Started with Data Integration: FME Form 101
PPTX
Modernising the Digital Integration Hub
PDF
A novel scalable deep ensemble learning framework for big data classification...
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
A comparative study of natural language inference in Swahili using monolingua...
Geologic Time for studying geology for geologist
O2C Customer Invoices to Receipt V15A.pptx
Hindi spoken digit analysis for native and non-native speakers
Module 1.ppt Iot fundamentals and Architecture
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Enhancing emotion recognition model for a student engagement use case through...
Univ-Connecticut-ChatGPT-Presentaion.pdf
DP Operators-handbook-extract for the Mautical Institute
What is a Computer? Input Devices /output devices
Five Habits of High-Impact Board Members
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Group 1 Presentation -Planning and Decision Making .pptx
WOOl fibre morphology and structure.pdf for textiles
A review of recent deep learning applications in wood surface defect identifi...
CloudStack 4.21: First Look Webinar slides
Getting Started with Data Integration: FME Form 101
Modernising the Digital Integration Hub
A novel scalable deep ensemble learning framework for big data classification...

Strata 2017 NYC - How to Hire and Test for Data Skills: A One-Size-Fits-All Interview Kit

  • 1. How to Hire and Test for Data Skills: A One-Size-Fits-All Interview Kit 09/28/2017 Tanya Cashorali @TanyaCash21
  • 2. 2004 201220072005 2006 20142013 2015 SPEAKER PROFILE TANYA CASHORALI @TANYACASH21 2
  • 3. FIRST THINGS FIRST: WHAT IS A DATA SCIENTIST?
  • 4. 2005 – R IS SCARY 4
  • 5. Source: https://www.youtube.com/watch?v=2SQ0O_oPpe4 DATA SCIENTIST JOB POSTING 2008 Be challenged at LinkedIn. We’re looking for superb analytical minds of all levels to expand our small team that will build some of the most innovative products at LinkedIn No specific technical skills are required(we’ll help you learn SQL, Python, and R). You should be extremely intelligent, have quantitative background, and be able to learn quickly and work independently. This is the perfect job for someone who’s really smart, driven, and extremely skilled at creatively solving problems. You’ll learn statistics, data mining, programming, and product design, but you’ve gotta start with what we can’t teach –intellectual sharpness and creativity. 5
  • 7. 7
  • 8. 8
  • 9. 9
  • 10. 10
  • 11. 11
  • 12. 12
  • 16. STAHP, PLZ Recommend Reading: http://www.espn.com/nba/story/_/id/17678246/basic-concepts-not-math-heart-sports-analytics
  • 17. UNREALISTIC REQUIREMENTS 39% of Data Scientist postings require an advanced degree Source: https://www-01.ibm.com/common/ssi/cgi-bin/ssialias?htmlfid=IML14576USEN&
  • 18. DEMAND > SUPPLY Maybe your hiring process is flawed?
  • 19. DATA MATURITY PROGRESSION 19Source: https://www.americaninno.com/boston/startup-institute-and-stattleship-sports-data-science-and-analytics-class/ Huge Leap
  • 20. WHAT SKILLS REALLY MATTER? 0% 10% 20% 30% 40% 50% 60% Knowledge of Algorithms Strong Statistical / Mathematical Skills Coding Efficiency (R/Python) Other - please describe Communication Skills Problem Solving / Curiosity What is the Most Important Attribute You Look for When Hiring a Data Scientist? Source: https://www.youtube.com/watch?time_continue=1287&v=I7IW9Z3h20Y 20
  • 21. BUILD A TEAM THAT MEETS YOUR NEEDS Business/ Data Analyst Engineer Statistician  Be realistic with your expectations  Hire a great team instead of holding out for a unicorn  Clearly define goals for your data team  Understand the measures of success for the roles 21
  • 22. INTERVIEW TECHNIQUES THAT WON’T CHASE AWAY YOUR CANDIDATES
  • 23. WHITEBOARDING – THE WATERBOARDING OF TECH INTERVIEWS 23 Recommended Reading: https://medium.freecodecamp.org/why-is-hiring-broken-it-starts-at-the-whiteboard-34b088e5a5db https://medium.com/@evnowandforever/f-you-i-quit-hiring-is-broken-bb8f3a48d324
  • 24. THE ON-SITE ALL DAY GAUNTLET 24Source: https://www.forbes.com/sites/susanadams/2014/04/16/how-to-survive-a-marathon-job-interview/#716e1178517b Don’t expect to eat at lunch. Though a company like Lending Club claims that lunch is a time for candidates to take a breather and relax, don’t. Your interviewers care about whether you are socially skilled and easy to be around. This is a good opportunity to ask questions. Query your dining companion about their career and how they like their employer. Remember that you are still being evaluated. You may not manage more than a few bites of food. Lierman recommends that you pack a small water bottle and snack in your bag which you can nibble when you excuse yourself to go to the restroom. Jot down notes when you take a bathroom break. Don’t take notes during a meal or in interviews. When you go to the rest room, jot down some points. These will come in handy when you follow up with thank-you notes. Pay particular attention to descriptions of the company’s challenges. You want to come off as a problem solver.
  • 25. 25
  • 26. CLEAR AND SIMPLE QUESTIONS 26 1.Which brewery produces the strongest beers by ABV%? 2.If you had to pick 3 beers to recommend using only this data, which would you pick? 3.Which of the factors (aroma, taste, appearance, palette) are most important in determining the overall quality of a beer? 4.Lastly, if I typically enjoy a beer due to its aroma and appearance, which beer style should I try? Dataset: https://s3.amazonaws.com/demo-datasets/beer_reviews.tar.gz
  • 27. 27
  • 28. ELEMENTS OF A SUCCESSFUL RESPONSE 28 Visualize & Communicate Summarize & Explore Aggregate & Manipulate
  • 29. Reception to Feedback & Criticism Problem Solving & Writing Skills Communicate Findings COMPONENTS OF A SUCCESSFUL PRESENTATION 29
  • 30. Aggregations and summary statistics should be expected, visualize and communicate results to the business More complex beer recommendation system, talk through a Bayesian approach.1 Communicate results to technical peers. Load the data into MongoDB or on Postgres RDS in the cloud, provide instructions to an analyst on how they can query the data. Communicate results to analyst team. 1. http://nbviewer.jupyter.org/github/CamDavidsonPilon/Probabilistic-Programming-and-Bayesian-Methods-for- Hackers/blob/master/Chapter4_TheGreatestTheoremNeverTold/Ch4_LawOfLargeNumbers_PyMC2.ipynb TEST FOR THE ROLE (EXAMPLES) 30 Business/ Data Analyst Engineer Statistician
  • 31. KEY TAKEAWAYS 31  Let’s be an inclusive and approachable field  Let’s be more realistic about expectations  Clearly define what you need  Give this take-home test or something similar using your own data  Have candidate present to the right team (interview only lasts 1 hour!)
  • 32. SPECIAL THANKS TO… 32 Mara Averick @dataandme Oliver Keyes @kopshtik Renee Teate @BecomingDataSci Tom Neyarapally Jiggy Parikh

Editor's Notes

  • #5: Printed out entire CRAN docs, that said I didn’t know ANYTHING, provided value with good mentors within a few months
  • #6: So then what happened?
  • #7: This happened
  • #8: anyone here have a PhD in neurocomputing? AI?
  • #9: Drew’s venn diagram within a venn diagram, accountant, head of IT
  • #10: Art & Design, NLP, Mashups
  • #14: And it got so bad DS themselves started making fun of them
  • #18: Burning Glass mined its database of over 130 million unique current and historical job listings and worked with IBM and Business-Higher Education Forum (BHEF) to identify the key roles and skills that make up the DSA jobs ecosystem. First, Burning Glass identified a set of over 300 analytical skills that represent the key DSArelated tools and competencies requested in the labor market. These skills range from such general analytical competencies as database architecture, data analysis, and data visualization to specific technologies used to perform DSA-related tasks, such as R, Hadoop, and Tableau.
  • #20: Huge talent gap, teaching experience. Are you developing algorithms? Understand the math? Just need applications?
  • #24: Whiteboarding doesn’t work, humiliating and dehumanizing process
  • #25: Whiteboarding doesn’t work, humiliating and dehumanizing process
  • #26: Pick a real problem, if you don’t have one we will give you one, and it’s fun (beer!), This test can be given to PhD level data scientists or entry level data analysts, scales to find right fit based on your problems. Task the candidate with presenting their results to your team.
  • #27: Open ended #2
  • #28: same question, 4 different responses, all correct
  • #30: Review the candidate’s coding and writing skills in their written results. This should reflect their ability to understand a question, use the right data to answer the question, and document their results to promote easy collaboration. Talk about example where guy communicated poorly Have them present these findings not only to the technical team, but also to executives if you expect this candidate to be presenting complicated results to business stakeholders. They should tailor their presentation accordingly. Be mindful of their ability to take feedback and constructive criticism from your team. Some candidates may be more defensive about their approach and not respond well to questions. This is a clear sign of someone that may be difficult to work with in a team-based environment.
  • #31: These are just examples
  • #32: You are a data scientist if you solve problems using data