SlideShare a Scribd company logo
2
Most read
4
Most read
KDD 2022, W26 Half Day Workshop – Tuesday August 16
3rd IADSS Workshop on Data Science Standards – Hiring, Assessing and Upskilling Data Science Talent
IADSS link https://www.iadss.org/
Workshop link https://www.iadss.org/kdd2022
One Hiring Process – Quiz for Learning and Reasoning
Greg Makowski
Head of Data Science Solutions
greg.Makowski@jci.com
www.LinkedIn.com/in/GregMakowski
Building Technologies & Solutions
2
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Context
 My background
• I have been deploying data mining and data science models since 1992
• I have been hiring and building DS teams since 2010
 Assumptions driving requirements and design
• You can have candidates describe their past models deployed and development
process. That is not hard.
– However, that may not be a close match for upcoming work
• The types of future algorithms and architectures will likely be constantly changing,
between the project requirements and what is developing as State Of The Art (SOTA)
• Therefore, it would be good to get a sense of how a candidate would do on future,
upcoming projects and new algorithms
• There is a large variety of new things to learn, of new algorithms or tweaks at a
conference – but you shouldn’t expect the candidate to know all
•  Expect that constant learning is needed in the future
•  interview for the candidates learning process
3
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Context – other parts of the interview process
• Job description – looking for a person that has ”deployed 3+ models”, like a ”medical residency”. 3-12 yrs
• Other parts of the interview process already cover common basics
– basic coding or software development skills
– algorithm skills on algorithms they know well or have deployed
• Treat the interview process as a “dating game” to get a good match for the candidate as well as the
employer
– What the company has to offer, existing enterprise software, tools, past projects
– Career path in Data Science
– “A day in the life of”, a “week in the life of” the candidate in the role
– Personal responsibility, ownership, how the they can contribute to the employers OKR’s (Objectives and Key
Results)
– Intellectual variety – vertical markets, customer personas, algorithm families
– Opportunities to intellectually grow, learn and adapt
• In final round, have candidate present on past project for 45 min, which includes Q&A
– Focus on their DS “value add”, avoid anything confidential to past client or co
– Like an internal DS presentation or to DS contacts at a client site, to make it easier
– Can be a reused conference or meetup presentation
– A chance for all the DS staff in a compatible time zone to understand the past project and ask questions
Exciting Mission
• Save the world
• Save people
• Save machines
4
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the DS phone interview process
 ”Teach” for 15 minutes, going over content in slides
• Let the candidate ask questions, it is fine to ask to go back and forth in the slides
• Give a problem context, a client role in a vertical domain, their values and KPIs
• Pick something that goes deep into an algorithm, not at the “surface level” of a Python
function call, but on how the algorithm interacts with the data
• Every few slides, ask if they understand or have questions
• Describe in terms of different modalities (math, geometry, process)
 “ask reasoning questions” that may take 15-30 min
5
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the DS phone interview process
 “ask reasoning questions” that may take 15-30 min
• One question is more algorithmic or geometrical
• One question is more about “fiduciary responsibility to the values and KPIs of the
client”, for the DS to think as a consultant or in the business deployment terms
• Tell the candidate they can ask to go back and forth over any of the training slides
• They can ask questions of the client contact. Then roleplay as a client (not knowing
DS).
• Track the time it takes per question, as one metric to compare candidates. It is mostly
used for comparisons of large differences
• For any ”binary” answers, the candidate has to explain “why”. Ask probing questions
• Ask the candidate to declare a “final answer state”. Don’t stop them if you hear
something close. They should be confident and describe their reasoning
– When deploying in the future, they have to be confident to know their “final answer state”
6
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Design of one part of the “on-site” interview process
 Go over a short 2 week POC project in slides (something I know well and
worked on). Provide background slides w text and diagrams for 15 min
• Client requirements
• Data, it’s noise, what it represents in terms of a physical process or human decisions,
sample tables of different structures
• Initial EDA that you could take for granted that most people would do
• Offer to “role play” or be a “holodeck simulator” to answer questions
– Of the client, who knows objectives and data at a business process level
– Of the data, for a specified EDA, program or model, get a high level answer
– A fellow DS on the team, who doesn’t know the problem – may trigger questions
– Can go back and forth to any of the background slides
• Questions for ~30 min:
– How to structure the data to fit some algorithm family (very involved for this project)
– How to train the models, model development process (less time, maybe 10 min)
– Once you have an algorithm result, so what? What business process do you change to get
value or a business KPI metric? How to estimate the value? (not obvious for this project)
7
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
When designing questions, consider:
 Pick an algorithm or algorithm feature that is intentionally obscure
• Want to see a candidate learning and adapting experience, how they ask questions
• Want to avoid 1 in 10 candidates who already know the answer – then you can’t
compare how they “would learn X the first time”
 In the age of remote interviews, a person can web search easily
• Check common web searches in advance, what they return and how that could be
used. Consider how you may tweak your question or setup
• Maybe don’t use an algorithm or feature name, just call it “alg X”
8
© Copyright Johnson Controls. All rights reserved.
Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential
Questions, discussions, alternate approaches, advice?
 Since 2010, candidates hired this way have been good at
• adapting to new projects and requirements
• No retention surprises or mis-hires

More Related Content

PDF
System Design Interview - from both sides of the table.pdf
PDF
Practical_Business_Rules_Development_and_Use
PDF
Using the power of OpenAI with your own data: what's possible and how to start?
PPTX
Microsoft interview questions Microsoft sde sdet jobs Microsoft Careers
PDF
Usability Testing for Qualitative Researchers - QRCA NYC Chapter event
PPTX
Google interview questions
PDF
Europython how to make it recruiting suck less?
PDF
Bpma contextual inquiry
System Design Interview - from both sides of the table.pdf
Practical_Business_Rules_Development_and_Use
Using the power of OpenAI with your own data: what's possible and how to start?
Microsoft interview questions Microsoft sde sdet jobs Microsoft Careers
Usability Testing for Qualitative Researchers - QRCA NYC Chapter event
Google interview questions
Europython how to make it recruiting suck less?
Bpma contextual inquiry

Similar to A Successful Hiring Process for Data Scientists (20)

PPTX
Lessons after working as a data scientist for 1 year
PDF
Jump Start Agile Testing with Acceptance Test Driven Development
PPT
req engg (1).ppt
PPTX
Requirement Analysis-2.pptxrfgghrkjbnrjb
PPT
Lecture 9 understanding requirements
PPTX
Conference Room Prototype – a low cost, high value approach to selecting the ...
DOC
Naveen Prakash Resume
DOCX
I2t2
PPT
Genesis10jobs1
PPT
Designing Better Applications, Websites and Intranets
PPT
Designing Better Applications, Website and Intranets
PDF
Quality & Risk Management Challenges When Acquiring Enterprise Systems
PDF
Model Factory at ING Bank
PDF
Barga Data Science lecture 2
PPTX
Microsoft Copilot Studio - Implementation Guide (1.5).pptx
PDF
Demystifying ML/AI
PDF
Mechanical Turk Under the Hood
PDF
KSU IT4983 Capstone Projects Report 2017 Update
PPTX
Best practice for_agile_ds_projects
PDF
KSU IT Capstone Report 2012-2017.pdf
Lessons after working as a data scientist for 1 year
Jump Start Agile Testing with Acceptance Test Driven Development
req engg (1).ppt
Requirement Analysis-2.pptxrfgghrkjbnrjb
Lecture 9 understanding requirements
Conference Room Prototype – a low cost, high value approach to selecting the ...
Naveen Prakash Resume
I2t2
Genesis10jobs1
Designing Better Applications, Websites and Intranets
Designing Better Applications, Website and Intranets
Quality & Risk Management Challenges When Acquiring Enterprise Systems
Model Factory at ING Bank
Barga Data Science lecture 2
Microsoft Copilot Studio - Implementation Guide (1.5).pptx
Demystifying ML/AI
Mechanical Turk Under the Hood
KSU IT4983 Capstone Projects Report 2017 Update
Best practice for_agile_ds_projects
KSU IT Capstone Report 2012-2017.pdf
Ad

More from Greg Makowski (18)

PPTX
Defense Against LLM Scheming 2025_04_28.pptx
PPTX
Understanding Hallucinations in LLMs - 2023 09 29.pptx
PPTX
Future of AI - 2023 07 25.pptx
PDF
Kdd 2019: Standardizing Data Science to Help Hiring
PPTX
Tales from an ip worker in consulting and software
PPTX
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
PDF
Production model lifecycle management 2016 09
PDF
Using Deep Learning to do Real-Time Scoring in Practical Applications
PDF
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
PDF
SFbayACM ACM Data Science Camp 2015 10 24
PDF
How to Create 80% of a Big Data Pilot Project
PDF
Powering Real­time Decision Engines in Finance and Healthcare using Open Sour...
PDF
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
PDF
Heuristic design of experiments w meta gradient search
PDF
Three case studies deploying cluster analysis
PDF
Linked In Slides 2009 02 24 B
PDF
The 360º Leader (Section 2 of 6)
PPT
The 360º Leader (Section 1 of 6)
Defense Against LLM Scheming 2025_04_28.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Future of AI - 2023 07 25.pptx
Kdd 2019: Standardizing Data Science to Help Hiring
Tales from an ip worker in consulting and software
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Production model lifecycle management 2016 09
Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
SFbayACM ACM Data Science Camp 2015 10 24
How to Create 80% of a Big Data Pilot Project
Powering Real­time Decision Engines in Finance and Healthcare using Open Sour...
Kamanja: Driving Business Value through Real-Time Decisioning Solutions
Heuristic design of experiments w meta gradient search
Three case studies deploying cluster analysis
Linked In Slides 2009 02 24 B
The 360º Leader (Section 2 of 6)
The 360º Leader (Section 1 of 6)
Ad

Recently uploaded (20)

PPTX
SAP 2 completion done . PRESENTATION.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPT
Quality review (1)_presentation of this 21
PPTX
modul_python (1).pptx for professional and student
PDF
Introduction to the R Programming Language
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Computer network topology notes for revision
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Mega Projects Data Mega Projects Data
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Supervised vs unsupervised machine learning algorithms
PDF
Transcultural that can help you someday.
SAP 2 completion done . PRESENTATION.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
Quality review (1)_presentation of this 21
modul_python (1).pptx for professional and student
Introduction to the R Programming Language
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Reliability_Chapter_ presentation 1221.5784
[EN] Industrial Machine Downtime Prediction
Computer network topology notes for revision
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
oil_refinery_comprehensive_20250804084928 (1).pptx
Qualitative Qantitative and Mixed Methods.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Mega Projects Data Mega Projects Data
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to Knowledge Engineering Part 1
Supervised vs unsupervised machine learning algorithms
Transcultural that can help you someday.

A Successful Hiring Process for Data Scientists

  • 1. KDD 2022, W26 Half Day Workshop – Tuesday August 16 3rd IADSS Workshop on Data Science Standards – Hiring, Assessing and Upskilling Data Science Talent IADSS link https://www.iadss.org/ Workshop link https://www.iadss.org/kdd2022 One Hiring Process – Quiz for Learning and Reasoning Greg Makowski Head of Data Science Solutions greg.Makowski@jci.com www.LinkedIn.com/in/GregMakowski Building Technologies & Solutions
  • 2. 2 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Context  My background • I have been deploying data mining and data science models since 1992 • I have been hiring and building DS teams since 2010  Assumptions driving requirements and design • You can have candidates describe their past models deployed and development process. That is not hard. – However, that may not be a close match for upcoming work • The types of future algorithms and architectures will likely be constantly changing, between the project requirements and what is developing as State Of The Art (SOTA) • Therefore, it would be good to get a sense of how a candidate would do on future, upcoming projects and new algorithms • There is a large variety of new things to learn, of new algorithms or tweaks at a conference – but you shouldn’t expect the candidate to know all •  Expect that constant learning is needed in the future •  interview for the candidates learning process
  • 3. 3 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Context – other parts of the interview process • Job description – looking for a person that has ”deployed 3+ models”, like a ”medical residency”. 3-12 yrs • Other parts of the interview process already cover common basics – basic coding or software development skills – algorithm skills on algorithms they know well or have deployed • Treat the interview process as a “dating game” to get a good match for the candidate as well as the employer – What the company has to offer, existing enterprise software, tools, past projects – Career path in Data Science – “A day in the life of”, a “week in the life of” the candidate in the role – Personal responsibility, ownership, how the they can contribute to the employers OKR’s (Objectives and Key Results) – Intellectual variety – vertical markets, customer personas, algorithm families – Opportunities to intellectually grow, learn and adapt • In final round, have candidate present on past project for 45 min, which includes Q&A – Focus on their DS “value add”, avoid anything confidential to past client or co – Like an internal DS presentation or to DS contacts at a client site, to make it easier – Can be a reused conference or meetup presentation – A chance for all the DS staff in a compatible time zone to understand the past project and ask questions Exciting Mission • Save the world • Save people • Save machines
  • 4. 4 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the DS phone interview process  ”Teach” for 15 minutes, going over content in slides • Let the candidate ask questions, it is fine to ask to go back and forth in the slides • Give a problem context, a client role in a vertical domain, their values and KPIs • Pick something that goes deep into an algorithm, not at the “surface level” of a Python function call, but on how the algorithm interacts with the data • Every few slides, ask if they understand or have questions • Describe in terms of different modalities (math, geometry, process)  “ask reasoning questions” that may take 15-30 min
  • 5. 5 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the DS phone interview process  “ask reasoning questions” that may take 15-30 min • One question is more algorithmic or geometrical • One question is more about “fiduciary responsibility to the values and KPIs of the client”, for the DS to think as a consultant or in the business deployment terms • Tell the candidate they can ask to go back and forth over any of the training slides • They can ask questions of the client contact. Then roleplay as a client (not knowing DS). • Track the time it takes per question, as one metric to compare candidates. It is mostly used for comparisons of large differences • For any ”binary” answers, the candidate has to explain “why”. Ask probing questions • Ask the candidate to declare a “final answer state”. Don’t stop them if you hear something close. They should be confident and describe their reasoning – When deploying in the future, they have to be confident to know their “final answer state”
  • 6. 6 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Design of one part of the “on-site” interview process  Go over a short 2 week POC project in slides (something I know well and worked on). Provide background slides w text and diagrams for 15 min • Client requirements • Data, it’s noise, what it represents in terms of a physical process or human decisions, sample tables of different structures • Initial EDA that you could take for granted that most people would do • Offer to “role play” or be a “holodeck simulator” to answer questions – Of the client, who knows objectives and data at a business process level – Of the data, for a specified EDA, program or model, get a high level answer – A fellow DS on the team, who doesn’t know the problem – may trigger questions – Can go back and forth to any of the background slides • Questions for ~30 min: – How to structure the data to fit some algorithm family (very involved for this project) – How to train the models, model development process (less time, maybe 10 min) – Once you have an algorithm result, so what? What business process do you change to get value or a business KPI metric? How to estimate the value? (not obvious for this project)
  • 7. 7 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential When designing questions, consider:  Pick an algorithm or algorithm feature that is intentionally obscure • Want to see a candidate learning and adapting experience, how they ask questions • Want to avoid 1 in 10 candidates who already know the answer – then you can’t compare how they “would learn X the first time”  In the age of remote interviews, a person can web search easily • Check common web searches in advance, what they return and how that could be used. Consider how you may tweak your question or setup • Maybe don’t use an algorithm or feature name, just call it “alg X”
  • 8. 8 © Copyright Johnson Controls. All rights reserved. Johnson Controls—Public. Any unauthorized use, copying or distribution is strictly prohibited. Johnson Controls Confidential Questions, discussions, alternate approaches, advice?  Since 2010, candidates hired this way have been good at • adapting to new projects and requirements • No retention surprises or mis-hires