SlideShare a Scribd company logo
Developing
data scientists
Erin Shellman PhD
Data Science Kick-off
University of Michigan
October 6, 2015
effective
MS in Biostatistics ’08
PhD in Bioinformatics ‘12
…and an alum!
I’m Erin and I’m a
data scientist.
data scientists
communicate well.
effective
1. Present your work
2. Blog
3. Teach others
Student’s t0-do:
Center coursework
on presentation and
communication
Teacher’s t0-do:
data scientists write
high-quality code.
effective
–Sarah Guido, Data Scientist @ Bitly and UM alum
The single most useful thing I learned
in grad school was how to work with
data in Python.
–Trey Causey, Data Scientist @ ChefSteps, @4thdownbot
I wish there had been more emphasis
on real data/messy data/getting my
own data/cleaning data.
1. Develop deep expertise in a
programming language
2. Adopt good coding
practices now
Student’s t0-do:
1. Expose students to a variety
of datasets
2. Focus on applications
Teacher’s t0-do:
data scientists are
curious, life-long
learners.
effective
–Wendy Grus, Technical Data Analyst @ Inrix and UM alum
LEARN MORE CODING AND
STATISTICS. Meet other people working
on data sets not in your field. Learn
something not in your field. Don't sit in
a box staring at your thesis project.
–Amanda Casari, Senior Data Scientist @ Concur
I took Naval Criminal Law & international
relations. It was hard to stay awake. By
comparison, I looked forward to vector
calculus and loved Matlab. Lesson: Don't
be afraid to explore your options while
you are still on the track of figuring out
what comes next.
–James Pestrak, Senior Data Scientist @ Nordstrom
Boundaries between disciplines are
becoming less-defined. There are general,
powerful patterns of thinking that apply
to many disciplines. A good way to
become aware of these patterns is to
broaden exposure to disciplines.
1. Pursue academic interests
outside your discipline
2. Go to meet-ups!
Student’s t0-do:
1. Teach with diverse datasets
2. Illustrate parallels between
disciplines
Teacher’s t0-do:
Big ups to these
data scientists
effective
Amanda Casari
Trey Causey
Wendy Grus
Sarah Guido
James Pestrak

More Related Content

PPTX
Roles and Functions of Computers
PDF
More ways than one: Establishing research data services at an academic medica...
PPTX
Hermiston slide share
PPTX
Assessment Forum 2013 - Columbia University Libraries - 13_0620
PPTX
Expanding JSTOR's Support for Higher Education in Prison - NCHEP 2019
PPTX
Use of online resources
PPTX
Breaking Down Barriers to Higher Education in Prison: Access to Library Resou...
PPT
Bibliographic tools - Mendeley
Roles and Functions of Computers
More ways than one: Establishing research data services at an academic medica...
Hermiston slide share
Assessment Forum 2013 - Columbia University Libraries - 13_0620
Expanding JSTOR's Support for Higher Education in Prison - NCHEP 2019
Use of online resources
Breaking Down Barriers to Higher Education in Prison: Access to Library Resou...
Bibliographic tools - Mendeley

Viewers also liked (15)

PDF
Catching the most with high-throughput screening
PDF
Bot or Not
PDF
Intro to web scraping with Python
PDF
Web Scraping is BS
PDF
Scraping with Python for Fun and Profit - PyCon India 2010
PDF
Python beautiful soup - bs4
PPTX
Web Scraping With Python
PDF
Parse The Web Using Python+Beautiful Soup
PDF
Python, web scraping and content management: Scrapy and Django
PDF
Downloading the internet with Python + Scrapy
PDF
How to Scrap Any Website's content using ScrapyTutorial of How to scrape (cra...
PDF
Web Scraping with Python
PDF
Scraping the web with python
PDF
Web scraping in python
PDF
Beautiful soup
Catching the most with high-throughput screening
Bot or Not
Intro to web scraping with Python
Web Scraping is BS
Scraping with Python for Fun and Profit - PyCon India 2010
Python beautiful soup - bs4
Web Scraping With Python
Parse The Web Using Python+Beautiful Soup
Python, web scraping and content management: Scrapy and Django
Downloading the internet with Python + Scrapy
How to Scrap Any Website's content using ScrapyTutorial of How to scrape (cra...
Web Scraping with Python
Scraping the web with python
Web scraping in python
Beautiful soup
Ad

Similar to Developing effective data scientists (20)

PPTX
Immersive informatics - research data management at Pitt iSchool and Carnegie...
PPTX
Top Uses of Data Science in Education You Need to Know.pptx
PPTX
Ethical challenges for learning analytics
PPT
ICT Pedagogical Certificate
PPTX
SHEILA-CRLI seminar
PPTX
Open access is not enough, information skills are also needed
PPT
IL Psychology Librarians 2009
PPT
Lilac 2009 Information Literacy as a habit of learning
PPTX
Why should I care about information literacy?
PPTX
Latha K D ICT.pptx
PPTX
Data Science and Online Education
PPT
Discovery event stuart lee (the humanities researcher)
PPTX
Putting Students in the SADL
PPTX
Educational Technology - opportunities and pitfalls How to make the most use...
PPTX
Media Literacy in Education
DOCX
EDM: Finding Answers to Reduce Dropout Rates in Schools
PDF
Q3-M6_3Is_Data Collection Procedure.pdf
PPTX
Literacy in the information age
PDF
Future of Digital and Social Media in Academics by Dr. H Chaturvedi
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Top Uses of Data Science in Education You Need to Know.pptx
Ethical challenges for learning analytics
ICT Pedagogical Certificate
SHEILA-CRLI seminar
Open access is not enough, information skills are also needed
IL Psychology Librarians 2009
Lilac 2009 Information Literacy as a habit of learning
Why should I care about information literacy?
Latha K D ICT.pptx
Data Science and Online Education
Discovery event stuart lee (the humanities researcher)
Putting Students in the SADL
Educational Technology - opportunities and pitfalls How to make the most use...
Media Literacy in Education
EDM: Finding Answers to Reduce Dropout Rates in Schools
Q3-M6_3Is_Data Collection Procedure.pdf
Literacy in the information age
Future of Digital and Social Media in Academics by Dr. H Chaturvedi
Ad

More from Erin Shellman (6)

PDF
Case studies in data-driven merchandising
PDF
Building Robust Pipelines with Airflow
PDF
Fun! with the Twitter API
PDF
real time real talk
PDF
Collaborative Filtering for fun ...and profit!
PDF
Assumptions: Check yo'self before you wreck yourself
Case studies in data-driven merchandising
Building Robust Pipelines with Airflow
Fun! with the Twitter API
real time real talk
Collaborative Filtering for fun ...and profit!
Assumptions: Check yo'self before you wreck yourself

Recently uploaded (20)

PPTX
OMC Textile Division Presentation 2021.pptx
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
A Presentation on Touch Screen Technology
PDF
August Patch Tuesday
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
WOOl fibre morphology and structure.pdf for textiles
PPTX
Chapter 5: Probability Theory and Statistics
OMC Textile Division Presentation 2021.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Hindi spoken digit analysis for native and non-native speakers
A comparative analysis of optical character recognition models for extracting...
Encapsulation_ Review paper, used for researhc scholars
Digital-Transformation-Roadmap-for-Companies.pptx
Programs and apps: productivity, graphics, security and other tools
A Presentation on Touch Screen Technology
August Patch Tuesday
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Zenith AI: Advanced Artificial Intelligence
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
A novel scalable deep ensemble learning framework for big data classification...
WOOl fibre morphology and structure.pdf for textiles
Chapter 5: Probability Theory and Statistics

Developing effective data scientists

  • 1. Developing data scientists Erin Shellman PhD Data Science Kick-off University of Michigan October 6, 2015 effective
  • 2. MS in Biostatistics ’08 PhD in Bioinformatics ‘12 …and an alum! I’m Erin and I’m a data scientist.
  • 4. 1. Present your work 2. Blog 3. Teach others Student’s t0-do:
  • 5. Center coursework on presentation and communication Teacher’s t0-do:
  • 7. –Sarah Guido, Data Scientist @ Bitly and UM alum The single most useful thing I learned in grad school was how to work with data in Python.
  • 8. –Trey Causey, Data Scientist @ ChefSteps, @4thdownbot I wish there had been more emphasis on real data/messy data/getting my own data/cleaning data.
  • 9. 1. Develop deep expertise in a programming language 2. Adopt good coding practices now Student’s t0-do:
  • 10. 1. Expose students to a variety of datasets 2. Focus on applications Teacher’s t0-do:
  • 11. data scientists are curious, life-long learners. effective
  • 12. –Wendy Grus, Technical Data Analyst @ Inrix and UM alum LEARN MORE CODING AND STATISTICS. Meet other people working on data sets not in your field. Learn something not in your field. Don't sit in a box staring at your thesis project.
  • 13. –Amanda Casari, Senior Data Scientist @ Concur I took Naval Criminal Law & international relations. It was hard to stay awake. By comparison, I looked forward to vector calculus and loved Matlab. Lesson: Don't be afraid to explore your options while you are still on the track of figuring out what comes next.
  • 14. –James Pestrak, Senior Data Scientist @ Nordstrom Boundaries between disciplines are becoming less-defined. There are general, powerful patterns of thinking that apply to many disciplines. A good way to become aware of these patterns is to broaden exposure to disciplines.
  • 15. 1. Pursue academic interests outside your discipline 2. Go to meet-ups! Student’s t0-do:
  • 16. 1. Teach with diverse datasets 2. Illustrate parallels between disciplines Teacher’s t0-do:
  • 17. Big ups to these data scientists effective Amanda Casari Trey Causey Wendy Grus Sarah Guido James Pestrak