How You Can Use Open
Source Materials to Learn
Python & Data Science
Kamila Stępniowska, EuroPython 2018
github.com/KStepniowska/EuroPython2018
CC-BY
What can you expect?
- Sociology
- Diversity: Geek Girls Carrots, Women Who Code, She’s Coding
- New Business Manager @10Clouds
Please don’t ask me about:
1. How can you become a data scientist in 3 weeks?
2. Which algorithm will solve an “A” or “B” problem?
EuroPython 2018 Kamila Stępniowska, CC-BY
I hope that you will learn more about...
RESOURCES - Python and Data Science
github.com/KStepniowska/EuroPython2018
Open Source - basics
Data Science Workflow
PROJECTS & COOPERATION & CONTRIBUTION
Shall we?
Fernando José Ignacio
Gárate Parra
https://bit.ly/2A5MoOW
CC BY-NC 2.0
Open Source
“Open data and content can be freely
used, modified, and shared by anyone
for any purpose”
https://opendefinition.org/
Educational Materials
Text, Pictures, Videos, Audio Records...
https://creativecommons.org/licenses/
As a User
Code
https://opensource.org/licenses
https://www.gnu.org/graphics/license-logos.en.html
As a User
Pick yours
General: choosealicense.com
Text: creativecommons.org/licenses/
Code: opensource.org/licenses
As a Creator
Python
Why Python?
COMMUNITY
Welcoming & Supportive
Global & Diverse
...
If there is a problem, there is a great chance that someone has written and
shared the solution already.
Learning Experience
Find Your Project -> learn by building
Find Your People -> Cooperate
Find a way to Contribute -> help others
https://bugs.python.org/
For Beginners
PEP 8 python.org/dev/peps/pep-0008/
“PEP 8 — the Style Guide for Python Code
This stylized presentation of the well-established PEP 8 was created by Kenneth
Reitz (for humans).”
*PEP=Python Enhancement Proposal pep8.org/#fn1
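A few PEP 8 conventions are easier to remember once seen in code. The sketch below is only an illustration: the names `MAX_RETRIES`, `DataCleaner`, and `strip_blank_rows` are made up for this example and come from neither the talk nor PEP 8 itself.

```python
"""Illustrative module showing a handful of PEP 8 conventions."""

MAX_RETRIES = 3  # constants: UPPER_CASE_WITH_UNDERSCORES


# Two blank lines separate top-level definitions; class names use CapWords.
class DataCleaner:
    """Hypothetical helper used only to demonstrate naming and layout."""

    def __init__(self, rows):
        self.rows = rows  # four spaces per indentation level

    # Functions and methods: lowercase_with_underscores.
    def strip_blank_rows(self):
        """Return the rows that contain at least one non-empty value."""
        return [row for row in self.rows if any(cell.strip() for cell in row)]


cleaner = DataCleaner([["a", "1"], ["", " "], ["b", "2"]])
cleaned_rows = cleaner.strip_blank_rows()
print(cleaned_rows)
```

A style checker can flag most deviations automatically, so the conventions do not have to be memorized up front.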
For Beginners
PEP 20 - The Zen of Python
python.org/dev/peps/pep-0020/
...
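The Zen of Python also ships with the interpreter, so it can be read without opening a browser. A minimal sketch follows; it assumes CPython, where the `this` module keeps the text ROT13-encoded in the attribute `this.s` (an implementation detail, not a documented API).

```python
import codecs
import this  # the first import prints the Zen of Python as a side effect

# Assumption: CPython stores the ROT13-encoded text in `this.s`.
zen_text = codecs.decode(this.s, "rot13")

# Skip the title line and the blank line that follows it.
aphorisms = [line for line in zen_text.splitlines()[2:] if line]
print("First aphorism:", aphorisms[0])
```

In an interactive session, typing `import this` on its own is enough to print the full text.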
Resources: For Beginners
Python, Programming, Open Source
Knowledge: python.org -> Beginner’s Guide
for Non-Programmers https://bit.ly/1Iv5glG
for Programmers https://bit.ly/1UIBJMJ
How to learn: Lynn Root “Sink or swim”
http://www.roguelynn.com/words/The-New-Coder-A-path-to-Software-Engineering/
xkcd
CC BY-NC 2.5
https://xkcd.com/1838/
Data Science
Use of Python in Data Science
Python Developer - Survey 2017 Results
(9,500 developers, 150 countries)
“What do you use Python for?
(multiple answers)”
50% Data analysis, 31% Machine learning
jetbrains.com/research/python-developers-survey-2017/
Python & Data Science - what’s more?
Jupyter Notebook jupyter.org/
PyCharm jetbrains.com/pycharm/
Spyder pythonhosted.org/spyder/
Use Python to build your tools to explore data
You need to know Python to be able to freely
build experiments.
“Data”
Gathering, cleaning, and preparing data is crucial.
Typical issues:
- there is not enough data
- data is messy
- we actually don’t know what is in the data set...
Gill Press, Forbes https://bit.ly/2OgNM4D
Data preparation can be as much as 80% of a data scientist's work
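A first look at an unfamiliar data set often starts with the typical issues listed on this slide: how much is missing, how messy the values are, and what is actually in there. The sketch below uses only the standard library. The inline CSV sample and the set of "missing" markers are made up for illustration; a real project would read a file and might well use a library such as pandas instead.

```python
import csv
import io
from collections import Counter

# Made-up sample standing in for a real file; note the gaps and stray spaces.
RAW_CSV = """name,age,city
Ada,36,London
 Bob ,,Warsaw
Cleo,n/a,
Ada,36,London
"""

# Assumption: these strings all mean "no value" in this data set.
MISSING_MARKERS = {"", "n/a", "na", "null"}


def load_rows(text):
    """Parse CSV text into dicts with whitespace-stripped values."""
    reader = csv.DictReader(io.StringIO(text))
    return [{key: value.strip() for key, value in row.items()} for row in reader]


def count_missing(rows):
    """Count missing values per column to see what the data set lacks."""
    missing = Counter()
    for row in rows:
        for column, value in row.items():
            if value.lower() in MISSING_MARKERS:
                missing[column] += 1
    return dict(missing)


def drop_duplicates(rows):
    """Remove exact duplicate rows while keeping the original order."""
    seen = set()
    unique = []
    for row in rows:
        fingerprint = tuple(row.items())
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique.append(row)
    return unique


rows = drop_duplicates(load_rows(RAW_CSV))
missing_per_column = count_missing(rows)
print(len(rows), "unique rows; missing values:", missing_per_column)
```

Even a report this small answers the "we don't know what is in the data set" problem before any modelling starts.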
“Science” - on the hunt for the right questions
● Understand what I want to achieve
● Define the problem that I want to solve
● Define the input and the output I want
● Look for helpful algorithms
● Compare the chosen algorithms
● Choose the algorithm/s to be used
● Choose the evaluation metrics
● Choose the parameter sets for the experiments
● Run experiments
● Analyse the results
● Define the conclusions and/or get back to previous points
Anna Gut, Python Developer & Team Lead @10Clouds
UNDERSTAND
SEARCH
EXPERIMENT
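The later steps of this workflow (compare candidate algorithms, pick a metric, pick parameter sets, run, analyse) can be sketched as a small experiment loop. Everything here is an assumption made for illustration: the synthetic data, the two toy models (a mean baseline and a one-dimensional k-nearest-neighbours regressor), and mean absolute error as the metric. None of it comes from the talk.

```python
import random
import statistics


def make_data(n, seed):
    """Synthetic data: y = 2x plus noise. Purely illustrative."""
    rng = random.Random(seed)
    xs = [rng.uniform(0, 10) for _ in range(n)]
    return [(x, 2 * x + rng.gauss(0, 1)) for x in xs]


def knn_predict(train, x, k):
    """Average the targets of the k training points closest to x."""
    nearest = sorted(train, key=lambda point: abs(point[0] - x))[:k]
    return statistics.mean(y for _, y in nearest)


def mean_baseline(train, x):
    """Ignore the input and always predict the training mean."""
    return statistics.mean(y for _, y in train)


def mean_absolute_error(model, train, test, **params):
    """Evaluation metric: average absolute gap between prediction and truth."""
    return statistics.mean(abs(model(train, x, **params) - y) for x, y in test)


data = make_data(200, seed=42)
train, test = data[:150], data[150:]

# Candidate algorithms and the parameter sets to try for each.
experiments = [("baseline", mean_baseline, {})]
experiments += [(f"knn(k={k})", knn_predict, {"k": k}) for k in (1, 5, 15)]

# Run every experiment with the same metric and the same split.
results = {
    name: mean_absolute_error(model, train, test, **params)
    for name, model, params in experiments
}
best = min(results, key=results.get)
for name, score in sorted(results.items(), key=lambda item: item[1]):
    print(f"{name:12s} MAE = {score:.3f}")
```

Keeping a trivial baseline in the comparison makes it obvious whether a more complex algorithm earns its place, which feeds the last step: draw conclusions or go back to an earlier point.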
How to find the right algorithm?
The Internet…
- the resource - do you consider the source trusted? (e.g. scikit-learn)
- number of stars, forks, when was the last commit? (GitHub)
- the code
- is it aligned with the Python standards? (PEP 8)
- check the particular functions
- ...
- does it fit the general architecture of the project?
- ask a friend
Anna Gut, Python Developer & Team Lead @10Clouds
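The GitHub signals in this checklist (stars, forks, recent activity) can be read programmatically. The sketch below assumes the public GitHub REST API endpoint `https://api.github.com/repos/{owner}/{repo}` and its `stargazers_count`, `forks_count`, and `pushed_at` fields. The helper names and the sample payload are hypothetical, and `pushed_at` records the last push, which is only a proxy for the last commit.

```python
import json
import urllib.request
from datetime import datetime, timezone


def fetch_repo(owner, repo):
    """Fetch repository metadata from the public GitHub REST API.

    Unauthenticated requests are rate-limited, so call this sparingly.
    """
    url = f"https://api.github.com/repos/{owner}/{repo}"
    with urllib.request.urlopen(url, timeout=10) as response:
        return json.load(response)


def summarize_repo(payload, now=None):
    """Reduce the API payload to the signals named in the checklist."""
    now = now or datetime.now(timezone.utc)
    pushed = datetime.fromisoformat(payload["pushed_at"].replace("Z", "+00:00"))
    return {
        "stars": payload["stargazers_count"],
        "forks": payload["forks_count"],
        "days_since_last_push": (now - pushed).days,
    }


# Made-up payload with the same field names, so the sketch runs offline.
sample = {
    "stargazers_count": 1200,
    "forks_count": 85,
    "pushed_at": "2018-07-01T12:00:00Z",
}
summary = summarize_repo(sample, now=datetime(2018, 7, 25, tzinfo=timezone.utc))
print(summary)
```

With network access, `summarize_repo(fetch_repo("scikit-learn", "scikit-learn"))` would return the same three numbers for a real project. They are a starting point only; reading the code and asking a friend remain on the list for a reason.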
Hacks - what were your steps and how did you get there?
Step is a wrapper over the transformer and handles multiple aspects of the
execution of the pipeline, such as saving intermediate results (if needed),
checkpointing the model during training and more.
Transformer is a purely computational, data scientist-defined piece that takes
input data and produces output data. Typical Transformers are neural
networks, machine learning algorithms, and pre- or post-processing routines.
github.com/neptune-ml/steppy
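The split between a computational piece and a wrapper that handles bookkeeping can be shown in plain Python. The sketch below does not use the steppy API. The classes `Scale` and `CachedStep` are hypothetical stand-ins that only mirror the idea: one object computes, the other saves the intermediate result and reuses it.

```python
import pickle
import tempfile
from pathlib import Path


class Scale:
    """Transformer: pure computation, input data in, output data out."""

    def __init__(self, factor):
        self.factor = factor

    def transform(self, values):
        return [value * self.factor for value in values]


class CachedStep:
    """Step: wraps a transformer and saves its output to disk for reuse."""

    def __init__(self, name, transformer, cache_dir):
        self.name = name
        self.transformer = transformer
        self.cache_path = Path(cache_dir) / f"{name}.pkl"
        self.runs = 0  # counts how often the transformer actually executed

    def run(self, values):
        if self.cache_path.exists():
            # Intermediate result already saved: skip the computation.
            return pickle.loads(self.cache_path.read_bytes())
        output = self.transformer.transform(values)
        self.runs += 1
        self.cache_path.write_bytes(pickle.dumps(output))
        return output


with tempfile.TemporaryDirectory() as cache_dir:
    step = CachedStep("scale_by_10", Scale(10), cache_dir)
    first = step.run([1, 2, 3])   # computed and saved
    second = step.run([1, 2, 3])  # loaded from the saved file
    print(first, second, step.runs)
```

As a simplification, the cache is keyed by step name only, so a changed input would still return the saved result. A real pipeline library has to handle that, along with checkpointing during training.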
Resources
Data Science, Open Source
All… datasciencemasters.org/
Transformation from Math & Phys into Data Science:
p.migdal.pl/2016/03/15/data-science-intro-for-math-phys-background.html
Projects
Cooperation
Contribution
Projects
Find your project
- newcoder.io/tutorials/
- www.kaggle.com/
- devmesh.intel.com/
POSSIBLE?
DRIVING
ME?
Cooperation
Online:
- pyslackers.com (14,757 members)
- mail.python.org/mailman/listinfo/tutor
- https://www.facebook.com/groups/python.programmers
Offline:
- PyData, PyWaw
- PyLadies, Girl Geek, Geek Girls Carrots (Krakow)
- Django Carrots, Django Girls
Contribution
- Bug Tracker bugs.python.org
- Open Source Projects opensource.guide/how-to-contribute
- Answer questions at pyslackers.com
- Become a speaker/mentor pydata.org
- Organize Django Girls djangogirls.org/organize
- ...
Even More Resources...
Open Education
jose.theoj.org
Thank you!
github.com/KStepniowska/EuroPython2018
kamila.stepniowska@10clouds.com
@kstepniowska
