SlideShare a Scribd company logo
Python 	

on Science?
Marcel Caraciolo
@marcelcaraciolo	

CTO of Genomika Diagnósticos, Scientist, MsC at Computer Science and Data Analysis,	

works with Python for 7 years, interested at mobile	

education, machine learning and dataaaaa! Current president of Association PythonBrazil.	

Recife, Brazil - http://aimotion.blogspot.com
Yes we can!
About me
Creator os several scientific python packages including crab
(recsys), benchy and now biopandas
Until last year, Scientist Chief at Atepassar.com 	

(e-learning social network)
Co-Founder and Instructor of PyCursos, teaching Python on-line
including the famous Masanori’s Python for Zoooommbiesss!!
Interested at Python, mobile, e-learning, machine learning and now
my new acquisition skill: bioinformatics!!!
2014, new challenges!
http://www.genomika.com.br
Onde tudo começou…
!
Março 2013
!
https://github.com/
marcelcaraciolo/Geo-
Friendship-Visualization
What already you know…
What already you know…
What already you know…
What already you know…
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Putting Science back in Comp Science
Much of the software stack is for systems programming
C++, Java, .NET, ObjC, web, etc.
Complex numbers ? Vectorized primitives ?
Software stack for scientists is not helpful as it should be
FORTRAN, C/C++ is still where many scientists end up
Python on Science ?   Yes, We can.
High Performance with Big Data
Packages for data analysis and visualization
Syntax - Gets out of your way !!
Community Driven
Ready for web applications
Which is better Data Analysis language ? R or Python ?
Quora, http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
“Python	
  is	
  good	
  for	
  data	
  cleanup,	
  R	
  for	
  sta7s7cal	
  models”	
  
Which is better Data Analysis language ? R or Python ?
Quora, http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
“Python	
  is	
  good	
  for	
  data	
  cleanup,	
  R	
  for	
  sta7s7cal	
  models”	
  
“R	
  is	
  quirky	
  and	
  weird	
  but	
  the	
  sta7s7cians	
  love	
  it	
  and	
  there	
  really	
  
isn’t	
  any	
  compelling	
  reason	
  to	
  switch”	
  
Which is better Data Analysis language ? R or Python ?
Quora, http://www.quora.com/Data-Analysis/Which-is-the-better-Data-analysis-language-R-or-Python
“Python	
  is	
  good	
  for	
  data	
  cleanup,	
  R	
  for	
  sta7s7cal	
  models”	
  
“R	
  is	
  quirky	
  and	
  weird	
  but	
  the	
  sta7s7cians	
  love	
  it	
  and	
  there	
  really	
  
isn’t	
  any	
  compelling	
  reason	
  to	
  switch”	
  
“You’re	
  running	
  an	
  scien7fic	
  simula7on	
  on	
  a	
  laptop?	
  	
  
	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Perhaps	
  you	
  should	
  write	
  it	
  in	
  C++/FORTRAN”	
  
“You’re(running(an(MCMC(simula2on(on(
a(laptop?(Perhaps(you(should(write(it(in(
C++/FORTRAN”(
Numba
just-in-time compiler to LLVM through
@decorators
numba.pydata.org	
  
Numba
just-in-time compiler to LLVM through
@decorators*
numba.pydata.org	
  
*	
  aka	
  ,	
  fast.	
  easy	
  
Python on Science ?   Yes, We can.
Basic packages for data analysis and visualisation
NumPy: The foundation of the
Python Data Analysis stack
Numpy: Array oriented
DEMO (.)
Matplotlib
2D plotting library
DEMO (…)
!
Python Data Analysis toolkit upon Numpy
DEMO
Upon pandas it started several fork projects
https://github.com/kjordahl/geopandas
Upon pandas it started several fork projects
https://github.com/wrobstory/vincent
D3 with Vega
Upon pandas it started several fork projects
http://statsmodels.sourceforge.net/
Statistics, regression plots
Scikit-learn
Python toolkit for machine learning
DEMO (….)
Applied Science with Python!
Hurricane Detector using GFS Data
Colorado State University
Minwoo Lee
http://conference.scipy.org/scipy2011/slides/lee_hurricane_prediction.pdf
biological data analysis with python
http://www.astropy.org/
http://scikit-image.org/
NLTK
http://www.nltk.org/
IPython:
Interactive Python
DEMO (..)
DEMO
DEMO WITH WAKARI +
DEMO WITH GALLERY NBVIEWER (….)
12. Anaconda Distribution packages
Anaconda:	
  pulls	
  it	
  all	
  together
12. Anaconda Distribution packages
https://store.continuum.io/cshop/anaconda/
$ conda list
$ conda search
$ conda install <package-name>
$ conda create -n numpy16 ipython-notebook numpy=1.6
$ source activate numpy16
$ source deactivate
https://binstar.org/
$ conda install binstar
$ conda build <recipe-dir>
$ conda config --add channels https://conda.binstar.org/username
$ binstar login
How do you show your scientific app
to the world ?
One alternative: Yhat
How do you show your scientific app
to the world ?
One alternative: Yhat + Heroku
DEMO
http://blog.yhathq.com/posts/digit-recognition-with-node-and-
python.html
Who is using Scientific Python ?
Tools for scientific development
Tools for scientific development
Getting Started
http://stackoverflow.com/questions/9555635/open-source-enthought-python-alternative
http://fonnesbeck.github.io/ScipySuperpack/
Recent builds of fundamental Python scientific computing packages for OS X
https://code.google.com/p/pythonxy/
Scientific-oriented Python Distribution based on Qt and Spyder
https://store.continuum.io/cshop/anaconda/
Completely free enterprise-ready Python distribution for large-scale data processing, predictive analytics, and
scientific computing
!
Scientific python has also several events!
http://conference.scipy.org/
…and library community discussion groups
http://www.scipy.org/scipylib/mailing-lists.html
https://groups.google.com/forum/#!forum/pydata
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
biopython@biopython.org
…and library community discussion groups
http://pyscience-brasil.wikidot.com/
https://groups.google.com/forum/#!forum/pyscience-brasil
…and library community discussion groups
http://pycursos.com/biblioteca/computacao-cientifica/?filter=lesson
Scientific Community is f**ing amazing!
https://www.enthought.com/products/pyxll/
PyXLL - Python for Excel Solution
Clinical Sequencing
Flask + Python -Web Dashboard
Python + matplotlib + scipy + numpy - Sequencing
biological and log databases
biopandas in work and sequencing parallel workflow
approaches
What Am I working on ?
biopandas
What Am I working on ?
https://github.com/genomika/biopandas
Challenges
Reproducible	
  Research
Challenges
“A	
  rule	
  of	
  thumb	
  among	
  biotechnology	
  venture	
  capitalists	
  is	
  that	
  half	
  
of	
  published	
  research	
  cannot	
  be	
  replicated”	
  
Challenges
How	
  do	
  we	
  replicate	
  research	
  today?
Challenges
How	
  do	
  we	
  replicate	
  research	
  today?
collaborate	
  on	
  
Challenges
How	
  do	
  we	
  replicate	
  research	
  today?
collaborate	
  on	
  
data	
  analysis
How do we collaborate today ?
??????
Python on Science ?   Yes, We can.
Project-based interaction
Project-based interaction
wakari.io	
  	
  	
  	
  	
  	
  	
  	
  	
  	
  Browser-­‐based	
  Python	
  &	
  Linux	
  environment	
  
Presente em 22 dos
26 estados
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python on Science ?   Yes, We can.
Python 	

on Science?
Marcel Caraciolo
@marcelcaraciolo	

CTO of Genomika Diagnósticos, Scientist, MsC at Computer Science and Data Analysis,	

works with Python for 7 years, interested at mobile	

education, machine learning and dataaaaa! Current president of Association PythonBrazil.	

Recife, Brazil - http://aimotion.blogspot.com
Yes we can!

More Related Content

PPTX
Introduction to Python
PDF
Scientist meets web dev: how Python became the language of data
PDF
Python Developer Certification
PDF
Python入門 : 4日間コース社内トレーニング
PDF
The str/bytes nightmare before python2 EOL
PDF
Introduction to python
PDF
Why Python is better for Data Science
PDF
pyconjp2015_talk_Translation of Python Program__
Introduction to Python
Scientist meets web dev: how Python became the language of data
Python Developer Certification
Python入門 : 4日間コース社内トレーニング
The str/bytes nightmare before python2 EOL
Introduction to python
Why Python is better for Data Science
pyconjp2015_talk_Translation of Python Program__

What's hot (11)

PDF
FSharp and Data 101
PPTX
Python programming | Fundamentals of Python programming
PDF
Python for Science and Engineering: a presentation to A*STAR and the Singapor...
PDF
Intro to Python
PDF
Python - basics
PDF
Introduction of python
ODP
Python and Machine Learning
PPTX
C pythontalk
PPTX
AI Machine Learning Complete Course: for PHP & Python Devs
PDF
10 more-things-you-can-do-with-python
PDF
What is Python?
FSharp and Data 101
Python programming | Fundamentals of Python programming
Python for Science and Engineering: a presentation to A*STAR and the Singapor...
Intro to Python
Python - basics
Introduction of python
Python and Machine Learning
C pythontalk
AI Machine Learning Complete Course: for PHP & Python Devs
10 more-things-you-can-do-with-python
What is Python?
Ad

Viewers also liked (20)

PPTX
第3回メドレー読書会後半
PDF
Python for Data Science
PDF
Ecossistemas de startups nordestinos os desafios para a competitividade (2)
PDF
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
PDF
ThingTank @ MIT-Skoltech Innovation Symposium 2014
PDF
IBECC - Contratos Empresariais - Revisão e Controle
PPTX
Projeto software alem da tecnologia v2
PDF
Aula06 matriz em C
PDF
Plano do Projeto
PPTX
MySQL - copiando, movendo e restaurando dados
PDF
Web 2.0 Collaboration – Using digital tools for redesigning governance
PPTX
História do Escritório Virtual de Aracaju
PPTX
O papel da internet na Assessoria de Imprensa
PPT
Seminario - Versão Final
PPTX
Palestra - Bem vindo a era pós-digital: Empreendendo em um ambiente mutante.
PDF
Apresentação ForkInSergipe
PDF
Social Web Studies - What kind of collaboration is right for your business
PDF
Implementacao e desempenho da virtualizacao no dcomp ufs
PDF
Google+ Para Empresas - GBG Aracaju - Tiago Araujo Melo
PDF
Ruby on rails - CEFET de Lagarto
第3回メドレー読書会後半
Python for Data Science
Ecossistemas de startups nordestinos os desafios para a competitividade (2)
Network Learning: AI-driven Connectivist Framework for E-Learning 3.0
ThingTank @ MIT-Skoltech Innovation Symposium 2014
IBECC - Contratos Empresariais - Revisão e Controle
Projeto software alem da tecnologia v2
Aula06 matriz em C
Plano do Projeto
MySQL - copiando, movendo e restaurando dados
Web 2.0 Collaboration – Using digital tools for redesigning governance
História do Escritório Virtual de Aracaju
O papel da internet na Assessoria de Imprensa
Seminario - Versão Final
Palestra - Bem vindo a era pós-digital: Empreendendo em um ambiente mutante.
Apresentação ForkInSergipe
Social Web Studies - What kind of collaboration is right for your business
Implementacao e desempenho da virtualizacao no dcomp ufs
Google+ Para Empresas - GBG Aracaju - Tiago Araujo Melo
Ruby on rails - CEFET de Lagarto
Ad

Similar to Python on Science ? Yes, We can. (20)

PPTX
Intro to Python Data Analysis in Wakari
KEY
Crab - A Python Framework for Building Recommendation Systems
PDF
A Whirlwind Tour Of Python
PDF
Samsung SDS OpeniT - The possibility of Python
PDF
Python webinar 4th june
PDF
Introduction to Crab - Python Framework for Building Recommender Systems
PDF
What Is The Future of Data Science With Python?
PPTX
Python for Big Data Analytics
PPTX
Python PPT
PDF
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
PDF
Python standard library &amp; list of important libraries
PDF
Collaborations in the Extreme: 
The rise of open code development in the scie...
PPTX
Introduction to Python Programming Language For Artificial Intelligence
PDF
Top Libraries for Machine Learning with Python
PPTX
What is the best way to learn Python.pptx
PDF
slidesgo-unleashing-the-power-of-python-your-gateway-to-programming-mastery-2...
PDF
How You Can Use Open Source Materials to Learn Python & Data Science - EuroPy...
PPTX
What is Python? An overview of Python for science.
PDF
PyTorch Deep Learning Framework | USDSI®
PDF
what is python ?
Intro to Python Data Analysis in Wakari
Crab - A Python Framework for Building Recommendation Systems
A Whirlwind Tour Of Python
Samsung SDS OpeniT - The possibility of Python
Python webinar 4th june
Introduction to Crab - Python Framework for Building Recommender Systems
What Is The Future of Data Science With Python?
Python for Big Data Analytics
Python PPT
python-for-advanced-data-science-techniques-and-best-practices-20240911071850...
Python standard library &amp; list of important libraries
Collaborations in the Extreme: 
The rise of open code development in the scie...
Introduction to Python Programming Language For Artificial Intelligence
Top Libraries for Machine Learning with Python
What is the best way to learn Python.pptx
slidesgo-unleashing-the-power-of-python-your-gateway-to-programming-mastery-2...
How You Can Use Open Source Materials to Learn Python & Data Science - EuroPy...
What is Python? An overview of Python for science.
PyTorch Deep Learning Framework | USDSI®
what is python ?

More from Marcel Caraciolo (20)

PDF
Como interpretar seu próprio genoma com Python
PDF
Joblib: Lightweight pipelining for parallel jobs (v2)
PDF
Construindo softwares de bioinformática para análises clínicas : Desafios e...
PDF
Como Python ajudou a automatizar o nosso laboratório v.2
PDF
Como Python pode ajudar na automação do seu laboratório
PDF
Oficina Python: Hackeando a Web com Python 3
PDF
Recommender Systems with Ruby (adding machine learning, statistics, etc)
PDF
Opensource - Como começar e dá dinheiro ?
PDF
Big Data com Python
PDF
Benchy, python framework for performance benchmarking of Python Scripts
PDF
Python e 10 motivos por que devo conhece-la ?
PDF
GeoMapper, Python Script for Visualizing Data on Social Networks with Geo-loc...
PDF
Benchy: Lightweight framework for Performance Benchmarks
PDF
Construindo Sistemas de Recomendação com Python
PDF
Python, A pílula Azul da programação
PDF
Construindo Soluções Científicas com Big Data & MapReduce
PDF
Como Python está mudando a forma de aprendizagem à distância no Brasil
PDF
Novas Tendências para a Educação a Distância: Como reinventar a educação ?
PDF
Aula WebCrawlers com Regex - PyCursos
PDF
Arquivos Zip com Python - Aula PyCursos
Como interpretar seu próprio genoma com Python
Joblib: Lightweight pipelining for parallel jobs (v2)
Construindo softwares de bioinformática para análises clínicas : Desafios e...
Como Python ajudou a automatizar o nosso laboratório v.2
Como Python pode ajudar na automação do seu laboratório
Oficina Python: Hackeando a Web com Python 3
Recommender Systems with Ruby (adding machine learning, statistics, etc)
Opensource - Como começar e dá dinheiro ?
Big Data com Python
Benchy, python framework for performance benchmarking of Python Scripts
Python e 10 motivos por que devo conhece-la ?
GeoMapper, Python Script for Visualizing Data on Social Networks with Geo-loc...
Benchy: Lightweight framework for Performance Benchmarks
Construindo Sistemas de Recomendação com Python
Python, A pílula Azul da programação
Construindo Soluções Científicas com Big Data & MapReduce
Como Python está mudando a forma de aprendizagem à distância no Brasil
Novas Tendências para a Educação a Distância: Como reinventar a educação ?
Aula WebCrawlers com Regex - PyCursos
Arquivos Zip com Python - Aula PyCursos

Recently uploaded (20)

PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
famous lake in india and its disturibution and importance
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PDF
An interstellar mission to test astrophysical black holes
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PDF
Sciences of Europe No 170 (2025)
PDF
HPLC-PPT.docx high performance liquid chromatography
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
neck nodes and dissection types and lymph nodes levels
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
INTRODUCTION TO EVS | Concept of sustainability
famous lake in india and its disturibution and importance
AlphaEarth Foundations and the Satellite Embedding dataset
bbec55_b34400a7914c42429908233dbd381773.pdf
The KM-GBF monitoring framework – status & key messages.pptx
ECG_Course_Presentation د.محمد صقران ppt
Introduction to Fisheries Biotechnology_Lesson 1.pptx
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Cell Membrane: Structure, Composition & Functions
An interstellar mission to test astrophysical black holes
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Sciences of Europe No 170 (2025)
HPLC-PPT.docx high performance liquid chromatography

Python on Science ? Yes, We can.