Jupyter for Education: 

Beyond Gutenberg and Erasmus
2015-07-25 • Seattle
Paco Nathan, @pacoid

O’Reilly Learning
Who We Are:
O’Reilly Learning
O’Reilly Learning is a new business unit
focused on the (rapid) evolution of learning
experiences for our audience, spanning
across the range of product offerings at
O'Reilly Media
Not These People …
These People …
O’Reilly Learning
Objective:
Examine, make sense of, and organize 

our various training products and learning
channels – for ourselves and our customers
Content flows through a maze of editorial
process, production workflows, delivery
channels, etc., from authors to audience…
Authors
Audience
DB:
videos
Git:
versioning
Atlas:
publications
EPUB
oreilly.com Safari
On24:
webcasts
OST:
online courses
Events
Studio:
recording
SMEs
Meetup, etc.:
partnerships
O’Reilly Learning
Content flows through a maze of editorial
process, production workflows, delivery
channels, etc., from authors to audience…
Authors
Audience
DB:
videos
Git:
versioning
Atlas:
publications
EPUB
oreilly.com Safari
On24:
webcasts
OST:
online courses
Events
Studio:
recording
SMEs
Meetup, etc.:
partnerships
O’Reilly Learning
regarded by authors as a
relatively “agile” process, 

more than most – even so, 

it needs much improvement
IMHO, here’s the crux of the issue, which
impedes the industry in general:
Authors
Audience
DB:
videos
Git:
versioning
Atlas:
publications
EPUB
oreilly.com Safari
On24:
webcasts
OST:
online courses
Events
Studio:
recording
SMEs
Meetup, etc.:
partnerships
O’Reilly Learning
Authors
Audience
DB:
videos
Git:
versioning
Atlas:
publications
EPUB
oreilly.com Safari
On24:
webcasts
OST:
online courses
Events
Studio:
recording
SMEs
Meetup, etc.:
partnerships
The Learning Architecture:
Defining Development and Enabling Continuous Learning
David Mallon, Dani Johnson
Bersin (2014-05-06)
http://www.bersin.com/Practice/Detail.aspx?
docid=17435&mode=search&p=Learning-@-Development
This report is designed to help leaders 

and talent development and learning 

professionals to take positive steps 

toward understanding and implementing 

learning architectures.
Learning Architecture
In the words of Michael Pollan,
“You are what you eat eats.”
michaelpollan.com/reviews/you-are-what-you-eat/
Learning Architecture
We live within a community of makers,
innovators, learners, implementers…
Our objective initially is to provide a
learning architecture within our company,
leveraging it as a pattern that can help 

our customers build their learning
architectures, subsequently deployed 

on behalf of their customers
Authors
Audience
DB:
videos
Git:
versioning
Atlas:
publications
EPUB
oreilly.com Safari
On24:
webcasts
OST:
online courses
Events
Studio:
recording
SMEs
Meetup, etc.:
partnerships
Learning Architecture
Background:
On Demand Analytic and Learning Environments with Jupyter

Kyle Kelley, Andrew Odewahn

lambdaops.com/jupyter-environments-odsc2015/
Exploring a couple themes, in particular:
• computational narratives
- exploratory data analysis
- software development/collaboration
- API exploration
- technical papers
- reports/exec dashboards
• code-as-media
- Thebe project, etc.
Background:
Personal experience in 2012-15 as 

an independent author and instructor…
Just Enough Math

Paco Nathan

O’Reilly Media (2014)

http://justenoughmath.com
Background:
Personal learnings, based on working 

on this project with Kyle and Andrew…
How to transit from the role of data scientist,
software developer, engineering director – 

into a role of author, teacher and vice versa
Background:
Interactive notebooks: 

Sharing the code
Helen Shen
Nature (2014-11-05)
nature.com/news/interactive-notebooks-
sharing-the-code-1.16261
Background:
Embracing Jupyter Notebooks at O'Reilly

Andrew Odewahn, 2015-05-07
https://beta.oreilly.com/ideas/jupyter-at-oreilly
“O'Reilly Media is using our Atlas platform to 

make Jupyter Notebooks a first class authoring
environment for our publishing program.”
Jupyter, Thebe, Docker, etc.
Background:
Embracing Jupyter Notebooks at O'Reilly
Andrew Odewahn
https://beta.oreilly.com/ideas/jupyter-at-oreilly
“O'Reilly Media is using our Atlas platform to
make Jupyter Notebooks a first class authoring
environment for our publishing program.”
Jupyter
Background:
Background:
Atlas is our content platform backed by Git,
for project collaboration among authors,
editors, et al.
https://atlas.oreilly.com/
Background:
Thebe (a moon of Jupiter) provides a layer
atop Jupyter that is needed for publishing,
white-labeled content, etc.
https://github.com/oreillymedia/thebe
Background:
Beta is our proof of concept:
https://beta.oreilly.com/learning
Tech Stack:
production presentation
Thebe:
player
Jupyter:
notebook
Docker:
container
web page:
interaction
Git:
versioning
Atlas:
publications
various
formats
authoring
cloud
infra
Question:
What’s the delta between our current 

author workflow and this new world of 

Jupyter + Docker +Thebe + cloud, etc.?
production presentation
Thebe:
player
Jupyter:
notebook
Docker:
container
web page:
interaction
Git:
versioning
Atlas:
publications
various
formats
authoring
cloud
infra
Great Examples:
Great Examples:
Seeing what Microsoft is doing with Jupyter
notebooks in Cortana Analytics – that’s brilliant
http://gallery.azureml.net/Experiment/3fe213e3ae6244c5ac84a73e1b451dc4
Most definitely check out CodeNeuro,
both online and the conf/hackathon… 

for example:
Jeremey Freeman, HHMI Janelia Farm

http://notebooks.codeneuro.org/
Matthew Conlen, NY Data Company

http://lightning-viz.org/
Olga Botvinnick, UCSD

http://yeolab.github.io/flotilla/docs/gallery/
Great Examples:
Curating a list of examples, as a shared
doc online, and some exemplars include…
Lorena Barba, GWU

http://lorenabarba.com/
Anita Raichand

https://github.com/painterly/data_py
Chris Fonnesbeck,Vanderbilt

https://plot.ly/ipython-notebooks/computational-bayesian-
analysis/
Donne Martin, NemetschekVectorworks

https://bit.ly/data-notes
Great Examples:
Compare/contrast Jupyter with other
interesting notebooks impls…
Databricks

https://class01.cloud.databricks.com/#notebook/76328
R Markdown

http://rmarkdown.rstudio.com/
Andy Petrella, Data Fellas

https://github.com/andypetrella/spark-notebook
IBM Knowledge Anyhow

https://knowledgeanyhow.org
Mathematica

https://www.wolfram.com/learningcenter/tutorialcollection/
NotebooksAndDocuments/
Great Examples:
Learning:
A few features on the wish list for
notebooks:
• integrating video content
• social aspects, collaboration
• a spectrum of learning modes engaged
• how to integrate classroom experience
• expert mentoring
• learning paths
• remote learning environments, e.g.,
massive open online somethingorother
Learning meets Data Science:
MOOCs, such as edX, provide excellent
features for learning at scale, however:
• costly for authors producing content
• difficult to instrument
• relatively low ROI (completion rates)

Typesafe as a rare counterexample
• lacking social context that reinforces
learning … it’s difficult to staff a 

small army of TAs who are needed
What about MOOCs?
Peter Norvig @ Future Learning 2020
Summit, 2015-05-30:
• search engines surface too many
choices for available learning content
• (“Thanks Google”)
• need to get people to want to interact
with the material – generally due to
social context
What about MOOCs?
Significant improvement in the notion 

of “flipped” a.k.a. inverted classrooms
For a good example, see:
Caltech Offers Online Course with 

Live Lectures in Machine Learning
Yaser Abu-Mostafa (2012-03-30)
http://www.caltech.edu/news/caltech-offers-online-
course-live-lectures-machine-learning-4248
Learning meets Data Science:
There are other pedagogical issues to
address, e.g., how to differentiate which
content or mode will be most effective 

for a learner’s needs and learning style
Patterns of Code as Media

Andrew Odewahn, O’Reilly Media

odewahn.github.io/patterns-of-code-as-media/www/
introduction.html
Learning meets Data Science:
total
newbie
good
overview
Do you have sufficient familiarity with the topic?
utterly
confused
familiar
territory
Can you build on familiarity with a related topic?
must get
unstuck
send pull
request
Do you have necessary proficiency in the topic?
learner
topic
experience
concise
topic
inter-
disciplinary
How many boundaries must you span to achieve structural literacy for this topic?
want to
for myself
have to
for my job
What is your primary motivation to learn this topic?
bleeding
edge
COBOL 2020
Where are you on the "diffusion of innovation" curve w.r.t. the topic?
on-
demand
major
event
How high is the transaction cost for the experience delivered to you?
"go read
the code"
full-team
participation
Does the learning experience immerse you within a diverse, supportive social context?
Learning meets Data Science:
BTW, did we mention the intense needs 

for data analytics at scale and, in particular,
dimensional reduction? :)
Education is more than lessons, exams,
certifications, instructor evals, etc., … 

though tooling often reduces it to that level
Is it possible to measure the “distance”
between a learner and the subject
community?
From Amateurs to Connoisseurs:

Modeling the Evolution of User 

Expertise through Online Reviews

Julian McAuley, Jure Leskovec

http://i.stanford.edu/~julian/pdfs/www13.pdf
Learning meets Data Science:
Learning Curves are forever –
In some sense, this is essence 

of Data Science: 

How well do you learn?
In my experience, much of the
risk encountered in managing a
Data Science team is about
budgeting for learning curve
Learning meets Data Science:
ThrowYour Life a Curve

Whitney Johnson
blogs.hbr.org/johnson/2012/09/
throw-your-life-a-curve.html
For example, notions of continuous learning:
• deconstruction of the cognitive bias One Size Fits All
• “makes a compelling case for personal disruption”
• “plan your career around learning curves”
• hire people who learn/re-learn efficiently
Learning meets Data Science:
So who (or where) are the experts
in this graph?!
Diffusion of Innovation

Everett M. Rogers (1962)

http://sphweb.bumc.bu.edu/otlt/MPH-Modules/SB/
SB721-Models/SB721-Models4.html
Learning meets Data Science:
Looking Ahead:
Moving beyond books, beyond Kindle,
beyond MOOCs …
Moving forward, important aspects include:
learning paths, continuous learning, inverted
classroom, computational thinking, learner
segmentation, etc.
Also, it’s not so much about how an
individual learns, rather our focus should
include social context, e.g., learning within 

a team
Looking Ahead:
Moving beyond books, beyond Kindle,
beyond MOOCs
Moving forward, important aspects include:
learning paths
classroom
segmentation
Also, it’s not so much about how an
individual learns, rather our focus should
include
a team
Looking Ahead:
we’re eager to work
with great new
notebook authors!!

#pioneers
Thank You!

More Related Content

PDF
GalvanizeU Seattle: Eleven Almost-Truisms About Data
PDF
Data Science in 2016: Moving Up
PDF
Microservices, containers, and machine learning
PDF
Data Science in Future Tense
PDF
Use of standards and related issues in predictive analytics
PDF
A New Year in Data Science: ML Unpaused
PDF
Apache Spark and the Emerging Technology Landscape for Big Data
PDF
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
GalvanizeU Seattle: Eleven Almost-Truisms About Data
Data Science in 2016: Moving Up
Microservices, containers, and machine learning
Data Science in Future Tense
Use of standards and related issues in predictive analytics
A New Year in Data Science: ML Unpaused
Apache Spark and the Emerging Technology Landscape for Big Data
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More

What's hot (20)

PDF
How Apache Spark fits in the Big Data landscape
PDF
How Apache Spark fits into the Big Data landscape
PDF
Databricks Meetup @ Los Angeles Apache Spark User Group
PDF
SF Python Meetup: TextRank in Python
PDF
Architecture in action 01
PDF
Spark streaming
PDF
QCon São Paulo: Real-Time Analytics with Spark Streaming
PDF
Graph Analytics in Spark
PDF
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
PDF
Data Science with Spark
PPTX
Gephi, Graphx, and Giraph
PDF
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
PDF
An excursion into Graph Analytics with Apache Spark GraphX
PDF
Data Science with Spark - Training at SparkSummit (East)
PDF
Microservices, Containers, and Machine Learning
PDF
GraphX: Graph analytics for insights about developer communities
PDF
Strata EU 2014: Spark Streaming Case Studies
PPTX
Exploring Titan and Spark GraphX for Analyzing Time-Varying Electrical Networks
PDF
H2O with Erin LeDell at Portland R User Group
PPT
Big Graph Analytics on Neo4j with Apache Spark
How Apache Spark fits in the Big Data landscape
How Apache Spark fits into the Big Data landscape
Databricks Meetup @ Los Angeles Apache Spark User Group
SF Python Meetup: TextRank in Python
Architecture in action 01
Spark streaming
QCon São Paulo: Real-Time Analytics with Spark Streaming
Graph Analytics in Spark
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Data Science with Spark
Gephi, Graphx, and Giraph
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
An excursion into Graph Analytics with Apache Spark GraphX
Data Science with Spark - Training at SparkSummit (East)
Microservices, Containers, and Machine Learning
GraphX: Graph analytics for insights about developer communities
Strata EU 2014: Spark Streaming Case Studies
Exploring Titan and Spark GraphX for Analyzing Time-Varying Electrical Networks
H2O with Erin LeDell at Portland R User Group
Big Graph Analytics on Neo4j with Apache Spark
Ad

Viewers also liked (20)

PDF
Data Science Reinvents Learning?
PDF
Agile Data Science 2.0
PDF
Jupyter, A Platform for Data Science at Scale
PPT
”’I den svenska och tyska litteraturens mittpunkt’: Svenska Pommerns roll som...
PPTX
Motivación laboral
PDF
IBM Hadoop-DS Benchmark Report - 30TB
PPTX
ระบบสารสนเทศ
PDF
2016 Results & Outlook
PDF
Blistering fast access to Hadoop with SQL
PPTX
tarea 7 gabriel
PPSX
Your moment is Waiting
PDF
Agile Data Science
PDF
JSON-LD Update
PDF
Agile analytics applications on hadoop
PDF
Enabling Multimodel Graphs with Apache TinkerPop
PPT
Agile Data Science: Building Hadoop Analytics Applications
PDF
ConsumerLab: The Self-Driving Future
PDF
Introduction to PySpark
PDF
Zipcar
PDF
Agile Data Science 2.0 - Big Data Science Meetup
Data Science Reinvents Learning?
Agile Data Science 2.0
Jupyter, A Platform for Data Science at Scale
”’I den svenska och tyska litteraturens mittpunkt’: Svenska Pommerns roll som...
Motivación laboral
IBM Hadoop-DS Benchmark Report - 30TB
ระบบสารสนเทศ
2016 Results & Outlook
Blistering fast access to Hadoop with SQL
tarea 7 gabriel
Your moment is Waiting
Agile Data Science
JSON-LD Update
Agile analytics applications on hadoop
Enabling Multimodel Graphs with Apache TinkerPop
Agile Data Science: Building Hadoop Analytics Applications
ConsumerLab: The Self-Driving Future
Introduction to PySpark
Zipcar
Agile Data Science 2.0 - Big Data Science Meetup
Ad

Similar to Jupyter for Education: Beyond Gutenberg and Erasmus (20)

PPT
Conole Ouldi Siemen/Downes seminar
PPT
Conole_AECT_presentation
PPT
Conole Aect
PPT
Get cloudengine jisc-elluminate_wednesdays
PPT
Conole Canada Keynote
PPT
Conole Canada Keynote
PPT
Conole Canada Keynote
PPT
Conole Kuwait
PPTX
Embracing AI In Assessment
PPTX
Next Generation Teaching and Learning
PPT
Get CloudEngine IET coffee morning July 2011
PPTX
New Technologies, New Ways of thinking
PPTX
Conole workshop jtelss
PPT
Conole Cambridge
PPTX
Gtc Workshop
PPT
Digital Textbooks: Needs Assessment & Implementation on Campus
PPTX
Conole edinburgh
PDF
Digital Fluencies: Why, What & Where We Are
PPT
Conole iet coffee_morning
PPTX
NWeLC Keynote
Conole Ouldi Siemen/Downes seminar
Conole_AECT_presentation
Conole Aect
Get cloudengine jisc-elluminate_wednesdays
Conole Canada Keynote
Conole Canada Keynote
Conole Canada Keynote
Conole Kuwait
Embracing AI In Assessment
Next Generation Teaching and Learning
Get CloudEngine IET coffee morning July 2011
New Technologies, New Ways of thinking
Conole workshop jtelss
Conole Cambridge
Gtc Workshop
Digital Textbooks: Needs Assessment & Implementation on Campus
Conole edinburgh
Digital Fluencies: Why, What & Where We Are
Conole iet coffee_morning
NWeLC Keynote

More from Paco Nathan (11)

PDF
Human in the loop: a design pattern for managing teams working with ML
PDF
Human-in-the-loop: a design pattern for managing teams that leverage ML
PDF
Human-in-a-loop: a design pattern for managing teams which leverage ML
PDF
Humans in a loop: Jupyter notebooks as a front-end for AI
PDF
Humans in the loop: AI in open source and industry
PDF
Computable Content
PDF
Computable Content: Lessons Learned
PDF
What's new with Apache Spark?
PDF
Big Data is changing abruptly, and where it is likely heading
PDF
Brief Intro to Apache Spark @ Stanford ICME
PDF
How Apache Spark fits into the Big Data landscape
Human in the loop: a design pattern for managing teams working with ML
Human-in-the-loop: a design pattern for managing teams that leverage ML
Human-in-a-loop: a design pattern for managing teams which leverage ML
Humans in a loop: Jupyter notebooks as a front-end for AI
Humans in the loop: AI in open source and industry
Computable Content
Computable Content: Lessons Learned
What's new with Apache Spark?
Big Data is changing abruptly, and where it is likely heading
Brief Intro to Apache Spark @ Stanford ICME
How Apache Spark fits into the Big Data landscape

Recently uploaded (20)

PDF
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
PDF
Literature_Review_methods_ BRACU_MKT426 course material
PDF
Race Reva University – Shaping Future Leaders in Artificial Intelligence
PPTX
Core Concepts of Personalized Learning and Virtual Learning Environments
PDF
Empowerment Technology for Senior High School Guide
PDF
plant tissues class 6-7 mcqs chatgpt.pdf
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PDF
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
PDF
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
PDF
Everyday Spelling and Grammar by Kathi Wyldeck
PDF
Journal of Dental Science - UDMY (2022).pdf
PPTX
MICROPARA INTRODUCTION XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PPTX
What’s under the hood: Parsing standardized learning content for AI
PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PDF
HVAC Specification 2024 according to central public works department
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
FOISHS ANNUAL IMPLEMENTATION PLAN 2025.pdf
Literature_Review_methods_ BRACU_MKT426 course material
Race Reva University – Shaping Future Leaders in Artificial Intelligence
Core Concepts of Personalized Learning and Virtual Learning Environments
Empowerment Technology for Senior High School Guide
plant tissues class 6-7 mcqs chatgpt.pdf
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
Environmental Education MCQ BD2EE - Share Source.pdf
MICROENCAPSULATION_NDDS_BPHARMACY__SEM VII_PCI Syllabus.pdf
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
Everyday Spelling and Grammar by Kathi Wyldeck
Journal of Dental Science - UDMY (2022).pdf
MICROPARA INTRODUCTION XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
What’s under the hood: Parsing standardized learning content for AI
Journal of Dental Science - UDMY (2021).pdf
FORM 1 BIOLOGY MIND MAPS and their schemes
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
HVAC Specification 2024 according to central public works department
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf

Jupyter for Education: Beyond Gutenberg and Erasmus

  • 1. Jupyter for Education: 
 Beyond Gutenberg and Erasmus 2015-07-25 • Seattle Paco Nathan, @pacoid
 O’Reilly Learning
  • 3. O’Reilly Learning O’Reilly Learning is a new business unit focused on the (rapid) evolution of learning experiences for our audience, spanning across the range of product offerings at O'Reilly Media
  • 6. O’Reilly Learning Objective: Examine, make sense of, and organize 
 our various training products and learning channels – for ourselves and our customers
  • 7. Content flows through a maze of editorial process, production workflows, delivery channels, etc., from authors to audience… Authors Audience DB: videos Git: versioning Atlas: publications EPUB oreilly.com Safari On24: webcasts OST: online courses Events Studio: recording SMEs Meetup, etc.: partnerships O’Reilly Learning
  • 8. Content flows through a maze of editorial process, production workflows, delivery channels, etc., from authors to audience… Authors Audience DB: videos Git: versioning Atlas: publications EPUB oreilly.com Safari On24: webcasts OST: online courses Events Studio: recording SMEs Meetup, etc.: partnerships O’Reilly Learning regarded by authors as a relatively “agile” process, 
 more than most – even so, 
 it needs much improvement
  • 9. IMHO, here’s the crux of the issue, which impedes the industry in general: Authors Audience DB: videos Git: versioning Atlas: publications EPUB oreilly.com Safari On24: webcasts OST: online courses Events Studio: recording SMEs Meetup, etc.: partnerships O’Reilly Learning Authors Audience DB: videos Git: versioning Atlas: publications EPUB oreilly.com Safari On24: webcasts OST: online courses Events Studio: recording SMEs Meetup, etc.: partnerships
  • 10. The Learning Architecture: Defining Development and Enabling Continuous Learning David Mallon, Dani Johnson Bersin (2014-05-06) http://www.bersin.com/Practice/Detail.aspx? docid=17435&mode=search&p=Learning-@-Development This report is designed to help leaders 
 and talent development and learning 
 professionals to take positive steps 
 toward understanding and implementing 
 learning architectures. Learning Architecture
  • 11. In the words of Michael Pollan, “You are what you eat eats.” michaelpollan.com/reviews/you-are-what-you-eat/ Learning Architecture
  • 12. We live within a community of makers, innovators, learners, implementers… Our objective initially is to provide a learning architecture within our company, leveraging it as a pattern that can help 
 our customers build their learning architectures, subsequently deployed 
 on behalf of their customers Authors Audience DB: videos Git: versioning Atlas: publications EPUB oreilly.com Safari On24: webcasts OST: online courses Events Studio: recording SMEs Meetup, etc.: partnerships Learning Architecture
  • 14. On Demand Analytic and Learning Environments with Jupyter
 Kyle Kelley, Andrew Odewahn
 lambdaops.com/jupyter-environments-odsc2015/ Exploring a couple themes, in particular: • computational narratives - exploratory data analysis - software development/collaboration - API exploration - technical papers - reports/exec dashboards • code-as-media - Thebe project, etc. Background:
  • 15. Personal experience in 2012-15 as 
 an independent author and instructor… Just Enough Math
 Paco Nathan
 O’Reilly Media (2014)
 http://justenoughmath.com Background:
  • 16. Personal learnings, based on working 
 on this project with Kyle and Andrew… How to transit from the role of data scientist, software developer, engineering director – 
 into a role of author, teacher and vice versa Background:
  • 17. Interactive notebooks: 
 Sharing the code Helen Shen Nature (2014-11-05) nature.com/news/interactive-notebooks- sharing-the-code-1.16261 Background:
  • 18. Embracing Jupyter Notebooks at O'Reilly
 Andrew Odewahn, 2015-05-07 https://beta.oreilly.com/ideas/jupyter-at-oreilly “O'Reilly Media is using our Atlas platform to 
 make Jupyter Notebooks a first class authoring environment for our publishing program.” Jupyter, Thebe, Docker, etc. Background:
  • 19. Embracing Jupyter Notebooks at O'Reilly Andrew Odewahn https://beta.oreilly.com/ideas/jupyter-at-oreilly “O'Reilly Media is using our Atlas platform to make Jupyter Notebooks a first class authoring environment for our publishing program.” Jupyter Background:
  • 20. Background: Atlas is our content platform backed by Git, for project collaboration among authors, editors, et al. https://atlas.oreilly.com/
  • 21. Background: Thebe (a moon of Jupiter) provides a layer atop Jupyter that is needed for publishing, white-labeled content, etc. https://github.com/oreillymedia/thebe
  • 22. Background: Beta is our proof of concept: https://beta.oreilly.com/learning
  • 23. Tech Stack: production presentation Thebe: player Jupyter: notebook Docker: container web page: interaction Git: versioning Atlas: publications various formats authoring cloud infra
  • 24. Question: What’s the delta between our current 
 author workflow and this new world of 
 Jupyter + Docker +Thebe + cloud, etc.? production presentation Thebe: player Jupyter: notebook Docker: container web page: interaction Git: versioning Atlas: publications various formats authoring cloud infra
  • 26. Great Examples: Seeing what Microsoft is doing with Jupyter notebooks in Cortana Analytics – that’s brilliant http://gallery.azureml.net/Experiment/3fe213e3ae6244c5ac84a73e1b451dc4
  • 27. Most definitely check out CodeNeuro, both online and the conf/hackathon… 
 for example: Jeremey Freeman, HHMI Janelia Farm
 http://notebooks.codeneuro.org/ Matthew Conlen, NY Data Company
 http://lightning-viz.org/ Olga Botvinnick, UCSD
 http://yeolab.github.io/flotilla/docs/gallery/ Great Examples:
  • 28. Curating a list of examples, as a shared doc online, and some exemplars include… Lorena Barba, GWU
 http://lorenabarba.com/ Anita Raichand
 https://github.com/painterly/data_py Chris Fonnesbeck,Vanderbilt
 https://plot.ly/ipython-notebooks/computational-bayesian- analysis/ Donne Martin, NemetschekVectorworks
 https://bit.ly/data-notes Great Examples:
  • 29. Compare/contrast Jupyter with other interesting notebooks impls… Databricks
 https://class01.cloud.databricks.com/#notebook/76328 R Markdown
 http://rmarkdown.rstudio.com/ Andy Petrella, Data Fellas
 https://github.com/andypetrella/spark-notebook IBM Knowledge Anyhow
 https://knowledgeanyhow.org Mathematica
 https://www.wolfram.com/learningcenter/tutorialcollection/ NotebooksAndDocuments/ Great Examples:
  • 31. A few features on the wish list for notebooks: • integrating video content • social aspects, collaboration • a spectrum of learning modes engaged • how to integrate classroom experience • expert mentoring • learning paths • remote learning environments, e.g., massive open online somethingorother Learning meets Data Science:
  • 32. MOOCs, such as edX, provide excellent features for learning at scale, however: • costly for authors producing content • difficult to instrument • relatively low ROI (completion rates)
 Typesafe as a rare counterexample • lacking social context that reinforces learning … it’s difficult to staff a 
 small army of TAs who are needed What about MOOCs?
  • 33. Peter Norvig @ Future Learning 2020 Summit, 2015-05-30: • search engines surface too many choices for available learning content • (“Thanks Google”) • need to get people to want to interact with the material – generally due to social context What about MOOCs?
  • 34. Significant improvement in the notion 
 of “flipped” a.k.a. inverted classrooms For a good example, see: Caltech Offers Online Course with 
 Live Lectures in Machine Learning Yaser Abu-Mostafa (2012-03-30) http://www.caltech.edu/news/caltech-offers-online- course-live-lectures-machine-learning-4248 Learning meets Data Science:
  • 35. There are other pedagogical issues to address, e.g., how to differentiate which content or mode will be most effective 
 for a learner’s needs and learning style Patterns of Code as Media
 Andrew Odewahn, O’Reilly Media
 odewahn.github.io/patterns-of-code-as-media/www/ introduction.html Learning meets Data Science:
  • 36. total newbie good overview Do you have sufficient familiarity with the topic? utterly confused familiar territory Can you build on familiarity with a related topic? must get unstuck send pull request Do you have necessary proficiency in the topic? learner topic experience concise topic inter- disciplinary How many boundaries must you span to achieve structural literacy for this topic? want to for myself have to for my job What is your primary motivation to learn this topic? bleeding edge COBOL 2020 Where are you on the "diffusion of innovation" curve w.r.t. the topic? on- demand major event How high is the transaction cost for the experience delivered to you? "go read the code" full-team participation Does the learning experience immerse you within a diverse, supportive social context? Learning meets Data Science: BTW, did we mention the intense needs 
 for data analytics at scale and, in particular, dimensional reduction? :)
  • 37. Education is more than lessons, exams, certifications, instructor evals, etc., … 
 though tooling often reduces it to that level Is it possible to measure the “distance” between a learner and the subject community? From Amateurs to Connoisseurs:
 Modeling the Evolution of User 
 Expertise through Online Reviews
 Julian McAuley, Jure Leskovec
 http://i.stanford.edu/~julian/pdfs/www13.pdf Learning meets Data Science:
  • 38. Learning Curves are forever – In some sense, this is essence 
 of Data Science: 
 How well do you learn? In my experience, much of the risk encountered in managing a Data Science team is about budgeting for learning curve Learning meets Data Science:
  • 39. ThrowYour Life a Curve
 Whitney Johnson blogs.hbr.org/johnson/2012/09/ throw-your-life-a-curve.html For example, notions of continuous learning: • deconstruction of the cognitive bias One Size Fits All • “makes a compelling case for personal disruption” • “plan your career around learning curves” • hire people who learn/re-learn efficiently Learning meets Data Science:
  • 40. So who (or where) are the experts in this graph?! Diffusion of Innovation
 Everett M. Rogers (1962)
 http://sphweb.bumc.bu.edu/otlt/MPH-Modules/SB/ SB721-Models/SB721-Models4.html Learning meets Data Science:
  • 42. Moving beyond books, beyond Kindle, beyond MOOCs … Moving forward, important aspects include: learning paths, continuous learning, inverted classroom, computational thinking, learner segmentation, etc. Also, it’s not so much about how an individual learns, rather our focus should include social context, e.g., learning within 
 a team Looking Ahead:
  • 43. Moving beyond books, beyond Kindle, beyond MOOCs Moving forward, important aspects include: learning paths classroom segmentation Also, it’s not so much about how an individual learns, rather our focus should include a team Looking Ahead: we’re eager to work with great new notebook authors!!
 #pioneers