SlideShare a Scribd company logo
Tiezheng Li
Tiezheng.Li@twosigma.com
Two Sigma Investments, LLC
PyData | November 2017
The Beaker Extensions for Jupyter:
Agenda
Beaker Notebook
From Beaker Notebook to BeakerX
BeakerX Live Demo
DISCOVERABLE
DATA
DATA
ANALYSIS
+ MODELING
SCALABLE
+ DISTRIBUTED COMPUTE
PUBLICATION
+ COLLABORATION
OUR VISION FOR DATA SCIENCE
BEAKER,
AN INTRODUCTION
LANGUAGE
MATTERS
BeakerX Beaker Extensions for Jupyter
BeakerX Beaker Extensions for Jupyter
Oct 2013
Internal GA
Mar 2015
R, Scala, Java, Python2/3 support
Jun 2015
PySpark, SparkR, Clojure, Kdb
support
Nov 2016
BeakerX Pivot
Apr 2016
External Beaker Lab Alpha
LIFE OF BEAKER
May 2014
Open Source Beta
Aug 2017
BeakerX RC1
OPEN SOURCE WORLD
nbconvert
nbviewer
nbpresent
nbgrader
Jupyter Hub nbdime
nbmanager
binder
Jupyter Lab
FORK? MERGE? JOIN?
THE PIVOT
BeakerX Beaker Extensions for Jupyter
WE DID IT!
94%
1463
213
● Time Series Visualizations
● JVM Kernels
● Interactive Tables
● Collaborative Publication
● True Polyglot Analysis (in progress)
● Data Discovery (in progress)
BeakerX: A unique addition to the Jupyter Ecosystem
DEMO
Future Work:
● Migration to Jupyter Lab
● Spark deep integration
● Data grids
● True Polyglot Analysis
● Data Discovery
● and more … !
BeakerX Beaker Extensions for Jupyter
SHAPING THE ECOSYSTEM
THANK YOU

More Related Content

PPTX
BeakerX - Tiezheng Li
PPTX
The future of Data on Kubernetes
PDF
Testing and Monitoring and Broken Things | Nikki Attea | Sensu
PDF
Chronografand dashboarding
PDF
How to Streamline Incident Response with InfluxDB, PagerDuty and Rundeck
PDF
S3 Server Hackathon Presented by S3 Server, a Scality Product, Seagate and Ho...
PDF
Datadog- Monitoring In Motion
PDF
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana
BeakerX - Tiezheng Li
The future of Data on Kubernetes
Testing and Monitoring and Broken Things | Nikki Attea | Sensu
Chronografand dashboarding
How to Streamline Incident Response with InfluxDB, PagerDuty and Rundeck
S3 Server Hackathon Presented by S3 Server, a Scality Product, Seagate and Ho...
Datadog- Monitoring In Motion
Streaming Sensor Data with Grafana and InfluxDB | Ryan Mckinley | Grafana

What's hot (16)

PDF
Kubernetes Config Management Landscape
PPTX
WHODIS_kearns_presentation.v0a
PDF
Tanny Ng, Nadeem Syed [WP Engine] | How WP Engine Transformed Monitoring Into...
PDF
PDF
Handle insane devices traffic using Google Cloud Platform - Andrea Ulisse - C...
PDF
Lessons Learned: Spring Cloud -> Docker -> Kubernetes
PDF
Big Data Analytics London - Data Science in the Cloud
PDF
APNIC Hackathon The Lord of IPv6
PDF
Elastic at Procter & Gamble: A Network Story
PDF
The Power of GitOps with Flux & GitOps Toolkit
PDF
Microservices at Mercari
PDF
XebiConFr 15 - Kafka par la face nord
KEY
Utilizing Open Government Data Using Drupal
PDF
apidays LIVE Paris 2021 - GraphQL Today and Tomorrow by Uri Goldshtein, The G...
PDF
INSTALLING THE TICK STACK AND YOUR FIRST QUERY
PDF
Kubernetes: Managed or Not Managed?
Kubernetes Config Management Landscape
WHODIS_kearns_presentation.v0a
Tanny Ng, Nadeem Syed [WP Engine] | How WP Engine Transformed Monitoring Into...
Handle insane devices traffic using Google Cloud Platform - Andrea Ulisse - C...
Lessons Learned: Spring Cloud -> Docker -> Kubernetes
Big Data Analytics London - Data Science in the Cloud
APNIC Hackathon The Lord of IPv6
Elastic at Procter & Gamble: A Network Story
The Power of GitOps with Flux & GitOps Toolkit
Microservices at Mercari
XebiConFr 15 - Kafka par la face nord
Utilizing Open Government Data Using Drupal
apidays LIVE Paris 2021 - GraphQL Today and Tomorrow by Uri Goldshtein, The G...
INSTALLING THE TICK STACK AND YOUR FIRST QUERY
Kubernetes: Managed or Not Managed?
Ad

More from PyData (20)

PDF
Michal Mucha: Build and Deploy an End-to-end Streaming NLP Insight System | P...
PDF
Unit testing data with marbles - Jane Stewart Adams, Leif Walsh
PDF
The TileDB Array Data Storage Manager - Stavros Papadopoulos, Jake Bolewski
PDF
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
PDF
Deploying Data Science for Distribution of The New York Times - Anne Bauer
PPTX
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
PPTX
Do Your Homework! Writing tests for Data Science and Stochastic Code - David ...
PDF
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
PDF
Mining dockless bikeshare and dockless scootershare trip data - Stefanie Brod...
PDF
Avoiding Bad Database Surprises: Simulation and Scalability - Steven Lott
PDF
Words in Space - Rebecca Bilbro
PDF
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
PPTX
Pydata beautiful soup - Monica Puerto
PDF
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
PPTX
Extending Pandas with Custom Types - Will Ayd
PDF
Measuring Model Fairness - Stephen Hoover
PDF
What's the Science in Data Science? - Skipper Seabold
PDF
Applying Statistical Modeling and Machine Learning to Perform Time-Series For...
PDF
Solving very simple substitution ciphers algorithmically - Stephen Enright-Ward
PDF
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
Michal Mucha: Build and Deploy an End-to-end Streaming NLP Insight System | P...
Unit testing data with marbles - Jane Stewart Adams, Leif Walsh
The TileDB Array Data Storage Manager - Stavros Papadopoulos, Jake Bolewski
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Deploying Data Science for Distribution of The New York Times - Anne Bauer
Graph Analytics - From the Whiteboard to Your Toolbox - Sam Lerma
Do Your Homework! Writing tests for Data Science and Stochastic Code - David ...
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
Mining dockless bikeshare and dockless scootershare trip data - Stefanie Brod...
Avoiding Bad Database Surprises: Simulation and Scalability - Steven Lott
Words in Space - Rebecca Bilbro
End-to-End Machine learning pipelines for Python driven organizations - Nick ...
Pydata beautiful soup - Monica Puerto
1D Convolutional Neural Networks for Time Series Modeling - Nathan Janos, Jef...
Extending Pandas with Custom Types - Will Ayd
Measuring Model Fairness - Stephen Hoover
What's the Science in Data Science? - Skipper Seabold
Applying Statistical Modeling and Machine Learning to Perform Time-Series For...
Solving very simple substitution ciphers algorithmically - Stephen Enright-Ward
The Face of Nanomaterials: Insightful Classification Using Deep Learning - An...
Ad

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Empathic Computing: Creating Shared Understanding
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
cuic standard and advanced reporting.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Chapter 3 Spatial Domain Image Processing.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
sap open course for s4hana steps from ECC to s4
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Network Security Unit 5.pdf for BCA BBA.
Empathic Computing: Creating Shared Understanding
A comparative analysis of optical character recognition models for extracting...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Reach Out and Touch Someone: Haptics and Empathic Computing
cuic standard and advanced reporting.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

BeakerX Beaker Extensions for Jupyter

Editor's Notes

  • #2: Good morning and welcome to this session. The Beaker Extensions for Jupyter: BeakerX Before that let me first introduce myself and what I do. My name is Tiezheng Li I am a software engineer at Two Sigma Since joining Two Sigma I’ve been working on a team that builds products for Modelers that make data easy to discover, consume, publish and visualize in Two Sigma BeakerX is one of our approaches to accomplish this goal.