SlideShare a Scribd company logo
Portable Scalable Data Visualization Techniques for Apache Spark and Python Notebook-based Analytics
Portable & Scalable Data
Visualization Techniques
for Spark & Notebook based Analytics
Douglas Moore
Enterprise Solutions Architect, Databricks
June 2020
Why Portable Data Visualizations?
Portability is Hard
https://www.anaconda.com/blog/python-data-visualization-2018-why-so-many-libraries
Graphic by Jake VanderPlas
Blog by James A. Bednar
Strategies
Summary: Portable Data Visualization Strategies
▪ Image buffer, Image file
▪ HTML/JS in-line
▪ Data Lake to Data Visualization
▪ Hooks
▪ Headless Chrome browser
▪ Add a proxy
▪ Scale out w/ Spark
Resources
Assets: https://github.com/dmoore247/spark-ai-summit-2020
▪ Demo Notebook
References
▪ Python Visualization Landscape
▪ pandas_bokeh
▪ pyviz.org
Feedback
Your feedback is important to us.
Don’t forget to rate and
review the sessions.
Portable Scalable Data Visualization Techniques for Apache Spark and Python Notebook-based Analytics
Data Lake Visualization Drawn to scaleBronze

More Related Content

PDF
Scalable AutoML for Time Series Forecasting using Ray
PDF
SQL Analytics Powering Telemetry Analysis at Comcast
PDF
Migrating Your Data Platform At a High Growth Startup
PDF
How R Developers Can Build and Share Data and AI Applications that Scale with...
PDF
Delivering Insights from 20M+ Smart Homes with 500M+ Devices
PDF
201905 Azure Databricks for Machine Learning
PDF
Healthcare Claim Reimbursement using Apache Spark
PDF
Scale and Optimize Data Engineering Pipelines with Software Engineering Best ...
Scalable AutoML for Time Series Forecasting using Ray
SQL Analytics Powering Telemetry Analysis at Comcast
Migrating Your Data Platform At a High Growth Startup
How R Developers Can Build and Share Data and AI Applications that Scale with...
Delivering Insights from 20M+ Smart Homes with 500M+ Devices
201905 Azure Databricks for Machine Learning
Healthcare Claim Reimbursement using Apache Spark
Scale and Optimize Data Engineering Pipelines with Software Engineering Best ...

What's hot (20)

PDF
Scaling and Modernizing Data Platform with Databricks
PDF
Delight: An Improved Apache Spark UI, Free, and Cross-Platform
PPTX
TechEvent Databricks on Azure
PDF
Geosp.AI.tial: Applying Big Data and Machine Learning to Solve the World's To...
PPTX
Spark - Migration Story
PDF
Azure Databricks—Apache Spark as a Service with Sascha Dittmann
PDF
Real-Time Analytics and Actions Across Large Data Sets with Apache Spark
PDF
Leveraging Apache Spark to Develop AI-Enabled Products and Services at Bosch
PDF
Getting Started with Databricks SQL Analytics
PDF
Data pipeline and data lake for autonomous driving
PDF
Democratizing Data
PDF
Building Robust Production Data Pipelines with Databricks Delta
PDF
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
PDF
Databricks + Snowflake: Catalyzing Data and AI Initiatives
PDF
Vectorized Deep Learning Acceleration from Preprocessing to Inference and Tra...
PDF
Building Identity Graphs over Heterogeneous Data
PDF
An Approach to Data Quality for Netflix Personalization Systems
PDF
Saving Energy in Homes with a Unified Approach to Data and AI
PDF
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
PDF
Bridging the Completeness of Big Data on Databricks
Scaling and Modernizing Data Platform with Databricks
Delight: An Improved Apache Spark UI, Free, and Cross-Platform
TechEvent Databricks on Azure
Geosp.AI.tial: Applying Big Data and Machine Learning to Solve the World's To...
Spark - Migration Story
Azure Databricks—Apache Spark as a Service with Sascha Dittmann
Real-Time Analytics and Actions Across Large Data Sets with Apache Spark
Leveraging Apache Spark to Develop AI-Enabled Products and Services at Bosch
Getting Started with Databricks SQL Analytics
Data pipeline and data lake for autonomous driving
Democratizing Data
Building Robust Production Data Pipelines with Databricks Delta
Building an Enterprise Data Platform with Azure Databricks to Enable Machine ...
Databricks + Snowflake: Catalyzing Data and AI Initiatives
Vectorized Deep Learning Acceleration from Preprocessing to Inference and Tra...
Building Identity Graphs over Heterogeneous Data
An Approach to Data Quality for Netflix Personalization Systems
Saving Energy in Homes with a Unified Approach to Data and AI
Lambda Architecture in the Cloud with Azure Databricks with Andrei Varanovich
Bridging the Completeness of Big Data on Databricks
Ad

Similar to Portable Scalable Data Visualization Techniques for Apache Spark and Python Notebook-based Analytics (20)

PPTX
Data-Visualization-with-Python-4 PPT.ppt
PDF
DAVLectuer3 Exploratory data analysis .pdf
PDF
datavisualizationinpythonv2-171103225436.pdf
PPTX
Data-Visualization-with-Python-2 PPT.pptx
PDF
Data visualization in Python
PDF
Python Visualisation for Data Science
PPTX
DATA ANALYSIS AND VISUALISATION using python 2
PDF
Data Analysis and Visualization using Python
PDF
7-Steps to Perform Data Visualization- Pickl.AI
PPTX
Threat hunting using notebook technologies
PPTX
Data Visualization in Python of b.tech student.pptx
PPTX
Role of Visualization in Data Management
PPTX
alltrinbruno(DS2) data science visualisation.pptx
PDF
Unlocking Insights Data Analysis Visualization
PDF
Visualizing big data in the browser using spark
PPTX
Introduction to Data Visualization, Importance and types
PPTX
Azure Notebooks - Jupyter for the Cloud
PDF
Exploratory Data Analysis in Spark
PPTX
Data Visualization1.pptx
PPT
Big data visualization allotting by r and python with gui tools
Data-Visualization-with-Python-4 PPT.ppt
DAVLectuer3 Exploratory data analysis .pdf
datavisualizationinpythonv2-171103225436.pdf
Data-Visualization-with-Python-2 PPT.pptx
Data visualization in Python
Python Visualisation for Data Science
DATA ANALYSIS AND VISUALISATION using python 2
Data Analysis and Visualization using Python
7-Steps to Perform Data Visualization- Pickl.AI
Threat hunting using notebook technologies
Data Visualization in Python of b.tech student.pptx
Role of Visualization in Data Management
alltrinbruno(DS2) data science visualisation.pptx
Unlocking Insights Data Analysis Visualization
Visualizing big data in the browser using spark
Introduction to Data Visualization, Importance and types
Azure Notebooks - Jupyter for the Cloud
Exploratory Data Analysis in Spark
Data Visualization1.pptx
Big data visualization allotting by r and python with gui tools
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
PPT
Data Lakehouse Symposium | Day 1 | Part 2
PPTX
Data Lakehouse Symposium | Day 2
PPTX
Data Lakehouse Symposium | Day 4
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PDF
Democratizing Data Quality Through a Centralized Platform
PDF
Learn to Use Databricks for Data Science
PDF
Why APM Is Not the Same As ML Monitoring
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
PDF
Stage Level Scheduling Improving Big Data and AI Integration
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
PDF
Sawtooth Windows for Feature Aggregations
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PDF
Re-imagine Data Monitoring with whylogs and Spark
PDF
Raven: End-to-end Optimization of ML Prediction Queries
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
PDF
Massive Data Processing in Adobe Using Delta Lake
DW Migration Webinar-March 2022.pptx
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 4
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Democratizing Data Quality Through a Centralized Platform
Learn to Use Databricks for Data Science
Why APM Is Not the Same As ML Monitoring
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Stage Level Scheduling Improving Big Data and AI Integration
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Sawtooth Windows for Feature Aggregations
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Re-imagine Data Monitoring with whylogs and Spark
Raven: End-to-end Optimization of ML Prediction Queries
Processing Large Datasets for ADAS Applications using Apache Spark
Massive Data Processing in Adobe Using Delta Lake

Recently uploaded (20)

PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Introduction to Business Data Analytics.
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPT
Quality review (1)_presentation of this 21
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Major-Components-ofNKJNNKNKNKNKronment.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
1_Introduction to advance data techniques.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Global journeys: estimating international migration
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Lecture1 pattern recognition............
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Business Ppt On Nestle.pptx huunnnhhgfvu
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Introduction to Business Data Analytics.
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Quality review (1)_presentation of this 21
.pdf is not working space design for the following data for the following dat...
Major-Components-ofNKJNNKNKNKNKronment.pptx
Fluorescence-microscope_Botany_detailed content
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
1_Introduction to advance data techniques.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Miokarditis (Inflamasi pada Otot Jantung)
Clinical guidelines as a resource for EBP(1).pdf
Global journeys: estimating international migration
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Lecture1 pattern recognition............
Introduction-to-Cloud-ComputingFinal.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush

Portable Scalable Data Visualization Techniques for Apache Spark and Python Notebook-based Analytics