Scalable Machine/Deep Learning with
Apache SystemML on Power
Why Apache SystemML
 Today’s Roles of Data Scientists
– Algorithm researcher: Invent new optimization schemes
– Systems programmer: Provide distributed implementations
– Deployment engineer: Run for varying datasets
– Systems researcher: Optimize clusters
 SystemML simplifies the life of data scientists
– Implementing custom machine learning algorithms
– Running algorithms distributed if needed
– Running the same algorithms on data ranging from small to large
[Figure labels: NIPS, ICML, KDD, JMLR]
Apache SystemML – Declarative Machine Learning
 Productivity of data scientists
– Machine learning language for data scientists
(“The SQL for analytics”)
– Strong foundation in linear algebra and statistical functions
– Comes with 20+ pre-implemented algorithms
– Enables development of solutions and tools
 Scalability & Performance
– Built on data parallel platforms, e.g. Spark
 Cost-based optimizer to compile execution plans
– Depending on data characteristics (tall/skinny, short/wide) and cluster
characteristics
– Ranging from in-memory single node to clusters (MapReduce, Spark),
and hybrid plans
 APIs & Tools
– Command line: standalone Java app, spark-submit, hadoop jar
– Use in Spark through Scala, Python, R, and Java APIs
– Embeddable scoring library
– Tools: REPL (Scala Spark and pyspark), SparkR, SparkML,
Jupyter, Zeppelin Notebooks
[Figure: SystemML stack with Language, Compiler, and Runtime layers; runtime backends: In-Memory Single Node (scale-up), Hadoop or Spark Cluster (scale-out), GPU backend (in progress)]
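The plan choice described on this slide (data characteristics plus cluster characteristics decide between single-node and distributed execution) can be illustrated with a minimal sketch. This is not SystemML's optimizer. The function names, the size formulas (8 bytes per dense cell, about 12 bytes per nonzero plus 4 bytes per row for a sparse layout), and the 70% memory budget are all assumptions made for illustration.

```python
def estimate_matrix_bytes(rows, cols, sparsity):
    """Rough in-memory size of a double-precision matrix.

    Assumption (not from the slides): dense cells cost 8 bytes; a sparse
    layout costs about 12 bytes per nonzero plus 4 bytes per row.
    """
    dense = rows * cols * 8
    nnz = int(rows * cols * sparsity)
    sparse = nnz * 12 + rows * 4
    return min(dense, sparse)


def choose_plan(rows, cols, sparsity, driver_memory_bytes, budget_fraction=0.7):
    """Return a hypothetical plan label based on data size and memory budget."""
    size = estimate_matrix_bytes(rows, cols, sparsity)
    budget = driver_memory_bytes * budget_fraction
    if size <= budget:
        return "single-node in-memory"
    return "distributed (Spark)"


GB = 1024 ** 3
# 64K x 1000 matrix with a 100 GB driver: fits in memory.
small = choose_plan(64_000, 1000, 0.95, 100 * GB)
# 500M x 1000 matrix with the same driver: exceeds the budget.
large = choose_plan(500_000_000, 1000, 0.95, 100 * GB)
print(small)
print(large)
```

A real cost-based optimizer would compare estimated execution costs of alternative plans per operator; the sketch only shows the basic idea of letting size estimates drive the choice.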
Apache SystemML Open Source
 Apache open source project (http://systemml.apache.org/)
– Nov. 2015, Start SystemML Apache Incubator Project
– …
– Feb. 2017, Release 0.12.0 on Spark 1.6.x …, Python API.
– May 2017, Release 0.14.0 on Spark 2.0.2+.
– May 2017, Apache Top Level Project
– Sep 2017, Release 0.15
 Release downloads (http://systemml.apache.org/download)
– Binaries
– Coordinates to Maven repository
 GitHub source code (https://github.com/apache/systemml)
 Documentation (https://apache.github.io/systemml/)
 3-hour KDD hands-on tutorial (http://systemml.apache.org/tutorial-kdd2017.html), Aug. 2017
https://github.com/apache/systemml/blob/master/samples/jupyter-notebooks/Deep_Learning_Image_Classification.ipynb
Handwritten Digits Image Classification
Using LeNet CNN
SystemML on Power Environment
 Contributed native ppc64le libraries for JCuda to the mavenized JCuda project
– GPU backend on Power for SystemML
 Contributed native ppc64le libraries to protoc project
– Useful for compiling Caffe proto files
 Supported native BLAS operations in SystemML
– Matrix Multiplication, Convolution (forward/backward)
– OpenBLAS with OpenMP support
Linear Regression Conjugate Gradient
(preliminary 1/2)
[Chart: time in seconds (0 to 14) vs. number of rows of the input matrix in thousands (64, 128, 256, 512, 1024, 2048); series: PPC CPU Time, PPC GPU Time, x86 CPU Time, x86 GPU Time]
 Data: random with sparsity 0.95, 1000 features
 Icpt: 0, maxi: 20, tol: 0.001, reg: 0.01
 Driver-memory: 100G, local[*] master
 The M-V multiplication chain is memory bound, but more cores help with parallelization.
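For readers unfamiliar with the benchmarked algorithm, the following is a minimal pure-Python sketch of linear regression solved by conjugate gradient on the regularized normal equations. It is not the script shipped with SystemML; the function names are hypothetical, and treating `tol` as a relative tolerance on the residual norm is an assumption. The parameters `reg`, `maxi`, and `tol` mirror the settings listed on the slide, with no intercept (icpt: 0).

```python
def matvec(X, v):
    """Compute X @ v for a row-major list-of-lists matrix."""
    return [sum(x_ij * v_j for x_ij, v_j in zip(row, v)) for row in X]


def t_matvec(X, u):
    """Compute t(X) @ u without materializing the transpose."""
    out = [0.0] * len(X[0])
    for row, u_i in zip(X, u):
        for j, x_ij in enumerate(row):
            out[j] += x_ij * u_i
    return out


def linreg_cg(X, y, reg=0.01, maxi=20, tol=0.001):
    """Solve (t(X) X + reg * I) w = t(X) y by conjugate gradient, no intercept."""
    w = [0.0] * len(X[0])
    r = [-g for g in t_matvec(X, y)]   # gradient of the objective at w = 0
    p = [-r_j for r_j in r]            # initial search direction
    norm_r2 = sum(r_j * r_j for r_j in r)
    target = norm_r2 * tol * tol       # assumed relative stopping criterion
    i = 0
    while i < maxi and norm_r2 > target:
        # Matrix-vector chain t(X) (X p): two passes over X per iteration.
        q = t_matvec(X, matvec(X, p))
        q = [q_j + reg * p_j for q_j, p_j in zip(q, p)]
        alpha = norm_r2 / sum(p_j * q_j for p_j, q_j in zip(p, q))
        w = [w_j + alpha * p_j for w_j, p_j in zip(w, p)]
        r = [r_j + alpha * q_j for r_j, q_j in zip(r, q)]
        old_norm_r2 = norm_r2
        norm_r2 = sum(r_j * r_j for r_j in r)
        beta = norm_r2 / old_norm_r2
        p = [-r_j + beta * p_j for r_j, p_j in zip(r, p)]
        i += 1
    return w


X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [2.0, -1.0, 1.0]                   # generated from weights [2, -1], no noise
w = linreg_cg(X, y, reg=0.0)
print(w)
```

Each iteration touches every entry of X twice while doing only a multiply and an add per entry, which is why the chain tends to be limited by memory bandwidth rather than by arithmetic.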
Linear Regression Conjugate Gradient
(preliminary 2/2)
[Chart: time in seconds (0 to 14) vs. number of rows of the input matrix in thousands (64, 256, 1024); series: PPC GPU Time, x86 GPU Time]
 Data: random with sparsity 0.95, 1000 features
 Icpt: 0, maxi: 20, tol: 0.001, reg: 0.01
 Driver-memory: 100G, local[*] master
[Chart: CPU-GPU Transfer Time; time in seconds (0 to 7) vs. number of rows of the input matrix in thousands (64, 256, 1024); series: PPC toDev Time, x86 toDev Time]
 Most of the time is spent transferring data from host to device
-> 2x performance benefit due to CPU-GPU NVLink
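The link between transfer time and interconnect bandwidth can be made concrete with a small back-of-the-envelope sketch. It assumes the input is stored as a dense double-precision matrix and that transfer time is simply bytes divided by bandwidth. The two bandwidth values are hypothetical placeholders chosen to differ by a factor of two, not measurements of PCIe or NVLink, and the function names are illustrative.

```python
def dense_bytes(rows, cols, bytes_per_value=8):
    """Size of a dense matrix in bytes (assumes double precision)."""
    return rows * cols * bytes_per_value


def transfer_seconds(num_bytes, bandwidth_gb_per_s):
    """Idealized host-to-device transfer time, ignoring latency and overheads."""
    return num_bytes / (bandwidth_gb_per_s * 1e9)


rows, features = 1_024_000, 1000       # largest input in the transfer chart
size = dense_bytes(rows, features)
slow = transfer_seconds(size, 10.0)    # hypothetical baseline link
fast = transfer_seconds(size, 20.0)    # hypothetical link with twice the bandwidth
print(f"{size / 1e9:.2f} GB: {slow:.2f} s vs. {fast:.2f} s")
```

Under this simple model, when transfer dominates the end-to-end time, doubling the host-to-device bandwidth roughly halves the total, which is consistent with the 2x benefit reported on the slide.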
More Details
 Matthias Boehm, Alexandre Evfimievski, Niketan Pansare, Berthold Reinwald, Prithviraj Sen: Declarative, Large-Scale Machine Learning with Apache SystemML, 3 hours hands-on tutorial, KDD 2017
 Tarek Elgamal, Shangyu Luo, Matthias Boehm, Alexandre V. Evfimievski, Shirish Tatikonda, Berthold Reinwald, Prithviraj Sen: SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning. CIDR 2017
 Ahmed Elgohary, Matthias Boehm, Peter J. Haas, Frederick R. Reiss, Berthold Reinwald: Compressed Linear Algebra for Large Scale
Machine Learning. VLDB 2016 (Best Paper Award)
– Extended Version to appear in VLDB Journal, 2017
– Summary Version to appear in ACM SIGMOD Record Research Highlights, 2017
 Matthias Boehm, Michael W. Dusenberry, Deron Eriksson, Alexandre V. Evfimievski, Faraz Makari Manshadi, Niketan Pansare, Berthold
Reinwald, Frederick R. Reiss, Prithviraj Sen, Arvind C. Surve, Shirish Tatikonda. SystemML: Declarative Machine Learning on Spark. VLDB
2016
 Botong Huang, Matthias Boehm, Yuanyuan Tian, Berthold Reinwald, Shirish Tatikonda, Frederick R. Reiss: Resource Elasticity for Large-Scale Machine Learning. SIGMOD 2015: 137-152
 Arash Ashari, Shirish Tatikonda, Matthias Boehm, Berthold Reinwald, Keith Campbell, John Keenleyside, P. Sadayappan: On optimizing machine learning workloads via kernel fusion. PPoPP 2015: 173-182
 Sebastian Schelter, Juan Soto, Volker Markl, Douglas Burdick, Berthold Reinwald, Alexandre V. Evfimievski: Efficient sample generation for
scalable meta learning. ICDE 2015: 1191-1202
 Matthias Boehm, Douglas R. Burdick, Alexandre V. Evfimievski, Berthold Reinwald, Frederick R. Reiss, Prithviraj Sen, Shirish
Tatikonda, Yuanyuan Tian: SystemML's Optimizer: Plan Generation for Large-Scale Machine Learning Programs. IEEE Data Eng.
Bull. 37(3): 52-62 (2014)
 Matthias Boehm, Shirish Tatikonda, Berthold Reinwald, Prithviraj Sen, Yuanyuan Tian, Douglas Burdick, Shivakumar Vaithyanathan: Hybrid
Parallelization Strategies for Large-Scale Machine Learning in SystemML. PVLDB 7(7): 553-564 (2014)
 Peter D. Kirchner, Matthias Boehm, Berthold Reinwald, Daby M. Sow, Michael Schmidt, Deepak S. Turaga, Alain Biem: Large Scale
Discriminative Metric Learning. IPDPS Workshop 2014: 1656-1663
 Yuanyuan Tian, Shirish Tatikonda, Berthold Reinwald: Scalable and Numerically Stable Descriptive Statistics in SystemML. ICDE 2012: 1351-1359
 Amol Ghoting, Rajasekar Krishnamurthy, Edwin P. D. Pednault, Berthold Reinwald, Vikas Sindhwani, Shirish Tatikonda, Yuanyuan
Tian, Shivakumar Vaithyanathan: SystemML: Declarative machine learning on MapReduce. ICDE 2011: 231-242
[Topic labels next to the papers on the slide: Hands-on Tutorial, Automatic Rewrites & Fusion, Compression, 1st paper on Spark, Resource Elasticity, GPU, Sampling, Optimizer, Task Parallelism, Custom Algorithm, Numeric Stability]
Summary
 SystemML simplifies the life of data scientists
 Custom Machine/Deep Learning Algorithms
 Scale up & out
 Mixed Workloads
– Memory access bound
– Compute bound
 Strike a balance between
– Data transfer
– Parallelism
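The distinction between memory-access-bound and compute-bound workloads can be sketched with arithmetic intensity (floating-point operations per byte moved). The model below is idealized: it assumes double precision and that each operand is moved between memory and processor exactly once. The function names and matrix shapes are illustrative assumptions.

```python
def matvec_intensity(m, n, bytes_per_value=8):
    """FLOPs per byte for y = X v, with X of shape m x n."""
    flops = 2 * m * n                                   # one multiply and one add per entry
    bytes_moved = (m * n + n + m) * bytes_per_value     # read X and v, write y
    return flops / bytes_moved


def matmul_intensity(m, n, k, bytes_per_value=8):
    """FLOPs per byte for C = A B, with A of shape m x k and B of shape k x n."""
    flops = 2 * m * n * k
    bytes_moved = (m * k + k * n + m * n) * bytes_per_value
    return flops / bytes_moved


mv = matvec_intensity(1_000_000, 1000)
mm = matmul_intensity(1000, 1000, 1000)
print(f"matrix-vector: {mv:.3f} FLOPs/byte, matrix-matrix: {mm:.1f} FLOPs/byte")
```

Matrix-vector products stay below 0.25 FLOPs per byte in this model regardless of size, so they are limited by memory bandwidth, while matrix-matrix products grow in intensity with the matrix dimensions and can keep compute units busy.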