SlideShare a Scribd company logo
Reproducible
Distributed
Experiments
Kamal Hakimzadeh
PhD Student
mahh@kth.se
1
Jim Dowling
Associate Professor
jdowling@kth.se
www.karamel.io
Agenda
2
• Motivation
• Reproducibility
• Demo: Simple experiment 30-40 min
• Karamel Rep.
• Karamel Engine
• Orchestration
• Challenges
www.karamel.io
Motivation
3
Analytical vs Empirical proof
DS supports many scientific advancements
Scheduling, fault tolerant, scalability …
Extremely complex
www.karamel.io
Reproducible vs. Replicable
4
1. Laboratory
2. Experimenter
3. Apparatus
Reproducible
Replicable
Computational Reproducibility: Infrastructure, software, experiment and data
www.karamel.io
Demo : Word Count
5
Text Generator Text Generator Text Generator
Word Count
www.karamel.io
Karamel: Rep. in different layers
6
Bare Metal
Google Compute Engine
Virtual Machine is and abstract entity
Software is defined in Chef It is publicly available in Github
www.karamel.io
Karamel Engine
7
DSL Service
Cloud Clients
Karamel Engine
Physical
Mapping
Orchestrator
www.karamel.io
Orchestration – queuing model
8www.karamel.io
Result
9www.karamel.io
Challenges and future work
10
Scalability
Fault Recovery Model
Elasticity – Handle Churn
Instrumentation
Recommendation System
Language Support
Load generators
Scheduling
Container base machines Result Management
Debugging
www.karamel.io
Team members
11
Kamal Hakimzadeh
PhD Student at KTH
mahh@kth.se
Alberto Lorente Leal
Software Developer at Comeon
a.lorenteleal@gmail.com
Jim Dowling
Associate Professor at KTH
jdowling@kth.se
Hooman Peiro Sajjad
PhD Student at KTH
shps@kth.se
Abhimanyu Babbar
Backend Developer at Wrap
abhimanyu.babbar88@gmail.comwww.karamel.io
12www.karamel.io

More Related Content

PDF
Vyacheslav Zholudev – Flink, a Convenient Abstraction Layer for Yarn?
PDF
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
PDF
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
PDF
Jim Dowling – Interactive Flink analytics with HopsWorks and Zeppelin
PDF
Christian Kreuzfeld – Static vs Dynamic Stream Processing
PDF
Rental Cars and Industrialized Learning to Rank with Sean Downes
PDF
Databricks: What We Have Learned by Eating Our Dog Food
PDF
Interactive Data Analysis with Apache Flink @ Flink Meetup in Berlin
Vyacheslav Zholudev – Flink, a Convenient Abstraction Layer for Yarn?
Suneel Marthi – BigPetStore Flink: A Comprehensive Blueprint for Apache Flink
Marc Schwering – Using Flink with MongoDB to enhance relevancy in personaliza...
Jim Dowling – Interactive Flink analytics with HopsWorks and Zeppelin
Christian Kreuzfeld – Static vs Dynamic Stream Processing
Rental Cars and Industrialized Learning to Rank with Sean Downes
Databricks: What We Have Learned by Eating Our Dog Food
Interactive Data Analysis with Apache Flink @ Flink Meetup in Berlin

What's hot (20)

PPTX
Stateful Stream Processing at In-Memory Speed
PDF
Albert Bifet – Apache Samoa: Mining Big Data Streams with Apache Flink
PDF
Tuning ML Models: Scaling, Workflows, and Architecture
PPTX
Assaf Araki – Real Time Analytics at Scale
PDF
Bay Area Apache Flink Meetup Community Update August 2015
PDF
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
PDF
Spark Summit EU talk by Heiko Korndorf
PDF
Moon soo Lee – Data Science Lifecycle with Apache Flink and Apache Zeppelin
PDF
Operationalizing Machine Learning at Scale with Sameer Nori
PDF
Spark Summit EU talk by Ahsan Javed Awan
PDF
From R Script to Production Using rsparkling with Navdeep Gill
PDF
Insights Without Tradeoffs: Using Structured Streaming
PPTX
Flink Streaming
PDF
Enabling Scalable Data Science Pipeline with Mlflow at Thermo Fisher Scientific
PDF
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
PDF
Presto
PDF
Memory Optimization and Reliable Metrics in ML Pipelines at Netflix
PDF
Building Data Pipelines in Python
PPTX
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
PDF
Accelerating Machine Learning on Databricks Runtime
Stateful Stream Processing at In-Memory Speed
Albert Bifet – Apache Samoa: Mining Big Data Streams with Apache Flink
Tuning ML Models: Scaling, Workflows, and Architecture
Assaf Araki – Real Time Analytics at Scale
Bay Area Apache Flink Meetup Community Update August 2015
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Spark Summit EU talk by Heiko Korndorf
Moon soo Lee – Data Science Lifecycle with Apache Flink and Apache Zeppelin
Operationalizing Machine Learning at Scale with Sameer Nori
Spark Summit EU talk by Ahsan Javed Awan
From R Script to Production Using rsparkling with Navdeep Gill
Insights Without Tradeoffs: Using Structured Streaming
Flink Streaming
Enabling Scalable Data Science Pipeline with Mlflow at Thermo Fisher Scientific
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
Presto
Memory Optimization and Reliable Metrics in ML Pipelines at Netflix
Building Data Pipelines in Python
A Data Streaming Architecture with Apache Flink (berlin Buzzwords 2016)
Accelerating Machine Learning on Databricks Runtime
Ad

Viewers also liked (20)

PDF
K. Tzoumas & S. Ewen – Flink Forward Keynote
PDF
Flink Apachecon Presentation
PDF
Alexander Kolb – Flink. Yet another Streaming Framework?
PDF
Apache Flink internals
PDF
Ufuc Celebi – Stream & Batch Processing in one System
PDF
Introduction to Apache Flink - Fast and reliable big data processing
PDF
Maximilian Michels – Google Cloud Dataflow on Top of Apache Flink
PDF
Mikio Braun – Data flow vs. procedural programming
PDF
Marton Balassi – Stateful Stream Processing
PPTX
Slim Baltagi – Flink vs. Spark
PDF
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
PPTX
Flink Case Study: Bouygues Telecom
PPTX
Apache Flink Training: DataStream API Part 1 Basic
PDF
Vasia Kalavri – Training: Gelly School
PPTX
Michael Häusler – Everyday flink
PPTX
Aljoscha Krettek – Notions of Time
PPTX
Chris Hillman – Beyond Mapreduce Scientific Data Processing in Real-time
PPTX
Apache Flink Training: DataStream API Part 2 Advanced
PPTX
Apache Flink: API, runtime, and project roadmap
PPTX
Apache Flink Training: System Overview
K. Tzoumas & S. Ewen – Flink Forward Keynote
Flink Apachecon Presentation
Alexander Kolb – Flink. Yet another Streaming Framework?
Apache Flink internals
Ufuc Celebi – Stream & Batch Processing in one System
Introduction to Apache Flink - Fast and reliable big data processing
Maximilian Michels – Google Cloud Dataflow on Top of Apache Flink
Mikio Braun – Data flow vs. procedural programming
Marton Balassi – Stateful Stream Processing
Slim Baltagi – Flink vs. Spark
Mohamed Amine Abdessemed – Real-time Data Integration with Apache Flink & Kafka
Flink Case Study: Bouygues Telecom
Apache Flink Training: DataStream API Part 1 Basic
Vasia Kalavri – Training: Gelly School
Michael Häusler – Everyday flink
Aljoscha Krettek – Notions of Time
Chris Hillman – Beyond Mapreduce Scientific Data Processing in Real-time
Apache Flink Training: DataStream API Part 2 Advanced
Apache Flink: API, runtime, and project roadmap
Apache Flink Training: System Overview
Ad

Similar to Kamal Hakimzadeh – Reproducible Distributed Experiments (20)

PDF
PyData Meetup - Feature Store for Hopsworks and ML Pipelines
PPTX
Hangman for the Masses Showcase of Web Tech
PPTX
Agile Experiments in Machine Learning
PDF
Rethinking Testing
PDF
Learning on Deep Learning
PPTX
Practical unit testing in c & c++
PPTX
Introduction to Agile Hardware
PPTX
Automated testing on steroids – Trick for managing test data using Docker sna...
PPTX
Recommendations for Building Machine Learning Software
PDF
Agile experiments in Machine Learning with F#
PPTX
Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...
PPTX
Techorama 2017 - Testing the unit, and beyond.
PDF
Bootstrapping of PySpark Models for Factorial A/B Tests
PDF
Open Source Tools for Materials Informatics
PPTX
Web tech: lecture 5
PDF
Test strategies for data processing pipelines
PDF
How to test a Mainframe Application
PDF
Machine learning model to production
PPTX
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
PDF
Machine learning systems for engineers
PyData Meetup - Feature Store for Hopsworks and ML Pipelines
Hangman for the Masses Showcase of Web Tech
Agile Experiments in Machine Learning
Rethinking Testing
Learning on Deep Learning
Practical unit testing in c & c++
Introduction to Agile Hardware
Automated testing on steroids – Trick for managing test data using Docker sna...
Recommendations for Building Machine Learning Software
Agile experiments in Machine Learning with F#
Justin Basilico, Research/ Engineering Manager at Netflix at MLconf SF - 11/1...
Techorama 2017 - Testing the unit, and beyond.
Bootstrapping of PySpark Models for Factorial A/B Tests
Open Source Tools for Materials Informatics
Web tech: lecture 5
Test strategies for data processing pipelines
How to test a Mainframe Application
Machine learning model to production
MLOps and Reproducible ML on AWS with Kubeflow and SageMaker
Machine learning systems for engineers

More from Flink Forward (20)

PDF
Building a fully managed stream processing platform on Flink at scale for Lin...
PPTX
Evening out the uneven: dealing with skew in Flink
PPTX
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
PDF
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
PDF
Introducing the Apache Flink Kubernetes Operator
PPTX
Autoscaling Flink with Reactive Mode
PDF
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
PPTX
One sink to rule them all: Introducing the new Async Sink
PPTX
Tuning Apache Kafka Connectors for Flink.pptx
PDF
Flink powered stream processing platform at Pinterest
PPTX
Apache Flink in the Cloud-Native Era
PPTX
Where is my bottleneck? Performance troubleshooting in Flink
PPTX
Using the New Apache Flink Kubernetes Operator in a Production Deployment
PPTX
The Current State of Table API in 2022
PDF
Flink SQL on Pulsar made easy
PPTX
Dynamic Rule-based Real-time Market Data Alerts
PPTX
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
PPTX
Processing Semantically-Ordered Streams in Financial Services
PDF
Tame the small files problem and optimize data layout for streaming ingestion...
PDF
Batch Processing at Scale with Flink & Iceberg
Building a fully managed stream processing platform on Flink at scale for Lin...
Evening out the uneven: dealing with skew in Flink
“Alexa, be quiet!”: End-to-end near-real time model building and evaluation i...
Introducing BinarySortedMultiMap - A new Flink state primitive to boost your ...
Introducing the Apache Flink Kubernetes Operator
Autoscaling Flink with Reactive Mode
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
One sink to rule them all: Introducing the new Async Sink
Tuning Apache Kafka Connectors for Flink.pptx
Flink powered stream processing platform at Pinterest
Apache Flink in the Cloud-Native Era
Where is my bottleneck? Performance troubleshooting in Flink
Using the New Apache Flink Kubernetes Operator in a Production Deployment
The Current State of Table API in 2022
Flink SQL on Pulsar made easy
Dynamic Rule-based Real-time Market Data Alerts
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Processing Semantically-Ordered Streams in Financial Services
Tame the small files problem and optimize data layout for streaming ingestion...
Batch Processing at Scale with Flink & Iceberg

Recently uploaded (20)

PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
PPTX
Cloud computing and distributed systems.
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
A Presentation on Artificial Intelligence
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Unlocking AI with Model Context Protocol (MCP)
PPT
Teaching material agriculture food technology
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Dropbox Q2 2025 Financial Results & Investor Presentation
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development
Cloud computing and distributed systems.
Building Integrated photovoltaic BIPV_UPV.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Advanced methodologies resolving dimensionality complications for autism neur...
CIFDAQ's Market Insight: SEC Turns Pro Crypto
A Presentation on Artificial Intelligence
Review of recent advances in non-invasive hemoglobin estimation
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation_ Review paper, used for researhc scholars
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Unlocking AI with Model Context Protocol (MCP)
Teaching material agriculture food technology
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

Kamal Hakimzadeh – Reproducible Distributed Experiments