Elastic Clusters for Data
Analysis in OpenNebula in
EUBra-BIGSEA
Carlos de Alfonso, on Behalf of the
EUBRA-BIGSEA Consortium
www.eubra-bigsea.eu | @bigsea_eubr 1
BIGSEA in a Nutshell
• A European-Brazilian Consortium aiming at
• The European-Brazilian user scenario will make use of the
Data Analytics API and the programming model interface.
• The programming model interface will use the services
deployed using the TOSCA standard.
― Service specifications will be instantiated on a cloud
IaaS, automatically managing Quality of Service.
― The BigData Analytics services will run on the cloud
infrastructure.
• A security framework will be defined to provide security
and privacy.
[Figure: BIGSEA overview diagram. Blocks: QoS IaaS, Big Data Analytics, Programming model interface, Security, User Scenario, Development IDE. Labels: Access tokens, Assurance, BigData, Privacy Assurance, TOSCA specs, Requests.]
BIGSEA Architecture
Infrastructure Manager (IM)
www.grycap.upv.es/im
• Two approaches to virtual machine image (VMI) management:
- Deploy existing vanilla images (plain OS) and configure the instances with tools that recreate the desired configuration.
  Example: instantiate a plain Ubuntu 14.04 and use Ansible to automatically install a full LAMP (Apache, MySQL, PHP) stack.
- Create specific VMIs for different clouds from templates.
• Deployments are based on recipes, configuration and contextualization services.
• Enables platform-agnostic or hybrid deployments.
• Supports TOSCA.
[Figure label: VMI catalogue]
Elastic Clusters on the Cloud (EC3),
http://www.grycap.upv.es/ec3
• EC3 (Elastic Cloud Computing Cluster) leverages CLUES
(http://www.grycap.upv.es/clues) to create elastic virtual clusters
in the Cloud.
- No upfront investment.
- Customizable.
- Usage-dependent cost.
- Automatic elasticity.
Resource-level elasticity
• When a new job is submitted to a Mesos framework, EC3 checks whether there are enough resources to fulfil the request.
- If there are not, EC3 requests the Cloud Management Framework (CMF) to deploy new VMs that provide the necessary resources for the new framework.
- When resources remain idle for a while, they are undeployed.
- Non-elastic resources are never undeployed.
• EC3 uses the Infrastructure Manager to install and reconfigure the software.
• A snapshot of the first worker node (WN) is stored to speed up the deployment of further nodes.
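The scale-up and scale-down rules on this slide can be sketched as a small planning function. This is only an illustration under stated assumptions, not the EC3 or CLUES implementation: the `Node` class, the `plan` function, the CPU-only resource model and the `idle_timeout` parameter are all hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical view of one cluster node (not an EC3/CLUES type)."""
    name: str
    cpus: int
    elastic: bool = True   # non-elastic nodes are never undeployed
    busy_cpus: int = 0
    idle_seconds: int = 0


def plan(nodes, requested_cpus, cpus_per_new_node, idle_timeout):
    """Return (number of nodes to deploy, names of nodes to undeploy)."""
    free = sum(n.cpus - n.busy_cpus for n in nodes)

    # Scale up: deploy enough new nodes to cover the missing CPUs.
    missing = max(0, requested_cpus - free)
    to_deploy = -(-missing // cpus_per_new_node)  # ceiling division

    # Scale down: undeploy elastic nodes that have been idle long enough,
    # as long as the remaining free CPUs still cover the request.
    to_undeploy = []
    for n in nodes:
        idle = n.busy_cpus == 0 and n.idle_seconds >= idle_timeout
        if n.elastic and idle and free - n.cpus >= requested_cpus:
            to_undeploy.append(n.name)
            free -= n.cpus
    return to_deploy, to_undeploy


nodes = [
    Node("front", cpus=2, elastic=False, idle_seconds=9999),
    Node("wn1", cpus=4, busy_cpus=4),
    Node("wn2", cpus=4, idle_seconds=900),
]
print(plan(nodes, requested_cpus=0, cpus_per_new_node=4, idle_timeout=600))
print(plan(nodes, requested_cpus=12, cpus_per_new_node=4, idle_timeout=600))
```

With no pending request, only the idle elastic worker is selected for undeployment; the non-elastic front node stays even though it has been idle longer. With a 12-CPU request, the idle worker is kept and two additional nodes are requested.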
Platform-agnostic Deployment
Vertical elasticity at the
resource level
• VMs are allocated from a CMF that manages a general-purpose on-premise cloud.
• Physical limitations normally lie in the memory size rather than in the CPU share.
• A physical machine can be shared among different deployments:
- Either the BIGSEA stack or another one.
- Or due to increased isolation.
• Oversubscribing memory allows a higher number of VMs per physical node, without being bound to a specific partitioning.
• A VM from one deployment can "release" its free memory to other VMs in the node.
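The arithmetic behind memory oversubscription can be illustrated with a short sketch. It is not the mechanism used in BIGSEA; the function names, the oversubscription ratio and the safety margin are hypothetical values chosen for the example.

```python
def vms_per_node(node_mem_mb, vm_mem_mb, oversubscription=1.0):
    """Number of VMs of a given nominal size that fit on one physical node.

    An oversubscription ratio above 1.0 lets the sum of nominal VM sizes
    exceed the physical memory (hypothetical parameter for illustration).
    """
    return int(node_mem_mb * oversubscription) // vm_mem_mb


def reclaimable_mb(allocated_mb, used_mb, margin=0.25):
    """Memory a VM could release while keeping a safety margin over its usage."""
    keep = int(used_mb * (1 + margin))
    return max(0, allocated_mb - keep)


# A 64 GiB node hosting 8 GiB VMs, without and with 1.5x oversubscription.
print(vms_per_node(65536, 8192))
print(vms_per_node(65536, 8192, 1.5))

# A VM that uses 2 GiB of its 8 GiB can hand most of the rest to its neighbours.
print(reclaimable_mb(8192, 2048))
```

Oversubscription only pays off when the VMs actually leave memory unused, which is why the second function estimates how much each VM can give back before a higher ratio is applied.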
The ONE Backend
Conclusions
• EUBra-BIGSEA aims at providing a set of cloud services to:
- Facilitate the deployment of complex multi-framework Big Data infrastructures.
- Provide elasticity at the level of the physical resources and frameworks.
- Include QoS constraints.
- Facilitate the use of customized environments.
• Those cloud services are directly used by higher-level programming models, hiding the platform particularities.
Thanks
• The authors would also like to thank the Spanish
"Ministerio de Economía, Industria y
Competitividad" for the project "BigCLOE" with
reference number TIN2016-79951-R.


OpenNebulaConf2017EU: Elastic Clusters for Data Analysis by Carlos de Alfonso, EUBra-BIGSEA
