Big Data Europe Platform Release, May 3rd 2017
Big Data Europe Integrator Platform
Empowering Communities with Data Technologies
Platform release
Dr. Hajira Jabeen
Senior researcher
University of Bonn
Platform Goals
◎Open source
◎Ease of Use
◎Support a variety of use cases
◎Embrace emerging Big Data Technologies
◎Simple integration with custom components
Key actors
Platform Architecture Evolution
Platform Architecture Existing
Platform Architecture Alternate View
◎ Support Layer
o Init Daemon
o GUIs
o Monitor
◎ App Layer
o Traffic Forecast
o Satellite Image Analysis
◎ Platform Layer
o Spark
o Flink
o Semantic Layer: Ontario, SANSA, Semagrow
o Kafka
o Real-time Stream Monitoring
o ...
◎ Data Layer
o Hadoop
o NoSQL Store
o Cassandra
o Elasticsearch
o ...
o RDF Store
◎ Resource Management Layer (Swarm)
◎ Hardware Layer
o Premises
o Cloud (AWS, GCE, MS Azure, …)
BDE Supported Frameworks
◎ Search/indexing
o Apache Solr
◎ Data acquisition
o Apache Flume
◎ Message passing
o Apache Kafka
◎ Data storage
o Hue
o Apache Cassandra
o ScyllaDB
o Apache Hive
o PostGIS
◎ Data processing
o Apache Spark
o Apache Flink
◎ Semantic Components
o Strabon
o Sextant
o GeoTriples
o Silk
o SEMAGROW
o LIMES
o 4Store
o OpenLink Virtuoso
Platform features
◎ BDE Development Environment
o Stack builder
o Workflow builder
o Instructions to add custom components to the BDE stack
◎ Administrator Interface
o SwarmUI
◎ UI Integrator
o Workflow monitor
o Integrated web interface
What BDE Provides
◎Platform Installation Instructions
◎Usage Instructions
o Creating a stack
o Creating a workflow
o Monitoring the Stack
o Integration of Custom Components
Platform installation
◎Manual installation guide
◎Using Docker Machine
o On local machine (VirtualBox)
o In cloud (AWS, DigitalOcean, Azure)
o Bare metal
◎Screencasts
Deploying a Big Data Stack
◎ Stack Builder
◎ Stack
o Collection of communicating components to solve a specific problem
◎ Described in Docker Compose
o Component configuration
o Application topology
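As a rough illustration of what "described in Docker Compose" can mean, the sketch below models a stack as plain Python data: each component carries its configuration (image, environment), and the topology is the set of `depends_on` links between components. The service names, image names, and environment variables are hypothetical and are not taken from the BDE repositories; the printed JSON only mimics the shape of a Compose file and is not a valid Compose document by itself.

```python
import json

# Hypothetical two-component stack; names and images are illustrative only.
stack = {
    "services": {
        "spark-master": {
            "image": "example/spark-master",
            "environment": {"EXAMPLE_SETTING": "value"},
        },
        "spark-worker": {
            "image": "example/spark-worker",
            "environment": {"SPARK_MASTER": "spark://spark-master:7077"},
            "depends_on": ["spark-master"],
        },
    }
}


def topology(stack):
    """Return (component, dependency) edges declared in the stack."""
    edges = []
    for name, service in stack["services"].items():
        for dep in service.get("depends_on", []):
            edges.append((name, dep))
    return edges


def undefined_dependencies(stack):
    """List edges whose dependency is not a component of the stack."""
    services = stack["services"]
    return [(name, dep) for name, dep in topology(stack) if dep not in services]


if __name__ == "__main__":
    print(json.dumps(stack, indent=2))  # component configuration
    print(topology(stack))              # application topology
```

Keeping configuration and topology in one description is what lets a stack be deployed as a unit; a check such as `undefined_dependencies` is one simple way to catch a broken link before deployment.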
Creation of Workflows
◎Pipeline Builder
o Allows creation of dependencies among different applications
◎ Workflow Monitor
o Monitoring of pipeline-workflow using
Integrating Custom Components
◎Instructions
o Orchestrator required for initialization process (init_daemon)
❖ Components may depend on each other
❖ Components may require manual intervention
o User Interface Integration
❖ Standard Interfaces from components
❖ Combine and align the interfaces
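The initialization problem named above (components that depend on each other, some of which need manual intervention) can be sketched as a dependency ordering. The example below is a minimal illustration using Python's standard `graphlib` module (Python 3.9 or later); it is not the init_daemon's actual interface, and the component names and the set of manual steps are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical components: each maps to the components it must wait for.
dependencies = {
    "hdfs-namenode": set(),
    "hdfs-datanode": {"hdfs-namenode"},
    "load-data": {"hdfs-datanode"},
    "spark-job": {"load-data"},
}

# Hypothetical steps that a person has to confirm before the flow continues.
manual_steps = {"load-data"}


def startup_plan(dependencies, manual_steps):
    """Order components so that each one starts after its dependencies.

    Raises graphlib.CycleError if the dependencies are circular.
    """
    order = TopologicalSorter(dependencies).static_order()
    return [
        (name, "manual" if name in manual_steps else "automatic")
        for name in order
    ]


if __name__ == "__main__":
    for name, mode in startup_plan(dependencies, manual_steps):
        print(f"{name}: {mode}")
```

An orchestrator working along these lines would start each component only once the steps before it have finished, and would pause at a manual step until someone confirms it.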
User Interfaces
◎Target: Facilitate the use of the platform
o User Interface Adaptation
◎Available interfaces
o Workflow UIs
❖ Workflow Builder
❖ Workflow Monitor
o Swarm UI
o Integrator UI
Details!
Presentations by Ivan, Aad and Jens
Summary
Platform Architecture
Pilot Show Cases
SC1 - Open PHACTS discovery platform relating to biological/medical questions
SC2 - Discovery and Linking of Viticulture-relevant information
SC3 - System monitoring in energy production units
SC4 - Short-term traffic flow forecasting
SC5 - Supporting data-intensive climate research
SC6 - Citizens & Researchers Budget on Municipal Level
SC7 - Ingestion of remote sensing images and social sensing data to detect and verify changes on the Earth surface for security applications
SC1 - Health
SC2 - Food
SC3 - Energy
SC4 - Transport
FCD: Floating Car Data
NRT: Near Real Time
SC5 - Climate
SC6 - Social Sciences
SC7 - Security
BDE vs Hadoop distributions
| | Hortonworks | Cloudera | MapR | Bigtop | BDE |
|---|---|---|---|---|---|
| File system | HDFS | HDFS | NFS | HDFS | HDFS |
| Installation | Native | Native | Native | Native | Lightweight virtualization |
| Flexible modular architecture | No | No | No | No | Yes |
| High availability | Single failure recovery (YARN) | Single failure recovery (YARN) | Self healing, multiple failure recovery | Single failure recovery (YARN) | Failure recovery |
| Cost | Commercial | Commercial | Commercial | Free | Free |
| Scaling | Freemium | Freemium | Freemium | Free | Free |
| Addition of custom components | Not easy | No | No | No | Yes |
| Integration testing | Yes | Yes | Yes | Yes | -- |
| Operating systems | Linux | Linux | Linux | Linux | Windows/Mac/Linux |
| Management tool | Ambari | Cloudera Manager | MapR Control System | - | Docker Swarm UI + custom |
BDE vs Hadoop distributions
◎BDE is not built on top of existing distributions
◎Targets
o Communities
o Research Institutions
◎Bridges scientists and open data
◎Multi-tier research efforts towards Smart Data
Maintenance and Uptake
◎Community Driven
◎Adopters
■ Feuga, Eurostat, ILVO
■ I2cat, Vicomtech, IoF
◎Follow Up Projects
■ HOBBIT
■ Special
■ Big Ocean
■ Qrowd
Wrap up
◎Big Data Europe - Platform
o Containerized Components
o Development and Runtime Facilities
◎Show Cases
o From Components to Architectures
o Evolving Microservice Architectures

Platform introduction & Summary


Editor's Notes

  • #7: Explain Docker, Compose, and Swarm
  • #23: Viticulture - wine growing (German: Weinanbau)