SlideShare a Scribd company logo
BIG DATA EUROPE
BIG	
  DATA	
  EUROPE	
  PLATFORM	
  REQUIREMENTS	
  &	
  DRAFT	
  
ARCHITECTURE:	
  THE	
  RESULTS	
  OF	
  THE	
  ONLINE	
  SURVEY	
  
	
  
BIG DATA EUROPE WORKSHOP: THE CHALLENGES OF BIG DATA FOR
SOCIETIES IN A CHANGING WORLD	
  
MARTIN	
  KALTENBÖCK	
  (SEMANTIC	
  WEB	
  COMPANY),	
  18.11.2015	
  HTTP://WWW.BIG-­‐DATA-­‐EUROPE.EU/	
  
Integrating Big Data, Software & Communities for Addressing
Europe’s Societal Challenges
Semantic Web Company (SWC)
SWC was founded 2001, head-quartered in Vienna
30 experts in linked data technologies & textmining
Product: PoolParty Semantic Suite (launched 2009)
Serving customers from all over the world
EU- & US-based consulting services
SWC: Customers & Partners
Some of our Customers
●  Credit Suisse
●  Boehringer Ingelheim
●  Roche
●  Wolters Kluwer
●  BMJ Publishing Group
●  Red Bull Media House
●  Canadian Broadcasting Corporation (CBC)
●  Pearson
●  Council of the EU
●  DG Environment, EC
●  Healthdirect Australia
●  Ministry of Finance (Austria)
●  World Bank Group
●  Inter-American Development Bank (IADB)
●  International Atomic Energy Agency (IAEA)
●  Buildings Performance Institute Europe (BPIE)
●  Renewable Energy & Energy Efficiency P (REEEP)
●  Global Buildings Performance Network (GBPN)
●  American Physical Society
●  Education Services Australia (ESA)
●  Norwegian Directorate of Immigration
●  Australian National Data Service
Finance / Automotive / Publisher / Health Care / Public Administration / Energy / Education
Selected Partners
●  EBCONT
●  EPAM Systems
●  iQuest
●  PwC
●  Tenforce
●  OpenLink Software
●  Ontotext
●  MarkLogic
●  Gravity Zero
●  Altotech
●  Wolters Kluwer
●  Taxonomy Strategies
●  Digirati
●  Fraunhofer (IAIS)
●  University of Leipzig (INFAI)
●  The Open Data Instizute (ODI)
We all have one goal in mind: Make machines smart enough so that they can
help us to find those needles in the haystack, which are really relevant to us.
The Motivation – Big Data
Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the
world today has been created in the last two years alone.
This data comes from everywhere: sensors used to gather climate information, posts to
social media sites, digital pictures and videos, purchase transaction records, and cell phone
GPS signals to name a few.
This data is big data. Source: IBM
Big Data Dimensions
Rationale
COORDINATION
Stakeholder Engagement
(Requirements Elicitation)
SUPPORT
Design, Realise, Evaluate
Big Data Aggregator Platform
Create and Manage Societal
Big Data Interest Groups
Cloud-deployment ready
Big Data Aggregator Platform
CSA
Measures
Results
BIG DATA EUROPE
STAKERHOLDER ENGAGEMENT
& REQUIREMENTS ENGINEERING
APPROACH	
  
Integrating Big Data, Software & Communities for Addressing
Europe’s Societal Challenges
BDE Stakeholder Engagement Approach & Activities
Work Packages & Implementation Phases
Community	
  
Building
M1-­‐M12 M13-­‐M24 M25-­‐M36
Enabling	
  
Technologies
Component	
  
Integration
Uptake
Integrator	
  
Deployment
Community	
  
Assessment
WP3	
  –	
  Big	
  Data	
  Generic	
  Enabling	
  
Technologies	
  &	
  Architecture
WP5	
  –	
  Big	
  Data	
  Integrator	
  Instances
WP7	
  –	
  Dissemination	
  &	
  Communication
WP2	
  –	
  Community	
  Building	
  &	
  Requirements
WP4	
  –	
  Big	
  Data	
  Integrator	
  Platform
WP6	
  –	
  Real-­‐life	
  Deployment	
  &	
  User	
  Evaluation
Orthogonal Dimensions of Big Data Ecosystems
Generic	
  Big	
  Data	
  Enabling	
  Technologies
Data	
  Value	
  Chain
Data	
  Generation	
  
&	
  Acquisition
Data	
  Analysis	
  &	
  
Processing
Data	
  Storage	
  &	
  
Curation
Data	
  
Visualization	
  &	
  
Usage
Data-­‐driven	
  
Services
Societal	
  Challenges
Domain	
  Specific	
  Data	
  Assets	
  &	
  Technology
Healthcare
Food	
  Security
Energy
Intelligent	
  Transport
Climate	
  &	
  Environment
Inclusive	
  &	
  Reflective	
  Societies
Secure	
  Societies
Methodology of Requirements Engineering
BDE Approach & Methodology
•  BDE Core Question Matrix as a basic Tool
•  Online Survey (20.5. – 26.6.2015, 394 Participants)
•  7 x 15 Face to Face Interviews (3 x 5 per SC)
•  7 Workshops in 2015 (7 in 2016, 7 in 2017)
•  7 BDE Pilot (Use Case) ideas / specifications
Requirements
Use case
pilots
Online
survey Interviews
BDE Core Question Matrix
Elements of the RE model
Questions to people within the specific Societal Challenge
(grouped by type of interviewee)
Business Strategic Technical Domain Experts
Stories Question Question Question Question
In this element, stories which describe the current status
and future development are asked
Question Question Question Question
Personas Question Question Question Question
In this element, typical personas which play a role are
described Question Question Question Question
Data Question Question Question Question
This element is to describe the data in amount, quality,
type, usage, etc. Question Question Question Question
Technologies Question Question Question Question
In this element, the technical requirements to our specific
solution are described Question Question Question Question
Other Question Question Question Question
BDE Stakeholder Survey
The empirical methodology of
online surveys generally coincides
with problems of representativity.
Samples generated through online
surveys are regarded as biased,
especially in terms of age, sex and
education.
Additionally lower response rates
compared to other methods, self-
selection and the lack of verifiability of
demographic information provided by
the respondents do not allow to draw
conclusions beyond the sample
ascertained by the survey itself.
BDE Stakeholder Survey - Participants
Participants: sector and organisation size
BDE Stakeholder Survey - Participants
Self-definition of role in the sector
BDE Stakeholder Survey - Participants
Participation in EU funded projects
BDE Stakeholder Survey - Participants
Years of IT Experience
BDE Stakeholder Survey - Results
Importance of Volume Importance of Velocity
BDE Stakeholder Survey - Results
Importance of Variety Efficiency of Data Infrastructures
BDE Stakeholder Survey - Results
Big
Data
Volume
Velocity
Variety
Veracity
•  Not an issue
•  Would be nice to have
•  Very important
•  “mostly economic and Social Science data”
•  Not so much data
•  “Increasingly important”
•  Very important “Data inconsistencies and
ambiguities are solved before processing”
BDE Stakeholder Survey - Results
Investments in Big Data Technologies Investments per Orgaisation Size
BDE Stakeholder Survey – Results: Growth of Data Volumes
BDE Stakeholder Survey – Results: Long Term Preservation
BDE Stakeholder Survey – Results: Long Term Preservation
BDE Stakeholder Survey – Results: Long Term Preservation
¥  Long term preservation of data
o  SC6 has the infrastructure in place for long­term
preservation of data
o  “Current practice is a core service where data is held in a
central place within a national infrastructure, and secure
remote access is provided to each social research team.”
¥  Data processing
o  “We use small samples or just the “main information”’ of
data needed.”
BDE Stakeholder Survey – Results
Need of Processing Large Volumes of Data
per Organisation Size
BIG DATA EUROPE
TECHNICAL REQUIREMENTS &
ARCHITECTURE / COMPONENTS	
  
Integrating Big Data, Software & Communities for Addressing
Europe’s Societal Challenges
Blueprint of the Data Aggregator Platform
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Batch Layer
Speed Layer
Data Storage
Real-time data &
Transactions …
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Batch View
Real-time
View
messagepassing
message passing
Applications & Showcases
Real-time dashboards
Domain-specific BDE apps
Big Data Analytics
In-stream Mining
BDEPlatform&Intelligence
Input data
Stream
Spatial
Social
Statistical
Temporal
Transactional
Imagery
+ Semantic Layer
Lambda Architecture
Spark	
  master
dispatcher
Spark	
  
worker
Spark	
  
worker
Target	
  situation
Spark	
  
master
dispatcher
Spark	
  
worker
Spark	
  
worker
Worker1
Worker2
Worker3
Deployed	
  situation
Big Data solutions – technical challenges
Work	
  distributor	
  &	
  monitor Work	
  executorsWork	
  initiator
Spark	
  master
dispatcher
Spark	
  
worker
Spark	
  
worker
Target	
  situation
Spark	
  
master
dispatcher
Spark	
  
worker
Spark	
  
worker
Worker1
Worker2
Worker3
Deployed	
  situation
Big Data solutions – technical challenges
BDE platform – generic robust resource management
BDE platform – generic robust resource management
Announcements….
•  HangOut, 23.11.2015, 11.00am -12.00pm CET (SC2)
INRA’s Big Data Perspectives and Implementation Challenges
•  HangOut, 25.11.2015, 14.00pm -15.00pm CET (SC1)
Challenge of Health, Demographic Change and Wellbeing
•  HangOut, 08.12.2015, 11.00pm -12.00pm CET (SC3)
Big Data in the energy domain
•  Big Data Europe MeetUp Vienna, 15.12.2015, 16:00-19:30pm CET, LINK
•  2016 Conference on Big Data from Space, March 15, 2016, LINK
SEMANTiCS2016, early September 2016
in Leipzig, Germany, http://www.semantics.cc
EDF2016, 29-30 June 2016
Eindhoven, Netherlands, http://2016.data-forum.eu
BDE Channels for Societal Challenge 6
•  Overall Website: http://www.big-data-europe.eu
•  SC 6 Website: http://www.big-data-europe.eu/social-sciences/
•  W3C Community Group: https://www.w3.org/community/bde-societies/
•  Subscribe BDE Newsletter: http://bit.ly/1PyhXRS
Contact the BDE Societal Challenge 6 network
Domain: Ivana Ilijasic Versic (CESSDA):
ivana.versic@cessda.net
Technical: Martin Kaltenböck (Semantic Web Company):
m.kaltenboeck@semantic-web.at
Workshop 18.11. – Interactive Sessions
Session 1: Data in place in the Social
Sciences and Humanities
•  What are the most important data sources in social
sciences available / you are using (open / closed)?
•  How are the characteristics along the 4 Vs of Big Data
regarding such sources (Volume - Variety - Velocity -
Veracity)?
Session 2: Risks and Challenges of
successful data management
•  What are the most important challenges in data
management in social sciences?
•  What are the most dangerous risks you can think of
regarding data management in social sciences?
•  SWOT - Analysis
Session 3: Technological demands of data
•  What technologies are in place in your organisations?
•  What technologies are on your roadmap - or are you
evaluating at the moment?
•  What are the most critical technological issues?
Session 4: Legal and policy demands of data
•  Open Versus Closed data in social sciences?
•  What are the most important legal issues in place?
•  What needs to change regarding Policies to enable
more efficient data management in social sciences?
Martin Kaltenböck, m.kaltenboeck@semantic-web.at
Semantic Web Company GmbH
Mariahilfer Strasse 70/8, A-1070 Vienna
+43-1-4021235
http://www.semantic-web.at
http://www.poolparty-software.com
http://slideshare.net/semwebcompany
http://youtube.com/semwebcompany
Your Questions please….
www.big-data-europe.eu
27-Nov-15
#BigDataEurope

More Related Content

PPT
Big Data technology for systems monitoring in Energy – Big Data Europe
PDF
SC6 Workshop 1: What can big data do for you?
PDF
BDE Technical Webinar 1 : Pilot Instantiation
PDF
SC6 Workshop 1: From your data to data stories - BigDataEurope, SC6 Workshop
PPT
SC4 BigDataEurope - Transport Data and Technologies Sean Gaines 11.12.2015
PPT
Societal Challnge 5 and Big Data Europe 1st hangout
PPTX
SC4 BigDataEurope - Policy - Maxime Flament
PDF
SC6 Workshop 1: Big data (phenomenon) challenges and requirements in official...
Big Data technology for systems monitoring in Energy – Big Data Europe
SC6 Workshop 1: What can big data do for you?
BDE Technical Webinar 1 : Pilot Instantiation
SC6 Workshop 1: From your data to data stories - BigDataEurope, SC6 Workshop
SC4 BigDataEurope - Transport Data and Technologies Sean Gaines 11.12.2015
Societal Challnge 5 and Big Data Europe 1st hangout
SC4 BigDataEurope - Policy - Maxime Flament
SC6 Workshop 1: Big data (phenomenon) challenges and requirements in official...

What's hot (20)

PDF
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
PPT
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
PPTX
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
PPT
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
PDF
SC7 Hangout 1: Community Building and user requirements for Big Data in Secur...
PPTX
BDE SC4 Hangout - Simon Scerri, Introduction
PPTX
SC4 Hangout 1: BDE-Transport Webinar Simon Scerri
PPT
EDF2014: Talk of Ksenia Petrichenko, Building Policy Analyst, Global Building...
PPT
SC4 Workshop 1: Dave Marples: Role of social media in transport
PPTX
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
PPT
EDF2014: José Ignacio Sánchez Valdenebro, Deputy Director of Digital Public S...
PDF
BDE Webinar: SC6 - EUROPE IN A CHANGING WORLD -INCLUSIVE, INNOVATIVE AND REFL...
PPT
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
PPTX
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
PPT
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
PPT
20140521 presentation ce de mv3
PPTX
EDF2014: Franck Cotton & Kamel Gadouche, France: TeraLab - A Secure Big Data...
PPTX
EDF2014: Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme,...
PDF
BDE Technical Webinar 1 : Requirements elicitation
PPTX
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
EDF2014: Michele Vescovi, Researcher, Semantic & Knowledge Innovation Lab, It...
First online hangout SC5 - Big Data Europe first pilot-presentation-hangout
EDF2014: Kush Wadhwa, Senior Partner, Trilateral Research & Consulting: Addre...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
SC7 Hangout 1: Community Building and user requirements for Big Data in Secur...
BDE SC4 Hangout - Simon Scerri, Introduction
SC4 Hangout 1: BDE-Transport Webinar Simon Scerri
EDF2014: Talk of Ksenia Petrichenko, Building Policy Analyst, Global Building...
SC4 Workshop 1: Dave Marples: Role of social media in transport
SC4 Hangout 1: Big data europe transport webinar Philippe Crist
EDF2014: José Ignacio Sánchez Valdenebro, Deputy Director of Digital Public S...
BDE Webinar: SC6 - EUROPE IN A CHANGING WORLD -INCLUSIVE, INNOVATIVE AND REFL...
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...
SC4 Workshop 1: Simon Scerri: Existing tools and technologies
EDF2014: Taru Rastas, Senior Advisor, Ministry of Communications of Finland: ...
20140521 presentation ce de mv3
EDF2014: Franck Cotton & Kamel Gadouche, France: TeraLab - A Secure Big Data...
EDF2014: Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme,...
BDE Technical Webinar 1 : Requirements elicitation
EDF2014: Nikolaos Loutas, Manager at PwC Belgium, Business Models for Linked ...
Ad

Similar to SC6 Workshop 1: Big Data Europe platform requirements and draft architecture: The results of the online survey - SC6 Workshop (20)

PPTX
Presentation of the Big Data Europe project at the EIP Water Conference 2016 ...
PDF
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
PPTX
Introduction to: Big Data Europe Project
PPTX
BigDataEurope - Empowering Communities with Data Technologies
PDF
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
PPTX
Bde euro proworkshop
PDF
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
PDF
SC7 Workshop 1: Big Data in Secure Societies
PPTX
SC2 Workshop 1: Big Data Europe (BDE) - Project Overview & Food Workshop
PPTX
BigDataEurope: Project Introduction @ Year #1 Workshops
PPTX
BDE SC6.2 Workshop-05/12/16 - CESSDA
PPTX
BDE SC6 workshop - introduction 2016
PDF
Key Technology Trends for Big Data in Europe
PDF
Big data Europe: concept, platform and pilots
PDF
BigDataEurope @BDVA Summit2016 2: Societal Pilots
PPTX
SC1 Workshop 2 General Introduction to BDE
PDF
SC7 Workshop 2: The BigDataEurope project
PPTX
BigDataEurope Overview - Communities, Requirements & Pilots
PPTX
BDE_SC4_WS3_1_Simon Scerri - BDE Intro
PDF
SC7 Workshop 3: Big Data Europe Project
Presentation of the Big Data Europe project at the EIP Water Conference 2016 ...
Big Data Europe: Workshop 3 SC6 Social Science: THE IMPORTANCE OF METADATA & ...
Introduction to: Big Data Europe Project
BigDataEurope - Empowering Communities with Data Technologies
BDE SC3.3 Workshop - BDE review: Scope and Opportunities
Bde euro proworkshop
Apache Big_Data Europe event: "Demonstrating the Societal Value of Big & Smar...
SC7 Workshop 1: Big Data in Secure Societies
SC2 Workshop 1: Big Data Europe (BDE) - Project Overview & Food Workshop
BigDataEurope: Project Introduction @ Year #1 Workshops
BDE SC6.2 Workshop-05/12/16 - CESSDA
BDE SC6 workshop - introduction 2016
Key Technology Trends for Big Data in Europe
Big data Europe: concept, platform and pilots
BigDataEurope @BDVA Summit2016 2: Societal Pilots
SC1 Workshop 2 General Introduction to BDE
SC7 Workshop 2: The BigDataEurope project
BigDataEurope Overview - Communities, Requirements & Pilots
BDE_SC4_WS3_1_Simon Scerri - BDE Intro
SC7 Workshop 3: Big Data Europe Project
Ad

More from BigData_Europe (20)

PDF
Luigi Selmi - The Big Data Integrator Platform
PDF
Josep Maria Salanova - Introduction to BDE+SC4
PDF
Rajendra Akerkar - LeMO Project
PDF
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
PDF
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
PDF
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
PDF
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
PDF
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
PDF
BDE SC3.3 Workshop - Agenda
PDF
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
PDF
BDE SC3.3 Workshop - Data management in WT testing and monitoring
PDF
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
PDF
BDE SC3.3 Workshop - BDE Platform: Technical overview
PDF
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
PDF
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
PDF
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
PPTX
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
PPTX
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
PPTX
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
PPTX
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)
Luigi Selmi - The Big Data Integrator Platform
Josep Maria Salanova - Introduction to BDE+SC4
Rajendra Akerkar - LeMO Project
Big Data Europe SC6 WS #3: PILOT SC6: CITIZEN BUDGET ON MUNICIPAL LEVEL, Mart...
Big Data Europe SC6 WS #3: Big Data Europe Platform: Apps, challenges, goals ...
Big Data Europe SC6 WS 3: Where we are and are going for Big Data in OpenScie...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
BDE SC3.3 Workshop - Agenda
BDE SC3.3 Workshop - BDE Pilot case for Wind Turbine condition monitoring re...
BDE SC3.3 Workshop - Data management in WT testing and monitoring
BDE SC3.3 Workshop - Big Data in Wind Turbine Condition Monitoring
BDE SC3.3 Workshop - BDE Platform: Technical overview
BDE SC3.3 Workshop - Options for Wind Farm performance assessment and Power f...
BDE SC3.3 Workshop - Wind Farm Monitoring and advanced analytics
BDE SC1 Workshop 3 - BigMedilytics Overview (Supriyo Chatterjea)
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Big Data Europe (Simon Scerri)

Recently uploaded (20)

PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Lecture1 pattern recognition............
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Business Analytics and business intelligence.pdf
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
Introduction to machine learning and Linear Models
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
1_Introduction to advance data techniques.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Clinical guidelines as a resource for EBP(1).pdf
Introduction-to-Cloud-ComputingFinal.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Lecture1 pattern recognition............
Introduction to Knowledge Engineering Part 1
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
IB Computer Science - Internal Assessment.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Business Analytics and business intelligence.pdf
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Miokarditis (Inflamasi pada Otot Jantung)
Introduction to machine learning and Linear Models
Business Acumen Training GuidePresentation.pptx
1_Introduction to advance data techniques.pptx

SC6 Workshop 1: Big Data Europe platform requirements and draft architecture: The results of the online survey - SC6 Workshop

  • 1. BIG DATA EUROPE BIG  DATA  EUROPE  PLATFORM  REQUIREMENTS  &  DRAFT   ARCHITECTURE:  THE  RESULTS  OF  THE  ONLINE  SURVEY     BIG DATA EUROPE WORKSHOP: THE CHALLENGES OF BIG DATA FOR SOCIETIES IN A CHANGING WORLD   MARTIN  KALTENBÖCK  (SEMANTIC  WEB  COMPANY),  18.11.2015  HTTP://WWW.BIG-­‐DATA-­‐EUROPE.EU/   Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges
  • 2. Semantic Web Company (SWC) SWC was founded 2001, head-quartered in Vienna 30 experts in linked data technologies & textmining Product: PoolParty Semantic Suite (launched 2009) Serving customers from all over the world EU- & US-based consulting services
  • 3. SWC: Customers & Partners Some of our Customers ●  Credit Suisse ●  Boehringer Ingelheim ●  Roche ●  Wolters Kluwer ●  BMJ Publishing Group ●  Red Bull Media House ●  Canadian Broadcasting Corporation (CBC) ●  Pearson ●  Council of the EU ●  DG Environment, EC ●  Healthdirect Australia ●  Ministry of Finance (Austria) ●  World Bank Group ●  Inter-American Development Bank (IADB) ●  International Atomic Energy Agency (IAEA) ●  Buildings Performance Institute Europe (BPIE) ●  Renewable Energy & Energy Efficiency P (REEEP) ●  Global Buildings Performance Network (GBPN) ●  American Physical Society ●  Education Services Australia (ESA) ●  Norwegian Directorate of Immigration ●  Australian National Data Service Finance / Automotive / Publisher / Health Care / Public Administration / Energy / Education Selected Partners ●  EBCONT ●  EPAM Systems ●  iQuest ●  PwC ●  Tenforce ●  OpenLink Software ●  Ontotext ●  MarkLogic ●  Gravity Zero ●  Altotech ●  Wolters Kluwer ●  Taxonomy Strategies ●  Digirati ●  Fraunhofer (IAIS) ●  University of Leipzig (INFAI) ●  The Open Data Instizute (ODI) We all have one goal in mind: Make machines smart enough so that they can help us to find those needles in the haystack, which are really relevant to us.
  • 4. The Motivation – Big Data Every day, we create 2.5 quintillion bytes of data — so much that 90% of the data in the world today has been created in the last two years alone. This data comes from everywhere: sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, and cell phone GPS signals to name a few. This data is big data. Source: IBM
  • 6. Rationale COORDINATION Stakeholder Engagement (Requirements Elicitation) SUPPORT Design, Realise, Evaluate Big Data Aggregator Platform Create and Manage Societal Big Data Interest Groups Cloud-deployment ready Big Data Aggregator Platform CSA Measures Results
  • 7. BIG DATA EUROPE STAKERHOLDER ENGAGEMENT & REQUIREMENTS ENGINEERING APPROACH   Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges
  • 8. BDE Stakeholder Engagement Approach & Activities
  • 9. Work Packages & Implementation Phases Community   Building M1-­‐M12 M13-­‐M24 M25-­‐M36 Enabling   Technologies Component   Integration Uptake Integrator   Deployment Community   Assessment WP3  –  Big  Data  Generic  Enabling   Technologies  &  Architecture WP5  –  Big  Data  Integrator  Instances WP7  –  Dissemination  &  Communication WP2  –  Community  Building  &  Requirements WP4  –  Big  Data  Integrator  Platform WP6  –  Real-­‐life  Deployment  &  User  Evaluation
  • 10. Orthogonal Dimensions of Big Data Ecosystems Generic  Big  Data  Enabling  Technologies Data  Value  Chain Data  Generation   &  Acquisition Data  Analysis  &   Processing Data  Storage  &   Curation Data   Visualization  &   Usage Data-­‐driven   Services Societal  Challenges Domain  Specific  Data  Assets  &  Technology Healthcare Food  Security Energy Intelligent  Transport Climate  &  Environment Inclusive  &  Reflective  Societies Secure  Societies
  • 11. Methodology of Requirements Engineering BDE Approach & Methodology •  BDE Core Question Matrix as a basic Tool •  Online Survey (20.5. – 26.6.2015, 394 Participants) •  7 x 15 Face to Face Interviews (3 x 5 per SC) •  7 Workshops in 2015 (7 in 2016, 7 in 2017) •  7 BDE Pilot (Use Case) ideas / specifications Requirements Use case pilots Online survey Interviews
  • 12. BDE Core Question Matrix Elements of the RE model Questions to people within the specific Societal Challenge (grouped by type of interviewee) Business Strategic Technical Domain Experts Stories Question Question Question Question In this element, stories which describe the current status and future development are asked Question Question Question Question Personas Question Question Question Question In this element, typical personas which play a role are described Question Question Question Question Data Question Question Question Question This element is to describe the data in amount, quality, type, usage, etc. Question Question Question Question Technologies Question Question Question Question In this element, the technical requirements to our specific solution are described Question Question Question Question Other Question Question Question Question
  • 13. BDE Stakeholder Survey The empirical methodology of online surveys generally coincides with problems of representativity. Samples generated through online surveys are regarded as biased, especially in terms of age, sex and education. Additionally lower response rates compared to other methods, self- selection and the lack of verifiability of demographic information provided by the respondents do not allow to draw conclusions beyond the sample ascertained by the survey itself.
  • 14. BDE Stakeholder Survey - Participants Participants: sector and organisation size
  • 15. BDE Stakeholder Survey - Participants Self-definition of role in the sector
  • 16. BDE Stakeholder Survey - Participants Participation in EU funded projects
  • 17. BDE Stakeholder Survey - Participants Years of IT Experience
  • 18. BDE Stakeholder Survey - Results Importance of Volume Importance of Velocity
  • 19. BDE Stakeholder Survey - Results Importance of Variety Efficiency of Data Infrastructures
  • 20. BDE Stakeholder Survey - Results Big Data Volume Velocity Variety Veracity •  Not an issue •  Would be nice to have •  Very important •  “mostly economic and Social Science data” •  Not so much data •  “Increasingly important” •  Very important “Data inconsistencies and ambiguities are solved before processing”
  • 21. BDE Stakeholder Survey - Results Investments in Big Data Technologies Investments per Orgaisation Size
  • 22. BDE Stakeholder Survey – Results: Growth of Data Volumes
  • 23. BDE Stakeholder Survey – Results: Long Term Preservation
  • 24. BDE Stakeholder Survey – Results: Long Term Preservation
  • 25. BDE Stakeholder Survey – Results: Long Term Preservation ¥  Long term preservation of data o  SC6 has the infrastructure in place for long­term preservation of data o  “Current practice is a core service where data is held in a central place within a national infrastructure, and secure remote access is provided to each social research team.” ¥  Data processing o  “We use small samples or just the “main information”’ of data needed.”
  • 26. BDE Stakeholder Survey – Results Need of Processing Large Volumes of Data per Organisation Size
  • 27. BIG DATA EUROPE TECHNICAL REQUIREMENTS & ARCHITECTURE / COMPONENTS   Integrating Big Data, Software & Communities for Addressing Europe’s Societal Challenges
  • 28. Blueprint of the Data Aggregator Platform                         Batch Layer Speed Layer Data Storage Real-time data & Transactions …                 Batch View Real-time View messagepassing message passing Applications & Showcases Real-time dashboards Domain-specific BDE apps Big Data Analytics In-stream Mining BDEPlatform&Intelligence Input data Stream Spatial Social Statistical Temporal Transactional Imagery + Semantic Layer Lambda Architecture
  • 29. Spark  master dispatcher Spark   worker Spark   worker Target  situation Spark   master dispatcher Spark   worker Spark   worker Worker1 Worker2 Worker3 Deployed  situation Big Data solutions – technical challenges
  • 30. Work  distributor  &  monitor Work  executorsWork  initiator Spark  master dispatcher Spark   worker Spark   worker Target  situation Spark   master dispatcher Spark   worker Spark   worker Worker1 Worker2 Worker3 Deployed  situation Big Data solutions – technical challenges
  • 31. BDE platform – generic robust resource management
  • 32. BDE platform – generic robust resource management
  • 33. Announcements…. •  HangOut, 23.11.2015, 11.00am -12.00pm CET (SC2) INRA’s Big Data Perspectives and Implementation Challenges •  HangOut, 25.11.2015, 14.00pm -15.00pm CET (SC1) Challenge of Health, Demographic Change and Wellbeing •  HangOut, 08.12.2015, 11.00pm -12.00pm CET (SC3) Big Data in the energy domain •  Big Data Europe MeetUp Vienna, 15.12.2015, 16:00-19:30pm CET, LINK •  2016 Conference on Big Data from Space, March 15, 2016, LINK SEMANTiCS2016, early September 2016 in Leipzig, Germany, http://www.semantics.cc EDF2016, 29-30 June 2016 Eindhoven, Netherlands, http://2016.data-forum.eu
  • 34. BDE Channels for Societal Challenge 6 •  Overall Website: http://www.big-data-europe.eu •  SC 6 Website: http://www.big-data-europe.eu/social-sciences/ •  W3C Community Group: https://www.w3.org/community/bde-societies/ •  Subscribe BDE Newsletter: http://bit.ly/1PyhXRS Contact the BDE Societal Challenge 6 network Domain: Ivana Ilijasic Versic (CESSDA): ivana.versic@cessda.net Technical: Martin Kaltenböck (Semantic Web Company): m.kaltenboeck@semantic-web.at
  • 35. Workshop 18.11. – Interactive Sessions Session 1: Data in place in the Social Sciences and Humanities •  What are the most important data sources in social sciences available / you are using (open / closed)? •  How are the characteristics along the 4 Vs of Big Data regarding such sources (Volume - Variety - Velocity - Veracity)? Session 2: Risks and Challenges of successful data management •  What are the most important challenges in data management in social sciences? •  What are the most dangerous risks you can think of regarding data management in social sciences? •  SWOT - Analysis Session 3: Technological demands of data •  What technologies are in place in your organisations? •  What technologies are on your roadmap - or are you evaluating at the moment? •  What are the most critical technological issues? Session 4: Legal and policy demands of data •  Open Versus Closed data in social sciences? •  What are the most important legal issues in place? •  What needs to change regarding Policies to enable more efficient data management in social sciences?
  • 36. Martin Kaltenböck, m.kaltenboeck@semantic-web.at Semantic Web Company GmbH Mariahilfer Strasse 70/8, A-1070 Vienna +43-1-4021235 http://www.semantic-web.at http://www.poolparty-software.com http://slideshare.net/semwebcompany http://youtube.com/semwebcompany Your Questions please…. www.big-data-europe.eu 27-Nov-15 #BigDataEurope