SlideShare a Scribd company logo
Delegation /
Organisation
Logo
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 1
Big Data From the Space
2017 Cycle 1st Mapping Meetings
Outsourcing Partner Sp. z o.o.
Bartosz Szkudlarek
Piotr Zaborowski
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 2
We are Outsourcing Partner, a technology
company, specialized in custom software
development and Big Data.
Outsourcing Partner capabilities on Big Data
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 3
What can we bring?
Proven technology experience with common
Big Data technologies.
Outsourcing Partner capabilities on Big Data
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 4
Outsourcing Partner capabilities on Big Data
Our experience
Six projects in Big Data domain, which use
Hadoop, Apache Spark and other
technologies. Two projects for ESA where
the point was to integrated and visualize
massive data.
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 5
Outsourcing Partner capabilities on Big Data
Project name Project subject Technologies Numbers
European Space Agency
GEOSS Web Portal
Data hub portal with search functionality.
Objective of this project was to integrate
two different data sources on one
visualisation platform
HTML5, maps, microservices More than 1 mln resuls
Two different data sources.
European Space Agency
The EO Web – the new
website
Proof of concept for new content
architecture of new Earth Observation
website which collects all information
from domain services.
The primary purpose of this project to
identify and unify content elements from
all EO websites and to provide efficient
mechanism for harvesting, indexing,
categorising and searching content.
HTML5, Elastic Search, Kibana, Google
Analytics
More than 50 websites with
technical documentation about
missions instruments and other
information connected with the
area, over the 500k resources
identified.
Operational, constant dev Proof of concept Operational, complete
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 6
Outsourcing Partner capabilities on Big Data
Project name Project subject Technologies Numbers
Telecommunication sector
T-Mobile
Messaging broker
Communication exchange between
operator and customer is crucial. We
implement communication broker for
text messages (SMS, push notifications,
etc..) which allows to monitor:
• message efficiencies (how many
reminders are needed for force user
to pay delayed payments, what
message force user to buy additional
internet limit),
• message rules ( the system can not
send information about available
internet package if user order
package though any channel).
Casandra, Apache Hadoop The system handled 15 mln
customers, 3 mln message per
day.
Telecommunication sector
T-Mobile
Customer self-service system
To provide services for customers, the
telecommunication company needs to
have many backend systems to support
operations.
The aim of this project was to implement
the mechanism for collecting information
about user activities in one repository.
Except massive amount of data the
challenge was to unify information from
many domains systems.
ELC stack (Elastic Search, Kibana,
Logstash)
Operational, constant dev Proof of concept Operational, complete
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 7
Outsourcing Partner capabilities on Big Data
Project name Project subject Technologies Numbers
Betterware
Retail company
Sale support prediction
mechanism
Together with Betterware, we analyzed
the sales data and singled the sets of
products which are frequently bought by
consumers.
Apache Sparx,
Apache Hadoop,
Tableau Software
8 500 customers, 1 k orders
dally, machine learning
algorithms train on 1 mln
operations (5 years of history
data).
Insurance company
Integration of customer
databases
The aim of the project was to integrate
data about customers and their
operations stored and managed by four
different domain systems. The scope of
the project contains:- data analysis and
providing integrated domain model, -
ETL transformations programming, -
visualization of data based on Tableau
Software
Tableau Software,
Amazon AWS
4 domain system, more than
30 unified domain objects.
Operational, constant dev Proof of concept Operational, complete
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 8
Outsourcing Partner capabilities on Big Data
Project name Project subject Technologies Numbers
Electoral Committee Candidate
for President of the Republic
Media monitoring
During the presidential election in 2015
in Poland we monitored social media
(Facebook, Twitter, Youtube) and digital
newspapers.
From data fetched from social media we
prepared reports of popularity of
particular candidates, sentiment of
comments connected with candidates
and leaders of communities (blog
authors, influencers), we built algorithm
estimates trending phrases for political
domain.
Apache Hadoop,
Apache Spark,
HTML5 reports
Operational, constant dev Proof of concept Operational, complete
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 9
Comments on Big Data from Space (OSP)
• Security and legal recommendations should be defined if applicable
• 4.4 Services and data location with legal consequences policy is not referenced.
Harmonisation should clarify strategy and policy towards data localisation and
promoted licensing models technologies.
• Services reliability
• 4.5.4.6 suitable services reliability or reproducibility for industrial development.
Availability model should be applied (like in the Ground Segment) for platforms
exposed to crowdsource/industry to secure its business models
• Openness to other data sources
• 4.5.4.1 Some proven decision support solutions base on combining satellite data and
other data sinks, thus architecture supporting data integration should be considered.
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 10
Comments on Big Data from Space (OSP)
• Consider exchangeability aspect
• 4.5.X.1 Interoperability and exchangeability can be one of the strategy dimension in
cross domain data flow.
• Consider architectural influence of data organisational spread on usability (technical)
• 4.5.2.1 For data organisation (like CDM) shredding policy should be aligned to current
and potential requirements. Solution should enable generic interfaces be build in
awareness of underlying data distribution while not infrastructure.
• Openness vs predictability on provided platforms
• 4.5.3.1 orchestration and prioritisation: in shared environment extensive experiments
may coexists with operational periodic/stream analytics that should not be
depredated.
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 11
OSP suggestions for Big Data from Space Roadmap
Apart from precise needs and solutions mapping we suggest consideration of
following.
• Standardisation advisory body constituted for new/ongoing initiatives would
enable natural alignment to process and consider new approaches.
• Services and technologies catalogue of state of the art, recommended and
applying setup for members and industry review.
• Layered architecture of systems should be proposed and adopted with common
interfaces to enable interoperability, relocations, third party added value services
development - with respect of blurred borders and dependencies.
• Federalisation tactics should be consolidated.
• Industry-related, legal and security policies and strategies should be defined.
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 12
Conclusions on Big Data from Space from OSP
The most valuable Big Data projects came from
interdisciplinary teams which can juggle data from many
different data sources
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 13
Conclusions on Big Data from Space (OSP)
Data Scientists are mostly
mathematicians and physics.
Significant part of them start
experiments from sample
databases such us IRIS or Lena.
Why can't they use the Agency
resources?
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 14
Conclusions (OSP)
As SME with long SW and big data domain we recognise following challenges in
unlocking data potential according to 5.2 European Strategic Interests:
• High entry threshold - data is closed for non-domain industry companies and
research units.
• Current ESA big data exploitation projects are silo – there is no collaboration and
competition, no place for processing workflow,
• There is (possibly) evaluation gap – resources managed by the Agency are
valuable but unevaluated, there are no (not many) mechanism for collecting
community feedback and evolve,
• Great data and services are of undefined reliability and partly unpredictible
Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 15
Conclusions (OSP)
Useful tools to deal with pitfalls of Big Data exploitation:
• Focusing on the potential customers the Agency should put an effort promoting
and exposing the value of the data,
• Data platform should be as open & simple as possible – the Open Data principle,
• Implement mechanisms of collaboration; define subsets, rate&evaluate, share:
ideas, experiments, results, extend, finally create processing chain,
• Deliver reliable services meeting industry needs or enable commercial
federalisation/transition to business of value added services

More Related Content

PPT
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
PDF
High Performance Data Analysis (HPDA): HPC - Big Data Convergence
PPT
Data as a service
PDF
Data-as-a-Service: DataGraft
PDF
Open data presentation 2013 v0 5
PDF
Accelerating Time to Research Using CloudBank
PDF
Continuous Intelligence: Keeping your AI Application in Production
PDF
20141030 LinDA Workshop echallenges2014 - LinDA project overview
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
High Performance Data Analysis (HPDA): HPC - Big Data Convergence
Data as a service
Data-as-a-Service: DataGraft
Open data presentation 2013 v0 5
Accelerating Time to Research Using CloudBank
Continuous Intelligence: Keeping your AI Application in Production
20141030 LinDA Workshop echallenges2014 - LinDA project overview

What's hot (19)

PDF
HPC Trends for 2017
PDF
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
PDF
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
PDF
Policy Cloud Data Driven - Technical overview
PDF
"Cerved - A business perspective"
PPTX
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
PDF
proDataMarket presentation at "Spatial Data on The Web"
PDF
proDataMarket presentation at "Linked Data Europe: Big Geospatial Data"
PDF
P. Struijs, Toward the Use of Big Data for European Statistics
PDF
DataGraft: Data-as-a-Service for Open Data
PDF
proDataMarket presentation at "European Data Forum"
PPTX
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
PPTX
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
PDF
Open Data Presentation v1.3 - Nov 2014
PPTX
Sdn in big data
PDF
Industry@RuleML2015 DataGraft
PDF
Leveraging Graphs for Better AI
PDF
Graph Databases and Graph Data Science in Neo4j
ODP
Census Hub Project
HPC Trends for 2017
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
Industry@RuleML2015: Norwegian State of Estate A Reporting Service for the St...
Policy Cloud Data Driven - Technical overview
"Cerved - A business perspective"
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
proDataMarket presentation at "Spatial Data on The Web"
proDataMarket presentation at "Linked Data Europe: Big Geospatial Data"
P. Struijs, Toward the Use of Big Data for European Statistics
DataGraft: Data-as-a-Service for Open Data
proDataMarket presentation at "European Data Forum"
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Open Data Presentation v1.3 - Nov 2014
Sdn in big data
Industry@RuleML2015 DataGraft
Leveraging Graphs for Better AI
Graph Databases and Graph Data Science in Neo4j
Census Hub Project
Ad

Viewers also liked (20)

PDF
PDU 214 Methods of Observation & Interviewing: Observation - Methods & Record...
PDF
Sherri's Ministry Bio - Final
PDF
Faixa de Areia Brasil - B - Documentary film
PPTX
Happy Valentine's Day 2017
DOC
Nuevo documento de microsoft word
PDF
Manual eventos civicos (este archivo es muy necesario en nuestros C.T. me lo ...
PDF
KHÁCH HÀNG TIỀM NĂNG ĐẾN TỪ ĐÂU
PPTX
Costoss y gastos
PDF
Benefícios e desafios que Big Data & Analytics traz para as empresas na jorna...
PDF
2017, l'année de_l'action_par_excellence
PPTX
Diplomado en gestion de proyectos e – lerning
PDF
3 Big Data Trends for 2017
PDF
20170126 big data processing
PDF
Data Mining, Predictive Analytics and Big Data - Course information Spring 2017
PPTX
Uji perbedaan ayda tri_valen_virdya
PPT
Statistik deskriptif(1)
PPTX
5 facts everyone should know about big data presentation
PDF
The importance of data
PDF
Analisis Studi Kelayakan Bisnis
KEY
Big Data Trends
PDU 214 Methods of Observation & Interviewing: Observation - Methods & Record...
Sherri's Ministry Bio - Final
Faixa de Areia Brasil - B - Documentary film
Happy Valentine's Day 2017
Nuevo documento de microsoft word
Manual eventos civicos (este archivo es muy necesario en nuestros C.T. me lo ...
KHÁCH HÀNG TIỀM NĂNG ĐẾN TỪ ĐÂU
Costoss y gastos
Benefícios e desafios que Big Data & Analytics traz para as empresas na jorna...
2017, l'année de_l'action_par_excellence
Diplomado en gestion de proyectos e – lerning
3 Big Data Trends for 2017
20170126 big data processing
Data Mining, Predictive Analytics and Big Data - Course information Spring 2017
Uji perbedaan ayda tri_valen_virdya
Statistik deskriptif(1)
5 facts everyone should know about big data presentation
The importance of data
Analisis Studi Kelayakan Bisnis
Big Data Trends
Ad

Similar to Mapping presentation THAG big data from space (20)

PDF
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
PDF
The Big Data Importance – Tools and their Usage
PPSX
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
PPTX
Dublinked tech workshop_15_dec2011
DOC
Complete-SRS.doc
PDF
R180305120123
PPTX
Standard Safeguarding Dataset - overview for CSCDUG.pptx
PDF
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
PDF
Data & Analytics Framework: how public sector can profit from its immense ass...
PDF
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
PPTX
Big and fast data strategy 2017 jr
PDF
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
PPTX
Big data analytics
PDF
Future of Data Strategy (ASEAN)
PDF
GERSIS INDUSTRY CASES
PDF
Certified Big Data Science Analyst (CBDSA)
PDF
Big Data Analytics Research Report
PDF
The Underutilization of GIS technologies - Q&A with Shane Barrett
PDF
Memory Management in BigData: A Perpective View
PDF
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization
Social Media Market Trender with Dache Manager Using Hadoop and Visualization...
The Big Data Importance – Tools and their Usage
Platform for Big Data Analytics and Visual Analytics: CSIRO use cases. Februa...
Dublinked tech workshop_15_dec2011
Complete-SRS.doc
R180305120123
Standard Safeguarding Dataset - overview for CSCDUG.pptx
ESSnet Big Data WP8 Methodology (+ Quality, +IT)
Data & Analytics Framework: how public sector can profit from its immense ass...
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...
Big and fast data strategy 2017 jr
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
Big data analytics
Future of Data Strategy (ASEAN)
GERSIS INDUSTRY CASES
Certified Big Data Science Analyst (CBDSA)
Big Data Analytics Research Report
The Underutilization of GIS technologies - Q&A with Shane Barrett
Memory Management in BigData: A Perpective View
DAMA Webinar: Turn Grand Designs into a Reality with Data Virtualization

Recently uploaded (20)

PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Sustainable Sites - Green Building Construction
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
Digital Logic Computer Design lecture notes
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
composite construction of structures.pdf
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Well-logging-methods_new................
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
additive manufacturing of ss316l using mig welding
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Sustainable Sites - Green Building Construction
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Model Code of Practice - Construction Work - 21102022 .pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Digital Logic Computer Design lecture notes
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
composite construction of structures.pdf
Embodied AI: Ushering in the Next Era of Intelligent Systems
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Well-logging-methods_new................
Lesson 3_Tessellation.pptx finite Mathematics
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
CH1 Production IntroductoryConcepts.pptx
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
additive manufacturing of ss316l using mig welding
OOP with Java - Java Introduction (Basics)
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx

Mapping presentation THAG big data from space

  • 1. Delegation / Organisation Logo Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 1 Big Data From the Space 2017 Cycle 1st Mapping Meetings Outsourcing Partner Sp. z o.o. Bartosz Szkudlarek Piotr Zaborowski
  • 2. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 2 We are Outsourcing Partner, a technology company, specialized in custom software development and Big Data. Outsourcing Partner capabilities on Big Data
  • 3. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 3 What can we bring? Proven technology experience with common Big Data technologies. Outsourcing Partner capabilities on Big Data
  • 4. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 4 Outsourcing Partner capabilities on Big Data Our experience Six projects in Big Data domain, which use Hadoop, Apache Spark and other technologies. Two projects for ESA where the point was to integrated and visualize massive data.
  • 5. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 5 Outsourcing Partner capabilities on Big Data Project name Project subject Technologies Numbers European Space Agency GEOSS Web Portal Data hub portal with search functionality. Objective of this project was to integrate two different data sources on one visualisation platform HTML5, maps, microservices More than 1 mln resuls Two different data sources. European Space Agency The EO Web – the new website Proof of concept for new content architecture of new Earth Observation website which collects all information from domain services. The primary purpose of this project to identify and unify content elements from all EO websites and to provide efficient mechanism for harvesting, indexing, categorising and searching content. HTML5, Elastic Search, Kibana, Google Analytics More than 50 websites with technical documentation about missions instruments and other information connected with the area, over the 500k resources identified. Operational, constant dev Proof of concept Operational, complete
  • 6. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 6 Outsourcing Partner capabilities on Big Data Project name Project subject Technologies Numbers Telecommunication sector T-Mobile Messaging broker Communication exchange between operator and customer is crucial. We implement communication broker for text messages (SMS, push notifications, etc..) which allows to monitor: • message efficiencies (how many reminders are needed for force user to pay delayed payments, what message force user to buy additional internet limit), • message rules ( the system can not send information about available internet package if user order package though any channel). Casandra, Apache Hadoop The system handled 15 mln customers, 3 mln message per day. Telecommunication sector T-Mobile Customer self-service system To provide services for customers, the telecommunication company needs to have many backend systems to support operations. The aim of this project was to implement the mechanism for collecting information about user activities in one repository. Except massive amount of data the challenge was to unify information from many domains systems. ELC stack (Elastic Search, Kibana, Logstash) Operational, constant dev Proof of concept Operational, complete
  • 7. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 7 Outsourcing Partner capabilities on Big Data Project name Project subject Technologies Numbers Betterware Retail company Sale support prediction mechanism Together with Betterware, we analyzed the sales data and singled the sets of products which are frequently bought by consumers. Apache Sparx, Apache Hadoop, Tableau Software 8 500 customers, 1 k orders dally, machine learning algorithms train on 1 mln operations (5 years of history data). Insurance company Integration of customer databases The aim of the project was to integrate data about customers and their operations stored and managed by four different domain systems. The scope of the project contains:- data analysis and providing integrated domain model, - ETL transformations programming, - visualization of data based on Tableau Software Tableau Software, Amazon AWS 4 domain system, more than 30 unified domain objects. Operational, constant dev Proof of concept Operational, complete
  • 8. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 8 Outsourcing Partner capabilities on Big Data Project name Project subject Technologies Numbers Electoral Committee Candidate for President of the Republic Media monitoring During the presidential election in 2015 in Poland we monitored social media (Facebook, Twitter, Youtube) and digital newspapers. From data fetched from social media we prepared reports of popularity of particular candidates, sentiment of comments connected with candidates and leaders of communities (blog authors, influencers), we built algorithm estimates trending phrases for political domain. Apache Hadoop, Apache Spark, HTML5 reports Operational, constant dev Proof of concept Operational, complete
  • 9. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 9 Comments on Big Data from Space (OSP) • Security and legal recommendations should be defined if applicable • 4.4 Services and data location with legal consequences policy is not referenced. Harmonisation should clarify strategy and policy towards data localisation and promoted licensing models technologies. • Services reliability • 4.5.4.6 suitable services reliability or reproducibility for industrial development. Availability model should be applied (like in the Ground Segment) for platforms exposed to crowdsource/industry to secure its business models • Openness to other data sources • 4.5.4.1 Some proven decision support solutions base on combining satellite data and other data sinks, thus architecture supporting data integration should be considered.
  • 10. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 10 Comments on Big Data from Space (OSP) • Consider exchangeability aspect • 4.5.X.1 Interoperability and exchangeability can be one of the strategy dimension in cross domain data flow. • Consider architectural influence of data organisational spread on usability (technical) • 4.5.2.1 For data organisation (like CDM) shredding policy should be aligned to current and potential requirements. Solution should enable generic interfaces be build in awareness of underlying data distribution while not infrastructure. • Openness vs predictability on provided platforms • 4.5.3.1 orchestration and prioritisation: in shared environment extensive experiments may coexists with operational periodic/stream analytics that should not be depredated.
  • 11. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 11 OSP suggestions for Big Data from Space Roadmap Apart from precise needs and solutions mapping we suggest consideration of following. • Standardisation advisory body constituted for new/ongoing initiatives would enable natural alignment to process and consider new approaches. • Services and technologies catalogue of state of the art, recommended and applying setup for members and industry review. • Layered architecture of systems should be proposed and adopted with common interfaces to enable interoperability, relocations, third party added value services development - with respect of blurred borders and dependencies. • Federalisation tactics should be consolidated. • Industry-related, legal and security policies and strategies should be defined.
  • 12. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 12 Conclusions on Big Data from Space from OSP The most valuable Big Data projects came from interdisciplinary teams which can juggle data from many different data sources
  • 13. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 13 Conclusions on Big Data from Space (OSP) Data Scientists are mostly mathematicians and physics. Significant part of them start experiments from sample databases such us IRIS or Lena. Why can't they use the Agency resources?
  • 14. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 14 Conclusions (OSP) As SME with long SW and big data domain we recognise following challenges in unlocking data potential according to 5.2 European Strategic Interests: • High entry threshold - data is closed for non-domain industry companies and research units. • Current ESA big data exploitation projects are silo – there is no collaboration and competition, no place for processing workflow, • There is (possibly) evaluation gap – resources managed by the Agency are valuable but unevaluated, there are no (not many) mechanism for collecting community feedback and evolve, • Great data and services are of undefined reliability and partly unpredictible
  • 15. Outsourcing Partner Big Data from the Space | 21st of February 2017 | Slide 15 Conclusions (OSP) Useful tools to deal with pitfalls of Big Data exploitation: • Focusing on the potential customers the Agency should put an effort promoting and exposing the value of the data, • Data platform should be as open & simple as possible – the Open Data principle, • Implement mechanisms of collaboration; define subsets, rate&evaluate, share: ideas, experiments, results, extend, finally create processing chain, • Deliver reliable services meeting industry needs or enable commercial federalisation/transition to business of value added services