SlideShare a Scribd company logo
THE DATA RING:
A CANVAS FOR BIG
DATA PROJECTS
September 21th - Frontiers Conference 2017
CHRISTIAN
RACCA
@pendolare
@i_realize
dati.piemonte.it
LEONARDO
CAMICIOTTI
AGENDA
i. A few words about us (TOP-IX & BIG DIVE course).
ii. BIG DATA opportunities, beyond the buzzword.
iii. Open challenges in applied Data Science.
iv. A canvas “Ring” to rule them all…
80+ Members
(15 in 2003)
NON PROFIT
CONSORTIUM
PUBLIC & PRIVATE
PARTICIPATION
MISSION
TO FOSTER
INNOVATION
BY LEVERAGING
INFRASTRUCTURE
ASSETS
EDUCATION
START-UP
CORPORATE
INNOVATION
CIVIC TECH
FUNDED
PROJECTS
IX NORTH-WEST
ITALY
DP
7 collaborators
16 employees
2 directors
TOP-IX CONSORTIUM
OUR ACTIVITIES ABOUT DATA
010100100110100100
0001010101001001
010101001001001
011101010101010
101010101010100
010101001010101
101010101010010
01010101010100
BIG DIVE HAS BEEN
DESIGNED AS AN INTENSIVE
TRAINING PROGRAM AIMED
AT BOOSTING THE TECH
SKILLS IN ORDER TO
EXTRACT VALUE FROM
DATA AND TO GENERATE
IMPACT.
010100100110100100
0001010101001001
010101001001001
011101010101010
101010101010100
010101001010101
101010101010010
01010101010100
01010101010010
010101001010101
WHAT IS BIG DIVE ?
4 weeks
20 divers
8 countries
2012
5 weeks
15 divers
5 countries
2013
5 weeks
15 divers
6 countries
2014
2015
5 weeks
19 divers
6 countries
2017
5 weeks
22 divers
5 countries
2016
5 weeks
20 divers
6 countries
THE BIG DIVE HISTORY
(BIG) DATA
OPPORTUNITIES
BEYOND THE
BUZZWORD
marketing
quantitative
technology
VS
qualitative
innovation
process
research business
THE TERM AMBIGUITY
YES, BIG DATA IS A “BUZZWORD”
Say BIG DATA again!
WHAT’S “NEW” ABOUT DATA
DATA
SKILLS TO EXTRACT
INFORMATION ARE
NOW MORE
ACCESSIBLE
INFRASTRUCTURE AS
A COMMODITY
/ Cloud
/ HPC & HPN
/ Frameworks
CULTURE & APPROACH
/ Complexity science
/ Network thinking
/ Open Innovation
DATA AVAILABILITY
/ Exponential growth
/ Machine VS human
/ Structured VS
un-structured
BIG DATA + ML = The NEW STACK
Big Data technologies are used to
handle core data engineering
challenges, and machine learning is
used to extract value from the data.
COMMON OPEN CHALLENGES
/THE DATA
/THE SKILLS
/FROM PROTOTYPE TO…
/THE RESULTS INTERPRETATION AND
THE EXPLAINABILITY ISSUE
/“GREY ZONES” IN DATA EXPLOITATION
/THE PURSUIT OF INNOVATION
DATA METADATA FEATURES
CHALLENGE #1
DATA REMAINS THE STARTING POINT
Metadata
Features Selection
Refers to the process of extracting useful
information (or features) from existing data.
“Data” that provides information about other
data.
{Descriptive, Structural, Administrative}
Volume
The effective amount of usable data.
No a-priori objective parameters.
On field validation is required.
ABOUT FEATURES…
FROM SOURCE DATA
TO RELEVANT DATA
Noisy or redundant data
makes it more difficult to
discover meaningful patterns.
High-dimensional dataset
requires more complex
models/algorithms and more
computational power.
Features “reduction” Data augmentation
Enriching existing data
with open data or through
third-party data providers.
THE DATA TEAM
CHALLENGE #2
HARD
(TECH)
SKILLS
SOFT
(HUMAN)
SKILLS
STATS,
MATH
SKILLS
SECTOR,
VERTICAL
SKILLS Danger zone
Re-arrangement of the THE DATA SCIENCE VENN DIAGRAM
by Drew Conway
CODING
SKILLS
STILL LOOKING FOR UNICORNS
FROM PROTOTYPE TO
PRODUCTION
CHALLENGE #3
A BABEL OF (CODING) LANGUAGES
PRODUCTION
JAVA, C, C++, …
DATA-DRIVEN
PROTOTYPE
PYTHON, R, D3.JS
REFACTORING
-
DATA
ENGINEERING
[ SCALA, …]
RESULTS INTERPRETATION
&
EXPLAINABILITY
CHALLENGE #4
THE EXPLAINABILITY ISSUE
LOW EXPLAINABILITY
HIGH EXPLAINABILITY
Inferential statistics
Machine learning
Deep learning
GREY ZONES
IN
DATA EXPLOITATION
CHALLENGE #5
THE DARK SIDE OF BIG DATA
/DATA DEMOCRACY VS BIG DATA
OLIGARCHY
/CORRELATION DOES NOT MEAN
CAUSATION
/BUBBLE FILTERS
/BIAS
/HUMAN RIGHTS
GRID NATIVES
VS
COMPLEX NATIVES
CHALLENGE #6
THE GRID VS THE COMPLEXITY
LEVERAGING (BIG) DATA
OPPORTUNITIES
REQUIRES
A PROPER METHOD
THE DATA RING
THE CANVAS APPROACH
The inspiring
precursor
The Data Ring
DATA RING CANVAS
DOWNLOAD, USE,
COMMENT,
REFINE IT (CC LICENSE)
Project name: Designed by: Date: Version:
(D
ata)output
Data
Infrastructure
GOAL(S)
SKI
LLS
PROCESSO
VALORIZZAZI
O
NE
TOO
LS
Data
input
Implementation
T
uning
Interpretation
VAL
UE
Execution
Planning
Data
strategy
Data skills
O
therskills
Benchmark
Metrics
Budget & timing
Outsourcing
Data
governance
Exploration
Hypothesis
Datapreparation
Dataprocessing
Validation
Iteration
Accessibility
Format
Metadata
Features
DatalakeFramework
Storage
Computing
Coding
Dataengineering
(Applied)Datascience
Dataviz
Business
Legal
Social science
Sector expertise
PRO
C
ESS
THANKS!
leonardo.camiciotti@top-ix.org
christian.racca@top-ix.org
www.top-ix.org
www.bigdive.eu
@top_ix
@bigdive_eu

More Related Content

PDF
SC7 Hangout 1: Community Building and user requirements for Big Data in Secur...
PPT
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
PDF
RMDS data science ecosystem approach
PDF
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
PPTX
LiDIA: An integration architecture to query Linked Open Data from multiple da...
PPTX
#opendata Back to the future
PDF
SC7 Hangout 1: Community Building and user requirements for Big Data in Secur...
EDF2014: BIG - NESSI Networking Session: Edward Curry, National University of...
RMDS data science ecosystem approach
AI: The next frontier by Amparo Alonso at Big Data Spain 2017
LiDIA: An integration architecture to query Linked Open Data from multiple da...
#opendata Back to the future

What's hot (17)

PDF
Introduction to FIWARE technology
PPTX
Open data for smart cities
PPTX
Data ethics and machine learning: discrimination, algorithmic bias, and how t...
PDF
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
PDF
2nd International Conference on Data Mining & Machine Learning (DMML 2021)
PPTX
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
PDF
Towards Unified and Native Enrichment in Event Processing Systems
DOCX
Conclusion
PDF
Big Data, Analytics, and Tax Fraud by D. José Borja Tomé at Big Data Spain 2017
PPTX
Open Data: Barriers, Risks, and Opportunities
PDF
Exchanging Data Agreements in the DaaS Model
PPTX
Project overview big data europe
PDF
Call for Papers - 2nd International Conference on Big Data (CBDA 2021)
PPTX
Presentation emerging tecnology
PPTX
7th International Conference on Data Mining and Database (DMDB 2020)
PDF
International Society of Service Innovation Professionals
Introduction to FIWARE technology
Open data for smart cities
Data ethics and machine learning: discrimination, algorithmic bias, and how t...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
2nd International Conference on Data Mining & Machine Learning (DMML 2021)
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
Towards Unified and Native Enrichment in Event Processing Systems
Conclusion
Big Data, Analytics, and Tax Fraud by D. José Borja Tomé at Big Data Spain 2017
Open Data: Barriers, Risks, and Opportunities
Exchanging Data Agreements in the DaaS Model
Project overview big data europe
Call for Papers - 2nd International Conference on Big Data (CBDA 2021)
Presentation emerging tecnology
7th International Conference on Data Mining and Database (DMDB 2020)
International Society of Service Innovation Professionals
Ad

Viewers also liked (11)

PPTX
Canvas
PPTX
Templates: Mapa da Empatia, Canvas da Proposta de Valor, Canvas do Modelo de ...
PDF
Memoria Seminario sobre Canvas Model com Alexander Osterwlader - by Luiz Rolim
PDF
Inovação em modelos de negócios já estabelecidos o analista de modelos de n...
PDF
Palestra Experiência do Cliente
PDF
Seminario Business Model Canvas
PDF
Construindo Produtos Inovadores
PDF
Feedback Canvas - Agile Portugal 2017
PPSX
Strategic Business Model Canvas v3
PDF
From Data to Artificial Intelligence with the Machine Learning Canvas — ODSC ...
PDF
Company Presentation: Canvas
Canvas
Templates: Mapa da Empatia, Canvas da Proposta de Valor, Canvas do Modelo de ...
Memoria Seminario sobre Canvas Model com Alexander Osterwlader - by Luiz Rolim
Inovação em modelos de negócios já estabelecidos o analista de modelos de n...
Palestra Experiência do Cliente
Seminario Business Model Canvas
Construindo Produtos Inovadores
Feedback Canvas - Agile Portugal 2017
Strategic Business Model Canvas v3
From Data to Artificial Intelligence with the Machine Learning Canvas — ODSC ...
Company Presentation: Canvas
Ad

Similar to The DATA RING - A canvas for DATA PROJECT (20)

PDF
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
PPTX
What is the concept of Big Data?
PPTX
AI Project Cycle Summary Class ninth please
PPTX
Data science Innovations January 2018
PDF
Myths and challenges in knowledge extraction and analysis from human-generate...
PPTX
What is Data Science
PPTX
Big Data.pptx
PPTX
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
PPT
Data Science in the Real World: Making a Difference
PDF
INF2190_W1_2016_public
PDF
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
PDF
Data foundation for analytics excellence
PPTX
Big dataorig
PDF
The Impact of the Data Revolution on Official Statistics: Opportunities, Chal...
PPTX
Session 01 designing and scoping a data science project
PPTX
Session 01 designing and scoping a data science project
PPTX
Data Science presentation for explanation of numpy and pandas
PPTX
Sailing on the ocean of 1s and 0s
PDF
Sogeti on big data creating clarity
PDF
Big data 1 4 vint-sogeti-on-big-data-1-of-4-creating clarity with big data
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
What is the concept of Big Data?
AI Project Cycle Summary Class ninth please
Data science Innovations January 2018
Myths and challenges in knowledge extraction and analysis from human-generate...
What is Data Science
Big Data.pptx
[PhDThesis2021] - Augmenting the knowledge pyramid with unconventional data a...
Data Science in the Real World: Making a Difference
INF2190_W1_2016_public
EDF2013: Invited Talk Julie Marguerite: Big data: a new world of opportunitie...
Data foundation for analytics excellence
Big dataorig
The Impact of the Data Revolution on Official Statistics: Opportunities, Chal...
Session 01 designing and scoping a data science project
Session 01 designing and scoping a data science project
Data Science presentation for explanation of numpy and pandas
Sailing on the ocean of 1s and 0s
Sogeti on big data creating clarity
Big data 1 4 vint-sogeti-on-big-data-1-of-4-creating clarity with big data

More from TOP-IX Consortium (19)

PDF
Hrtt 2019 workshop // TOP-IX
PDF
GARR Lightning talk, Data Science su metriche internet da M-LAB
PDF
Trust in the (BIG) DATA Era
PDF
Christian Racca's Presentation @ A bit of history 2017
PDF
Exploring BIGDIVE6 course program
PDF
Piedmont heritage
PDF
Gramsci devoted
PDF
Keep it simple
PDF
PDF
TOP-IX events in Berlin - PRESS RELEASE
PDF
Open data 4 startups (2°edition)
PDF
I Realize Lean Startup Hack-nov2010
PDF
Market Oriented Clouds: the local perspective
PDF
Top-ix Digital Media Session (View Conference 2009) - Workshop
PDF
La convergenza delle tecnologie
PDF
Marketing 2.0
PDF
Free Open Source software come risorsa per le imprese
PDF
Top Ix Dp En (2008 06 17)
PDF
web & media
Hrtt 2019 workshop // TOP-IX
GARR Lightning talk, Data Science su metriche internet da M-LAB
Trust in the (BIG) DATA Era
Christian Racca's Presentation @ A bit of history 2017
Exploring BIGDIVE6 course program
Piedmont heritage
Gramsci devoted
Keep it simple
TOP-IX events in Berlin - PRESS RELEASE
Open data 4 startups (2°edition)
I Realize Lean Startup Hack-nov2010
Market Oriented Clouds: the local perspective
Top-ix Digital Media Session (View Conference 2009) - Workshop
La convergenza delle tecnologie
Marketing 2.0
Free Open Source software come risorsa per le imprese
Top Ix Dp En (2008 06 17)
web & media

Recently uploaded (20)

PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PDF
Microsoft Core Cloud Services powerpoint
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PPTX
Introduction to Inferential Statistics.pptx
PDF
Business Analytics and business intelligence.pdf
PPT
Predictive modeling basics in data cleaning process
PPTX
modul_python (1).pptx for professional and student
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Database Infoormation System (DBIS).pptx
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PDF
annual-report-2024-2025 original latest.
PPTX
IMPACT OF LANDSLIDE.....................
PDF
Transcultural that can help you someday.
PDF
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Microsoft Core Cloud Services powerpoint
importance of Data-Visualization-in-Data-Science. for mba studnts
Introduction to Inferential Statistics.pptx
Business Analytics and business intelligence.pdf
Predictive modeling basics in data cleaning process
modul_python (1).pptx for professional and student
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Database Infoormation System (DBIS).pptx
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
annual-report-2024-2025 original latest.
IMPACT OF LANDSLIDE.....................
Transcultural that can help you someday.
OneRead_20250728_1808.pdfhdhddhshahwhwwjjaaja
retention in jsjsksksksnbsndjddjdnFPD.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx

The DATA RING - A canvas for DATA PROJECT