SMALL BIG DATA CONGRESS 2018
Pitches | Madelon Molhoek
| Bram Poppink
| Harrie van de Vlag
| Mike Wilmer
SPEAKING FORMAT
Strict 3 minute pitch
Signal for the final minute
Alarm after 3 minutes
FORM
For active listening and
future connections
Questions and networking
after the pitches
After today pitch summary/ program booklet
with contact information
27 September 20183 | Small Big Data Congress 2018
01 – AD POPPER – XILION
A BIG DATA-DRIVEN B2B AND B2C INFRASTRUCTURE
FOR ROMANIA
4 | Small Big Data Congress 2018 27 September 2018
A BIG DATA-DRIVEN B2B and B2C INFRASTRUCTURE
FOR ROMANIA
TNO small Big Data Congress September 27, 2018
Speaker: Ad Popper MSc
Founder/Owner Xilion webservices
© Xilion webservices
1. Business accelerator
2. Assumption validator
3. Business risk agent
The Romanian market explorer apps
THE HIDDEN ABILITIES
OF BIG DATA
Small scale experience with the apps, shows that bigdata provides usable perceptions and impressions
what to expect in reality.
Similar like successful dating- and booking sites.
Scale-up the apps to a bigdata-driven infrastructure.
THE DUTCH - ROMANIAN
CONNECTION
Romania is the most emerging market in the Eastern EU.
Romania has no history, progressive, and fit to start from scratch.
Positioned as a pilot.
Dutch enterprises, who want to expand their business, are welcome to join the pilot.
THANK YOU FOR YOUR
ATTENTION
02 – MAARTEN KRUITHOF – TNO
DISCRIMINATORY BIAS IN
MACHINE LEARNING DATASETS
10 | Small Big Data Congress 2018 27 September 2018
DISCRIMINATORY BIAS IN
MACHINE LEARNING DATASETS
Dr. ir. M.C. Kruithof
DISCRIMINATORY BIAS
Bias in Machine Learning Datasets
FAIRTEST VISUALISATION
https://github.com/columbia/fairtest
$
$$$
$
$$$
$
$$$
$
$$$
WHAT CAN WE DO
Bias in Machine Learning Datasets
RepresentationData
Learn
Sensitive Property
(e.g. gender)
Predict
Information
03 – LIANNE APPEL – MAASTRICHT UNIVERSITY
ANALYZING PARTITIONED FAIR HEALTH DATA
RESPONSIBLY
15 | Small Big Data Congress 2018 27 September 2018
Current health research is unethical
Randomized clinical trial cohort Population
- Homogenous cohort:
- Similar age
- Same and single
disease
- Similar medical history
- Heterogeneous patients
- Combinations of diseases
- Diverse medical history
Analyzing partitioned FAIR
health data responsibly
Lianne Ippel, PhD
Institute of Data Science (IDS)
Lianne.ippel@maastrichtuniversity.nl
VWData - P8
Solution:
If you can’t bring the data to the research, then you have to
bring the research to the data!
model
Maastricht
study
CBS
FAIR data
station
FAIR data
station
Lianne Ippel
lianne.ippel@maastrichtuniversity.nl
Lianne Ippel
lianne.ippel@maastrichtuniversity.nl
04 – TOON ALBERS – TNO
DIGITAL TRUST FOR
CROSS-ORGANIZATIONAL DATA ANALYSIS
21 | Small Big Data Congress 2018 27 September 2018
Digital Trust for Cross-Organizational Data Analysis
Open: everyone can look at my data / my analysis algorithm.
Mandated: only selected parties can see my data / run my analysis.
Concealed: my data / my analysis algorithms must not be visible to others.
Closed: nobody can look at my data / my analysis (may not leave my premises).
Open
Mandated
Closed
Concealed
Analysis to Data
Data Analysis
Data to Analysis
Open
Mandated
Closed
Concealed
Data to Lake
Elena Lazovik, Toon Albers,
Matthijs Vonder, Bram van der Waaij
Lake
(trusted 3rd party)
Analysis to Lake
Bringing data and analysis together
05 – MARC STEEN – TNO
TRANSPARENCY IN ALGORITHMS —
CULTIVATING TECHNOMORAL VIRTUES
23 | Small Big Data Congress 2018 27 September 2018
Algo-
rithm
Honesty
Self-control
Humility
Justice
Courage
Empathy
Care
Civility
Flexibility
Perspective
Tristan Harris
Virtues: self-control; civility; humility
Kate Raworth
Virtues: justice; perspective;
empathy
Jaron Lanier
Virtues perspective;
justice; flexibility
Cathy O’Neil
Virtues: honesty; justice;
humility; courage
Yuval Noah Harari
Virtues: perspective; humility;
empathy
Sherry Turkle
Virtues: empathy; courage; civility; care
Luciano Floridi
Virtues: honesty; perspective;
flexibility; justice
Aimee van
Wynsberghe
Virtues: care; self-control; justice
Edward Snowden
Virtues: courage; magnanimity;
justice; civility
Safiya Umoja
Noble
Virtues: justice; honesty;
perspective; civility
Marc Steen, senior research scientist at TNO, marc.steen@tno.nl
responsibledatainnovation.org
dearengineerblog.wordpress.com
06 – WARD VENROOIJ – TNO
UNCERTAINTY VISUALIZATION
UNCERTAINTY VISUALIZATION
HOW TO DISPLAY UNCERTAINTY CORRECTLY, INSTEAD OF ADDING SOME MORE….
Ward Venrooij
13:00
IMAGINE… What is the chance I arrive after
13:10?
13:00
EXPERIMENT
Cognitive measures
Perceptual speed
Verbal working memory
Visual working memory
Numeracy
Personality
Conscientiousness
Extraversion
Neuroticism
Locus of Control
Need for Cogntion
Self-esteem
Expertise / Demographic
Chart expertise
Education level
Age
1. Which visual encoding of uncertain information is the most effective?
2. What is the influence of user characteristics on task performance?
• Chance of arriving after a specific time
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
12:45 13:00 13:15 12:45 13:00 13:15
12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15
12:45 13:00 13:15
VISUALIZATION TYPE
74
58
29
17
15
14
0 10 20 30 40 50 60 70 80
Dot-20
Stripe-20
Stripe-50
Density
Gradient
Error bars
Average percentage correct Q2 and Q3 in graph task
Visualizationtype
60% more
accurate than
error bars
USER CHARACTERISTICS
TAKE HOME MESSAGE
Yes, do visualize uncertainty.
But, not create more uncertainty by using the
wrong type of visualization
07 – JARON HARAMBAM – UNIVERSITY OF AMSTERDAM
WIZARD, BE MY ALGORITHM TODAY!
DEMOCRATIZING RECOMMENDER SYSTEMS
WITH ALGORITHMIC PERSONAE
Wizard, be my recommender algorithm today!
Jaron Harambam - Mykola Makhortykh - Dimitrios Bountouridis
08 – RUBEN VROMANS – TNO
DATA-DRIVEN SHARED DECISION MAKING ON
CANCER TREATMENT FOR INDIVIDUAL PATIENTS
PERSONALIZED
EXPLANATIONS
PERSONAL DATA UP-TO-DATE
DATA-DRIVEN
PERSONALIZED
DECISION AIDS
NEDERLANDSE
KANKER
REGISTRATIE
ROBOT-WRITER RECENT DATA
STATISTICAL
MODELS
SHARED DECISION MAKING
Ruben Vromans | PhD Researcher | Tilburg University | IKNL | R.D.Vromans@uvt.nl
09 – RON SNIJDERS – TNO
ONLINE DATA QUALITY ASSURANCE AT SCALE
FOR BIG DATA PIPELINES
Being Trustworthy needs Continuous Attention
Contact: ron.snijders@tno.nl
Complex Model ChainData Behavior over Time
Data Quality
Assurance
Contract Requirements
Indicators:
Trends:
!
…
Horizontal Scaling:
BREAK;
38 | Jongvee
10 – ROOS ROOIJAKKERS – PIPPLE
THE PIPPI PURPOSE OF PIPPLE:
DATA SCIENCE INSPIRED
BY PIPPI LONGSTOCKING
Solving complex problems with
mathematics and creativity
Data science
with purpose
Big on EQ and IQ!
We predict, we optimize, we innovate!
11 – GUILLERMO PEREZ – UNIVERSITY OF ANTWERP
SAFE AND UNDERSTANDABLE ARTIFICIAL INTELLIGENCE:
VERIFICATION OF MACHINE
AND REINFORCEMENT LEARNING
12 – JOOST BOSMAN – TNO
CATCHING BAD GUYS BY
DISCARDING KNOW GOOD GUYS
Bad guy ranking
Inspection
Inspected
Limited Resources
Lo
w
High % Used uninspected
Inspection result
Discarding known
good guys from
inspection results
improves
detection of the
bad guys
Discard good guys
Keep good guys
Bad guy ranking
Rank value
Often
Detection
Poor
Excellent
Inspection
Uninspected Inspected
Occurrenc
e
Never
Inspected
Limited Resources
13 – ERIK KENTIE – SURFSARA
DATA SHARING IN PUBLIC PRIVATE INITIATIVES
Surf members:
Universities
HBO’s
UMC’s
(Knowledge)
institutes
Start-Ups,
Incubators
Data hubs
Economic Boards
R&D –
departments
(EU)-Projects
Phd
Research
Associations
of Companies
Examples :
• Seed Valley (research data)
• Green Village (streaming data, IOT)
Sharing data between SURF members and Businesses
erik.kentie@surfsara.nl
Federated Identity &
Access Management
Supercomputer
Cartesius
Fast Network
Storage
Research Cloud
Innovation Lab
• Security (Auth)
• Technology (Apache/Kafka)
• Compute
• Visualisation and Analyse (Jupyter)
• Governance/Policies
• Roles (Data keeper)
• FAIR dataExpert Consultancy
Innovative ICT + pilot
14 – PEJMAN SHOEIBI OMRANI – TNO
APPLICATION OF BIG DATA
AND MACHINE LEARNING IN GEO-ENERGY
Improved production forecast
Predictive maintenance
Robust decision making
Prevent spills and accidents
Reliable , efficient and safe operation
BIG DATA IN GEO-ENERGY
Uncertainty in
subsurface
and
measurements
Abundance of
data (different
type, volume,
etc.)
Complex
operations
(end of life
time)
Experts
retirement
WHAT TO LEARN FROM OTHER INDUSTRIES?
Together we’re strong(er)
15 – SYLVIE VERBUNT – QUANTBASE
PICTURING DATA
Sbdc2018 master slidedeck-final
16 – RAJAT THOMAS – UNIVERSITY OF AMSTERDAM
DEEP LEARNING MEETS COUNTERFACTUAL INFERENCE
Deep Learning meets
Counterfactual Inference
Rajat Mani Thomas
Towards the promise of personalized medicine:
The problem
TO TREAT?
NOT TO TREAT?
Reality Fantasy
Deep Learning the embeddings
Towards the promise of personalized medicine:
The solution
Technically speaking
Embedding
KL
Divergence
Treatment
outcome
https://arxiv.org/pdf/1605.03661.pdf
Deep Neural
Networks
17 – PIETER VAN EVERDINGEN – PLDN
EXPLORING FURTHER SYNERGIES
BETWEEN BIG AND LINKED DATA
VIA GRAPH-BASED INTELLIGENCE
PITCH - PLATFORM LINKED DATA NETHERLANDS
PIETER VAN EVERDINGEN (PLDN)
Small Big Data Track – September 27th, 2018
#LinkedDataNL
Platform Linked Data Netherlands (PLDN)
Get involved
As participant/lead
• Events
• Working groups
• Publications
As sponsor
• Gold sponsor
• Silver sponsor
• Bronze sponsor
In the steering committee
Contact
Pieter van Everdingen/
Hans van Bragt
platformlinkeddatanl@gmail.com
Website
www.platformlinkeddata.nl
LinkedIn-group LOD Nederland
www.linkedin.com/groups/466278
Twitter @linkeddatanl
hashtag #linkeddatanl
Newsletter
www.pldn.nl/wiki/Nieuwsbrieven
18 – ANATOLY POSTILNIK – FIRST LINE SOFTWARE
HEALTH DATA DIRTY PROBLEM
AND WHAT WE DO ABOUT IT
Sbdc2018 master slidedeck-final
06 – WARD VENROOIJ – TNO
UNCERTAINTY VISUALIZATION
UNCERTAINTY VISUALIZATION
HOW TO DISPLAY UNCERTAINTY CORRECTLY, INSTEAD OF ADDING SOME MORE….
Ward Venrooij
13:00
IMAGINE… What is the chance I arrive after
13:10?
13:00
EXPERIMENT
Cognitive measures
Perceptual speed
Verbal working memory
Visual working memory
Numeracy
Personality
Conscientiousness
Extraversion
Neuroticism
Locus of Control
Need for Cogntion
Self-esteem
Expertise / Demographic
Chart expertise
Education level
Age
1. Which visual encoding of uncertain information is the most effective?
2. What is the influence of user characteristics on task performance?
• Chance of arriving after a specific time
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15
12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15
12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15
VISUALIZATION TYPE
74
58
29
17
15
14
0 10 20 30 40 50 60 70 80
Dot-20
Stripe-20
Stripe-50
Density
Gradient
Error bars
Average percentage correct Q2 and Q3 in graph task
Visualizationtype
60% more
accurate than
error bars
USER CHARACTERISTICS
TAKE HOME MESSAGE
Yes, do visualize uncertainty.
But, not create more uncertainty by using the
wrong type of visualization
19 - MARCEL HENRIQUEZ - RED DATA
DE LIJFSPREUK “PASSIONATE ABOUT DATA” ZICH IN
TOEPASSINGEN VERTAALT
69 | Small Big Data Congress 2018 27 September 2018
THANK YOU FOR
YOUR ATTENTION
Take a look:
TIME.TNO.NL

More Related Content

PDF
Intro to Data Science for Non-Data Scientists
PDF
"Smart Data Web: Connecting data and extracting knowledge", Prof. Dr. Hans Us...
PDF
Data Science Application in Business Portfolio & Risk Management
PDF
AXA x DSSG Meetup Sharing (Feb 2016)
PDF
A data view of the data science process
PDF
Benefiting from Big Data - A New Approach for the Telecom Industry
PDF
Big Data at #WADAY11
PPTX
Keynote Dubai
Intro to Data Science for Non-Data Scientists
"Smart Data Web: Connecting data and extracting knowledge", Prof. Dr. Hans Us...
Data Science Application in Business Portfolio & Risk Management
AXA x DSSG Meetup Sharing (Feb 2016)
A data view of the data science process
Benefiting from Big Data - A New Approach for the Telecom Industry
Big Data at #WADAY11
Keynote Dubai

Similar to Sbdc2018 master slidedeck-final (20)

PDF
Visualisatie voor transparante beslissingen - Big Data Expo 2019
PDF
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
PDF
Big Data Characteristics And Process PowerPoint Presentation Slides
PDF
TOP TEN: Big Data_ Issue 16 _ Dec 2014
PDF
Economic Challenges of Big Data
PDF
Big Data et eGovernment
PDF
Revolutionizing Big Data with AI-Driven Hybrid Soft Computing Techniques
PDF
Revolutionizing Big Data with AI-Driven Hybrid Soft Computing Techniques
PDF
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
PDF
On Big Data Analytics - opportunities and challenges
PDF
Big data and analytics - Petteri Alahuhta
PPTX
Big data and Marketing by Edward Chenard
PPTX
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
PDF
Freek bomhof tno
PDF
EMOS 2018 Big Data methods and techniques
PPTX
Presentation1 (1).pptx
PPTX
Data Visualization
PDF
WSO2Con ASIA 2016: Big Data & Analytics: Moving Beyond Hype to Insight
PPTX
Big data approaches to estimate the impact of EU funding
PPTX
Presentation1.pptx
Visualisatie voor transparante beslissingen - Big Data Expo 2019
Big Data Analytics - Best of the Worst : Anti-patterns & Antidotes
Big Data Characteristics And Process PowerPoint Presentation Slides
TOP TEN: Big Data_ Issue 16 _ Dec 2014
Economic Challenges of Big Data
Big Data et eGovernment
Revolutionizing Big Data with AI-Driven Hybrid Soft Computing Techniques
Revolutionizing Big Data with AI-Driven Hybrid Soft Computing Techniques
Algorithmic Systems Transparency and Accountability in Big Data & Cognitive Era
On Big Data Analytics - opportunities and challenges
Big data and analytics - Petteri Alahuhta
Big data and Marketing by Edward Chenard
#MarketingShake - Edward Chenard - Descubrí el poder del Big Data para Transf...
Freek bomhof tno
EMOS 2018 Big Data methods and techniques
Presentation1 (1).pptx
Data Visualization
WSO2Con ASIA 2016: Big Data & Analytics: Moving Beyond Hype to Insight
Big data approaches to estimate the impact of EU funding
Presentation1.pptx
Ad

Recently uploaded (20)

PDF
Session 11 - Data Visualization Storytelling (2).pdf
PPTX
Machine Learning and working of machine Learning
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PPTX
statsppt this is statistics ppt for giving knowledge about this topic
PPTX
eGramSWARAJ-PPT Training Module for beginners
PPTX
IMPACT OF LANDSLIDE.....................
PDF
An essential collection of rules designed to help businesses manage and reduc...
PPT
DU, AIS, Big Data and Data Analytics.ppt
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
PPTX
New ISO 27001_2022 standard and the changes
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPT
statistics analysis - topic 3 - describing data visually
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PDF
Microsoft 365 products and services descrption
PPTX
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
CYBER SECURITY the Next Warefare Tactics
Session 11 - Data Visualization Storytelling (2).pdf
Machine Learning and working of machine Learning
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
statsppt this is statistics ppt for giving knowledge about this topic
eGramSWARAJ-PPT Training Module for beginners
IMPACT OF LANDSLIDE.....................
An essential collection of rules designed to help businesses manage and reduc...
DU, AIS, Big Data and Data Analytics.ppt
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
1 hour to get there before the game is done so you don’t need a car seat for ...
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
New ISO 27001_2022 standard and the changes
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
statistics analysis - topic 3 - describing data visually
expt-design-lecture-12 hghhgfggjhjd (1).ppt
Microsoft 365 products and services descrption
chuitkarjhanbijunsdivndsijvndiucbhsaxnmzsicvjsd
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
CYBER SECURITY the Next Warefare Tactics
Ad

Sbdc2018 master slidedeck-final

  • 1. SMALL BIG DATA CONGRESS 2018 Pitches | Madelon Molhoek | Bram Poppink | Harrie van de Vlag | Mike Wilmer
  • 2. SPEAKING FORMAT Strict 3 minute pitch Signal for the final minute Alarm after 3 minutes
  • 3. FORM For active listening and future connections Questions and networking after the pitches After today pitch summary/ program booklet with contact information 27 September 20183 | Small Big Data Congress 2018
  • 4. 01 – AD POPPER – XILION A BIG DATA-DRIVEN B2B AND B2C INFRASTRUCTURE FOR ROMANIA 4 | Small Big Data Congress 2018 27 September 2018
  • 5. A BIG DATA-DRIVEN B2B and B2C INFRASTRUCTURE FOR ROMANIA TNO small Big Data Congress September 27, 2018 Speaker: Ad Popper MSc Founder/Owner Xilion webservices © Xilion webservices
  • 6. 1. Business accelerator 2. Assumption validator 3. Business risk agent The Romanian market explorer apps
  • 7. THE HIDDEN ABILITIES OF BIG DATA Small scale experience with the apps, shows that bigdata provides usable perceptions and impressions what to expect in reality. Similar like successful dating- and booking sites. Scale-up the apps to a bigdata-driven infrastructure.
  • 8. THE DUTCH - ROMANIAN CONNECTION Romania is the most emerging market in the Eastern EU. Romania has no history, progressive, and fit to start from scratch. Positioned as a pilot. Dutch enterprises, who want to expand their business, are welcome to join the pilot.
  • 9. THANK YOU FOR YOUR ATTENTION
  • 10. 02 – MAARTEN KRUITHOF – TNO DISCRIMINATORY BIAS IN MACHINE LEARNING DATASETS 10 | Small Big Data Congress 2018 27 September 2018
  • 11. DISCRIMINATORY BIAS IN MACHINE LEARNING DATASETS Dr. ir. M.C. Kruithof
  • 12. DISCRIMINATORY BIAS Bias in Machine Learning Datasets
  • 14. WHAT CAN WE DO Bias in Machine Learning Datasets RepresentationData Learn Sensitive Property (e.g. gender) Predict Information
  • 15. 03 – LIANNE APPEL – MAASTRICHT UNIVERSITY ANALYZING PARTITIONED FAIR HEALTH DATA RESPONSIBLY 15 | Small Big Data Congress 2018 27 September 2018
  • 16. Current health research is unethical Randomized clinical trial cohort Population - Homogenous cohort: - Similar age - Same and single disease - Similar medical history - Heterogeneous patients - Combinations of diseases - Diverse medical history
  • 17. Analyzing partitioned FAIR health data responsibly Lianne Ippel, PhD Institute of Data Science (IDS) Lianne.ippel@maastrichtuniversity.nl VWData - P8
  • 18. Solution: If you can’t bring the data to the research, then you have to bring the research to the data! model Maastricht study CBS FAIR data station FAIR data station Lianne Ippel lianne.ippel@maastrichtuniversity.nl
  • 20. 04 – TOON ALBERS – TNO DIGITAL TRUST FOR CROSS-ORGANIZATIONAL DATA ANALYSIS 21 | Small Big Data Congress 2018 27 September 2018
  • 21. Digital Trust for Cross-Organizational Data Analysis Open: everyone can look at my data / my analysis algorithm. Mandated: only selected parties can see my data / run my analysis. Concealed: my data / my analysis algorithms must not be visible to others. Closed: nobody can look at my data / my analysis (may not leave my premises). Open Mandated Closed Concealed Analysis to Data Data Analysis Data to Analysis Open Mandated Closed Concealed Data to Lake Elena Lazovik, Toon Albers, Matthijs Vonder, Bram van der Waaij Lake (trusted 3rd party) Analysis to Lake Bringing data and analysis together
  • 22. 05 – MARC STEEN – TNO TRANSPARENCY IN ALGORITHMS — CULTIVATING TECHNOMORAL VIRTUES 23 | Small Big Data Congress 2018 27 September 2018
  • 23. Algo- rithm Honesty Self-control Humility Justice Courage Empathy Care Civility Flexibility Perspective Tristan Harris Virtues: self-control; civility; humility Kate Raworth Virtues: justice; perspective; empathy Jaron Lanier Virtues perspective; justice; flexibility Cathy O’Neil Virtues: honesty; justice; humility; courage Yuval Noah Harari Virtues: perspective; humility; empathy Sherry Turkle Virtues: empathy; courage; civility; care Luciano Floridi Virtues: honesty; perspective; flexibility; justice Aimee van Wynsberghe Virtues: care; self-control; justice Edward Snowden Virtues: courage; magnanimity; justice; civility Safiya Umoja Noble Virtues: justice; honesty; perspective; civility Marc Steen, senior research scientist at TNO, marc.steen@tno.nl responsibledatainnovation.org dearengineerblog.wordpress.com
  • 24. 06 – WARD VENROOIJ – TNO UNCERTAINTY VISUALIZATION
  • 25. UNCERTAINTY VISUALIZATION HOW TO DISPLAY UNCERTAINTY CORRECTLY, INSTEAD OF ADDING SOME MORE…. Ward Venrooij
  • 26. 13:00 IMAGINE… What is the chance I arrive after 13:10? 13:00
  • 27. EXPERIMENT Cognitive measures Perceptual speed Verbal working memory Visual working memory Numeracy Personality Conscientiousness Extraversion Neuroticism Locus of Control Need for Cogntion Self-esteem Expertise / Demographic Chart expertise Education level Age 1. Which visual encoding of uncertain information is the most effective? 2. What is the influence of user characteristics on task performance? • Chance of arriving after a specific time 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15
  • 28. VISUALIZATION TYPE 74 58 29 17 15 14 0 10 20 30 40 50 60 70 80 Dot-20 Stripe-20 Stripe-50 Density Gradient Error bars Average percentage correct Q2 and Q3 in graph task Visualizationtype 60% more accurate than error bars
  • 30. TAKE HOME MESSAGE Yes, do visualize uncertainty. But, not create more uncertainty by using the wrong type of visualization
  • 31. 07 – JARON HARAMBAM – UNIVERSITY OF AMSTERDAM WIZARD, BE MY ALGORITHM TODAY! DEMOCRATIZING RECOMMENDER SYSTEMS WITH ALGORITHMIC PERSONAE
  • 32. Wizard, be my recommender algorithm today! Jaron Harambam - Mykola Makhortykh - Dimitrios Bountouridis
  • 33. 08 – RUBEN VROMANS – TNO DATA-DRIVEN SHARED DECISION MAKING ON CANCER TREATMENT FOR INDIVIDUAL PATIENTS
  • 34. PERSONALIZED EXPLANATIONS PERSONAL DATA UP-TO-DATE DATA-DRIVEN PERSONALIZED DECISION AIDS NEDERLANDSE KANKER REGISTRATIE ROBOT-WRITER RECENT DATA STATISTICAL MODELS SHARED DECISION MAKING Ruben Vromans | PhD Researcher | Tilburg University | IKNL | R.D.Vromans@uvt.nl
  • 35. 09 – RON SNIJDERS – TNO ONLINE DATA QUALITY ASSURANCE AT SCALE FOR BIG DATA PIPELINES
  • 36. Being Trustworthy needs Continuous Attention Contact: ron.snijders@tno.nl Complex Model ChainData Behavior over Time Data Quality Assurance Contract Requirements Indicators: Trends: ! … Horizontal Scaling:
  • 38. 10 – ROOS ROOIJAKKERS – PIPPLE THE PIPPI PURPOSE OF PIPPLE: DATA SCIENCE INSPIRED BY PIPPI LONGSTOCKING
  • 39. Solving complex problems with mathematics and creativity Data science with purpose Big on EQ and IQ! We predict, we optimize, we innovate!
  • 40. 11 – GUILLERMO PEREZ – UNIVERSITY OF ANTWERP SAFE AND UNDERSTANDABLE ARTIFICIAL INTELLIGENCE: VERIFICATION OF MACHINE AND REINFORCEMENT LEARNING
  • 41. 12 – JOOST BOSMAN – TNO CATCHING BAD GUYS BY DISCARDING KNOW GOOD GUYS
  • 43. Lo w High % Used uninspected Inspection result Discarding known good guys from inspection results improves detection of the bad guys Discard good guys Keep good guys Bad guy ranking Rank value Often Detection Poor Excellent Inspection Uninspected Inspected Occurrenc e Never Inspected Limited Resources
  • 44. 13 – ERIK KENTIE – SURFSARA DATA SHARING IN PUBLIC PRIVATE INITIATIVES
  • 45. Surf members: Universities HBO’s UMC’s (Knowledge) institutes Start-Ups, Incubators Data hubs Economic Boards R&D – departments (EU)-Projects Phd Research Associations of Companies Examples : • Seed Valley (research data) • Green Village (streaming data, IOT) Sharing data between SURF members and Businesses erik.kentie@surfsara.nl Federated Identity & Access Management Supercomputer Cartesius Fast Network Storage Research Cloud Innovation Lab • Security (Auth) • Technology (Apache/Kafka) • Compute • Visualisation and Analyse (Jupyter) • Governance/Policies • Roles (Data keeper) • FAIR dataExpert Consultancy Innovative ICT + pilot
  • 46. 14 – PEJMAN SHOEIBI OMRANI – TNO APPLICATION OF BIG DATA AND MACHINE LEARNING IN GEO-ENERGY
  • 47. Improved production forecast Predictive maintenance Robust decision making Prevent spills and accidents Reliable , efficient and safe operation BIG DATA IN GEO-ENERGY Uncertainty in subsurface and measurements Abundance of data (different type, volume, etc.) Complex operations (end of life time) Experts retirement
  • 48. WHAT TO LEARN FROM OTHER INDUSTRIES? Together we’re strong(er)
  • 49. 15 – SYLVIE VERBUNT – QUANTBASE PICTURING DATA
  • 51. 16 – RAJAT THOMAS – UNIVERSITY OF AMSTERDAM DEEP LEARNING MEETS COUNTERFACTUAL INFERENCE
  • 52. Deep Learning meets Counterfactual Inference Rajat Mani Thomas
  • 53. Towards the promise of personalized medicine: The problem TO TREAT? NOT TO TREAT? Reality Fantasy
  • 54. Deep Learning the embeddings Towards the promise of personalized medicine: The solution
  • 56. 17 – PIETER VAN EVERDINGEN – PLDN EXPLORING FURTHER SYNERGIES BETWEEN BIG AND LINKED DATA VIA GRAPH-BASED INTELLIGENCE
  • 57. PITCH - PLATFORM LINKED DATA NETHERLANDS PIETER VAN EVERDINGEN (PLDN) Small Big Data Track – September 27th, 2018 #LinkedDataNL
  • 58. Platform Linked Data Netherlands (PLDN) Get involved As participant/lead • Events • Working groups • Publications As sponsor • Gold sponsor • Silver sponsor • Bronze sponsor In the steering committee Contact Pieter van Everdingen/ Hans van Bragt platformlinkeddatanl@gmail.com Website www.platformlinkeddata.nl LinkedIn-group LOD Nederland www.linkedin.com/groups/466278 Twitter @linkeddatanl hashtag #linkeddatanl Newsletter www.pldn.nl/wiki/Nieuwsbrieven
  • 59. 18 – ANATOLY POSTILNIK – FIRST LINE SOFTWARE HEALTH DATA DIRTY PROBLEM AND WHAT WE DO ABOUT IT
  • 61. 06 – WARD VENROOIJ – TNO UNCERTAINTY VISUALIZATION
  • 62. UNCERTAINTY VISUALIZATION HOW TO DISPLAY UNCERTAINTY CORRECTLY, INSTEAD OF ADDING SOME MORE…. Ward Venrooij
  • 63. 13:00 IMAGINE… What is the chance I arrive after 13:10? 13:00
  • 64. EXPERIMENT Cognitive measures Perceptual speed Verbal working memory Visual working memory Numeracy Personality Conscientiousness Extraversion Neuroticism Locus of Control Need for Cogntion Self-esteem Expertise / Demographic Chart expertise Education level Age 1. Which visual encoding of uncertain information is the most effective? 2. What is the influence of user characteristics on task performance? • Chance of arriving after a specific time 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 08:45 09:00 09:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15 12:45 13:00 13:15
  • 65. VISUALIZATION TYPE 74 58 29 17 15 14 0 10 20 30 40 50 60 70 80 Dot-20 Stripe-20 Stripe-50 Density Gradient Error bars Average percentage correct Q2 and Q3 in graph task Visualizationtype 60% more accurate than error bars
  • 67. TAKE HOME MESSAGE Yes, do visualize uncertainty. But, not create more uncertainty by using the wrong type of visualization
  • 68. 19 - MARCEL HENRIQUEZ - RED DATA DE LIJFSPREUK “PASSIONATE ABOUT DATA” ZICH IN TOEPASSINGEN VERTAALT 69 | Small Big Data Congress 2018 27 September 2018
  • 69. THANK YOU FOR YOUR ATTENTION Take a look: TIME.TNO.NL

Editor's Notes

  • #14: Beter uitleggen met symbolen
  • #15: Stop symbol after start + Bias Free stamp
  • #17: Current health research is unethical for several reasons, Cohorts are small and homogenous When clinical trial was successful, the treatment gets accepted and handed out to the population, However, this population is very different from the cohort sample And by not taking into account someone’s lifestyle and social economic status The results of clinical trial studies cannot be trusted Because omitted variables, like income, activity measures and social network bias the coefficients of statistical analysis and conclusions are likely to be wrong Unfortunately these social and economic data are hard to get by, and discovery of true, meaningful associations are hindered by a lack of access to data that are not easy to get at
  • #19: In P8, we study access-restricted vertically partitioned data, where health data (from the maastricht study) and social economic data (from CBS) of the same individuals are stored at separate sites can be analyzed, simultaneously, without breaching privacy. Our use case is to uncover social and economic determinants of type II diabetes. To do so, we are creating a technically sound and secure infrastructure, using a Trusted Secure Environment to analyse these data. However, our future goal is to learn associations between these data sets, distributedly
  • #44: -I'm going to tell you how you guys can improve the detection of bad guys amongsy a majority of good guys -This research is applied on the inspection process within the dutch foreign nationals employment law -We want to find the bad guys who illegaly hire foreign nationals -Limited resources->There are too many companies to inspect them all -inspection on a selection of the companies -Selecion is based on ranking that is the result of knowledge and experience of the domain experts -Based on this ranking the highest ranked companies are inspected -How can we make future inspection more effective by using the obtained results -What do the inspection results tell us about the remaining uninspected set
  • #45: -We can exploit the domain knowledge hidden in the results by applying artificial intelligence -When we are training ai on only the inspection result we would need to compensate for the fact that only the highest ranked companies are inspected -However, majority of uninspected set are good guys -> why not use them as good guys? -Significant improvement achieved -But we can do even better: good guys in inspection are outliers in the sense that they look like bad guys -> discarding gives even better detection result -But we can do even better: inspected good guys look like inspected bad guys because they have been included for inspection. These are likely to be outliers, why not discard these inspected good guys. -Discarding known good guys from inspection results improves detection of the bad guys. -Thus we significantly improved bad guy detection by exploiting expert knowledge.
  • #49: Rewarding industry with risks Production problems can occur every second in remote location (requires fast reaction)
  • #60: Praatplaat voor de pitch (ongeveer 300 woorden)