SlideShare a Scribd company logo
histoGraph 
Building a Social graph from image archives 
La science et les effets de réseau, Lyon 
24.03.2014 
www.cubrikproject.eu 
Lars Wieneke, CVCE, Luxembourg 0
The CVCE 
Lars Wieneke, CVCE, Luxembourg 1
About CUbRIK 
 European Community's Seventh Framework 
Program FP7-ICT 
 15 European partners 
 Multimedia search 
processing: Putting 
humans in the loop 
 Demos: History of Europe 
and Fashion 
Lars Wieneke, CVCE, Luxembourg 2
Point of departure 
Images as sources 
Lars Wieneke, CVCE, Luxembourg 3
Goal: Reconstructing and exploring social ties 
through historical sources 
Lars Wieneke, CVCE, Luxembourg 4
Towards the social graph 
4 pillars 
1. Close connection to the requirements of 
researchers in European Integration studies 
2. Structured and referencable repository of 
persons, events and places in time 
3. Efficient indexation process that enables the 
association of faces with identities 
4. Toolchain for analysis and visualization 
Lars Wieneke, CVCE, Luxembourg 5
Towards the social graph 
Sourcing researcher requirements 
Lars Wieneke, CVCE, Luxembourg 6 
Selection of target 
user group 
First draft of the app 
scenario 
Feedback on 
technical scope 
Exploratory 
interviews 
(daily work practices) 
Second draft of the 
app scenario 
Focus group 
(user needs and app 
scenarios) 
Feedback on 
technical feasability 
Lessons learned: 
issues and features 
Specification 
Implementation 1. 
demonstrator 
Workshop: Review of 
app and features 
Revised specification 
Implementation 2. 
demonstrator 
Evaluation and test 
Stage 1 
Stage 2 
Stage 3 
Stage 4 
Stage 5 
Users 
Requirements 
Technology
Towards the social graph 
Structured repository 
Lars Wieneke, CVCE, Luxembourg 7
Towards the social graph 
Indexation process 
8 
Raw content 
High level features 
(automatic annotations) 
Conflict 
(e.g., “Image contains 
‘Romano Prodi’ ” 
? Confidence = low) 
Conflict store Conflict 
manager 
Conflict resolution 
task store 
Conflict resolution 
task: conflict, 
required skill, priority, .. 
CUbRIK app 
for Conflict 
resolution 
Game Crowdtask Q&A 
Lars Wieneke, CVCE, Luxembourg
9 
Towards the social graph 
Indexation process II 
 Human in the loop added value: 
 Verification of identities/places/events ambiguous and temporal only 
possible by putting humans in the loop 
 Integration of multiple perspectives 
 CUbRIK as an open toolbox allows 
follow-up and extension through 
third parties 
 “Vertical” integration: 
GUI, components, crowdsourcing 
integrated in a platform 
Lars Wieneke, CVCE, Luxembourg
10 
Towards the social graph 
Visualization and analysis 
Lars Wieneke, CVCE, Luxembourg
11 
Challenges & Approach 
 Main challenges 
 Detection and identification of identities/places/events in time 
 Verification of identities/places/events in time 
 Analysis of relationships (e.g. co-occurrences) 
 Rights aware crawling and storage 
 Verification of provenance and license information 
 Truth and provenance 
 Approach 
 Crowd-sourced verification of detected faces (false positives/negatives) 
 Verification of identities through/places/events in time social networks of 
experts 
 Visual knowledge discovery/exploration 
 Integrated rights aware crawling and storage 
 Integrated license and provenance management 
27/11/2013 Lars Wieneke, CVCE, Luxembourg
Towards the social graph 
Bringing it all together 
Social Graph Network Analysis 
CROWD 
Research 
Inquiry 
Graph 
Visualizat ion 
Lars Wieneke, CVCE, Luxembourg 12 
Image Indexation 
Media 
harvest ing 
and upload 
Face 
detect ion 
Face 
ident ifi cat ion 
Clickworkers 
Crowd Face 
posit ion 
validat ion 
Copyright 
aware 
crawler 
Provenance 
checker 
License 
checker 
Content 
provider 
tools 
Metadata 
Ent ity 
extract ion 
Identity reconciliation 
Ent ity 
verifi cat ion & 
annotat ion 
Ent itypedia 
Integrat ion 
CROWD 
pre-fi 
ltering 
Text Indexation 
Connect ion 
to the CVCE 
collect ion 
Ent ity 
anntat ion and 
extract ion 
Expert Crowd 
Expert 
CROWD 
verifi cat ion 
Ent itypedia 
Integrat ion 
CROWD Research Inquieries 
Expert Crowd 
Graph 
Visualizat ion 
Analysis of 
the social 
graph 
Graph Query (old: Query for Entities) 
Graph 
Visualizat ion 
Query for 
ent it ies 
Context Expander 
Expansion 
through 
documents 
Expansion 
through 
videos 
Expansion 
through 
images 
Expansion 
through 
related 
ent it ies 
Social Graph construction 
Social 
Graph 
Creat ion 
Content Analysis and 
Enrichment 
Querying 
Feedback acquisition 
and processing 
Y3 component 
Query for 
spat ial 
constraints 
EXP through 
SIMILAR 
images 
WP Event 
detect ion
histoGraph demo 
Time for a Demo! 
Lars Wieneke, CVCE, Luxembourg 13
Pipelining the CUbRIK components: 
Human input from click-workers 
Great choice for simple tasks: 
 Face detection: false positives, false negatives 
 Monetary motivation, via www.microtask.com 
Poor performance on complex tasks: 
 Low resolution images 
 Different angles etc. 
 Actors recurring over time 
Lars Wieneke, CVCE, Luxembourg 14
Pipelining the CUbRIK components: 
Human input from experts 
Capable of complex tasks: 
 In-depth knowledge of key actors 
 Context knowledge allows inferences 
But: Different motivational models! 
 Public goods 
 Reputation 
Lars Wieneke, CVCE, Luxembourg 15
Usage for historians 
 No one truth in history but interpretation, 
context and discussion 
 Therefore need to represent ambivalence, 
contradictions and discussion 
 Close ties between data representation (Social 
graph) and their original context (primary 
sources) 
Lars Wieneke, CVCE, Luxembourg 16
Conclusion 
 Challenges 
 What is truth? Humanities vs. Computer Science 
 Gathering requirements for tools that haven‘t been 
developed yet 
 Engaging crowds 
 Image copyrights 
 Scientific value? 
 Refinement of the application 
 Additional datasources 
 Improvement of the interface 
 Integration of the new components 
Lars Wieneke, CVCE, Luxembourg 17
Outlook 
CUbRIK Presentation 18
Outlook 
CUbRIK Presentation 19
Outlook histoGraph 
CUbRIK Presentation 20
Visit us on 
WWW.CUBRIKPROJECT.EU 
@CUBRIKPROJECT 
CUbRIK Presentation 21 
Or follow us on Twitter

More Related Content

PDF
histoGraph: a case study in Digital Humanities
PDF
Mwera 2014 middleschool_schoolculture_discipline_final_paper
PPT
18019469 history-as-a-discipline
PPTX
Humanist machine interaction for the digital humanities
PPTX
Silviu
PDF
TS Quick Start Guide2013
PPTX
Silviu
PPTX
TEI Conference - CVCE
histoGraph: a case study in Digital Humanities
Mwera 2014 middleschool_schoolculture_discipline_final_paper
18019469 history-as-a-discipline
Humanist machine interaction for the digital humanities
Silviu
TS Quick Start Guide2013
Silviu
TEI Conference - CVCE

Viewers also liked (14)

PPTX
CUbRIK Summer School RHodes histoGraph
PPTX
History of Europe demo at IEEE MMSP 2013
PDF
Algebra de baldor by. aimb
PPTX
Europe’s Beginnings through the Looking Glass: Publishing Historical Document...
PPTX
DH2013: Lars Wieneke – Workshop introduction
PPT
DH2013: Christine Sauter – Results of the task force
PDF
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
PDF
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
PPT
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
PPT
DH2013: Julia Fallon – Legal aspects of UGC
PPTX
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
PPTX
MyPublications: Enabling personal authoring and narrative making
PDF
KIK Skpp8(1) INNOTECH-S
PPTX
майстер клас
CUbRIK Summer School RHodes histoGraph
History of Europe demo at IEEE MMSP 2013
Algebra de baldor by. aimb
Europe’s Beginnings through the Looking Glass: Publishing Historical Document...
DH2013: Lars Wieneke – Workshop introduction
DH2013: Christine Sauter – Results of the task force
DH2013: Stuart Dunn - An emerging field(?): defining the fundamentals of huma...
DH2013: Marion Dupeyrat – Interacting with audiences: overview of participato...
DH2013: Ad Pollé – Europeana 1914-18 & Europeana 1989
DH2013: Julia Fallon – Legal aspects of UGC
Text Encoding and Enrichment for Linguistic Analysis: Archives on the policy ...
MyPublications: Enabling personal authoring and narrative making
KIK Skpp8(1) INNOTECH-S
майстер клас
Ad

Similar to HistoGraph presentation Insa de Lyon (20)

PDF
histoGraph for historians
PPTX
(Industry track) "Interactive networks for digital cultural heritage collecti...
PDF
Humanist machine interaction with histoGraph
PDF
Building a social graph for the history of Europe: the CUbRIK histoGraph
PPTX
So human presentation
PDF
CUbRIK Social Graph Visual Interface
PDF
The CUbRIK histoGraph Factsheet
PPTX
Data visualization through network graphing
PDF
A Semantic Search Approach to Task-Completion Engines
PPTX
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
PPTX
The Web of Data: do we actually understand what we built?
PDF
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
PPTX
EDBT 2015: Summer School Overview
PDF
Mapping big data science
PDF
Improving Online Deliberation with Argument Network Visualization
PDF
DeLiddo&BuckinghamShum-e-Part2014
PDF
ESWC 2015 - EU Networking Session
PPT
A Picture Is Worth A Thousand Questions Docx
PPTX
The Sociology of Nothingness: Challenges of Big Data
PDF
Digital Humanities and “Digital” Social Sciences
histoGraph for historians
(Industry track) "Interactive networks for digital cultural heritage collecti...
Humanist machine interaction with histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraph
So human presentation
CUbRIK Social Graph Visual Interface
The CUbRIK histoGraph Factsheet
Data visualization through network graphing
A Semantic Search Approach to Task-Completion Engines
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
The Web of Data: do we actually understand what we built?
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014
EDBT 2015: Summer School Overview
Mapping big data science
Improving Online Deliberation with Argument Network Visualization
DeLiddo&BuckinghamShum-e-Part2014
ESWC 2015 - EU Networking Session
A Picture Is Worth A Thousand Questions Docx
The Sociology of Nothingness: Challenges of Big Data
Digital Humanities and “Digital” Social Sciences
Ad

Recently uploaded (20)

PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
Sciences of Europe No 170 (2025)
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
bbec55_b34400a7914c42429908233dbd381773.pdf
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
neck nodes and dissection types and lymph nodes levels
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
HPLC-PPT.docx high performance liquid chromatography
ECG_Course_Presentation د.محمد صقران ppt
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Taita Taveta Laboratory Technician Workshop Presentation.pptx
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
The KM-GBF monitoring framework – status & key messages.pptx
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Biophysics 2.pdffffffffffffffffffffffffff
Phytochemical Investigation of Miliusa longipes.pdf
Sciences of Europe No 170 (2025)

HistoGraph presentation Insa de Lyon

  • 1. histoGraph Building a Social graph from image archives La science et les effets de réseau, Lyon 24.03.2014 www.cubrikproject.eu Lars Wieneke, CVCE, Luxembourg 0
  • 2. The CVCE Lars Wieneke, CVCE, Luxembourg 1
  • 3. About CUbRIK  European Community's Seventh Framework Program FP7-ICT  15 European partners  Multimedia search processing: Putting humans in the loop  Demos: History of Europe and Fashion Lars Wieneke, CVCE, Luxembourg 2
  • 4. Point of departure Images as sources Lars Wieneke, CVCE, Luxembourg 3
  • 5. Goal: Reconstructing and exploring social ties through historical sources Lars Wieneke, CVCE, Luxembourg 4
  • 6. Towards the social graph 4 pillars 1. Close connection to the requirements of researchers in European Integration studies 2. Structured and referencable repository of persons, events and places in time 3. Efficient indexation process that enables the association of faces with identities 4. Toolchain for analysis and visualization Lars Wieneke, CVCE, Luxembourg 5
  • 7. Towards the social graph Sourcing researcher requirements Lars Wieneke, CVCE, Luxembourg 6 Selection of target user group First draft of the app scenario Feedback on technical scope Exploratory interviews (daily work practices) Second draft of the app scenario Focus group (user needs and app scenarios) Feedback on technical feasability Lessons learned: issues and features Specification Implementation 1. demonstrator Workshop: Review of app and features Revised specification Implementation 2. demonstrator Evaluation and test Stage 1 Stage 2 Stage 3 Stage 4 Stage 5 Users Requirements Technology
  • 8. Towards the social graph Structured repository Lars Wieneke, CVCE, Luxembourg 7
  • 9. Towards the social graph Indexation process 8 Raw content High level features (automatic annotations) Conflict (e.g., “Image contains ‘Romano Prodi’ ” ? Confidence = low) Conflict store Conflict manager Conflict resolution task store Conflict resolution task: conflict, required skill, priority, .. CUbRIK app for Conflict resolution Game Crowdtask Q&A Lars Wieneke, CVCE, Luxembourg
  • 10. 9 Towards the social graph Indexation process II  Human in the loop added value:  Verification of identities/places/events ambiguous and temporal only possible by putting humans in the loop  Integration of multiple perspectives  CUbRIK as an open toolbox allows follow-up and extension through third parties  “Vertical” integration: GUI, components, crowdsourcing integrated in a platform Lars Wieneke, CVCE, Luxembourg
  • 11. 10 Towards the social graph Visualization and analysis Lars Wieneke, CVCE, Luxembourg
  • 12. 11 Challenges & Approach  Main challenges  Detection and identification of identities/places/events in time  Verification of identities/places/events in time  Analysis of relationships (e.g. co-occurrences)  Rights aware crawling and storage  Verification of provenance and license information  Truth and provenance  Approach  Crowd-sourced verification of detected faces (false positives/negatives)  Verification of identities through/places/events in time social networks of experts  Visual knowledge discovery/exploration  Integrated rights aware crawling and storage  Integrated license and provenance management 27/11/2013 Lars Wieneke, CVCE, Luxembourg
  • 13. Towards the social graph Bringing it all together Social Graph Network Analysis CROWD Research Inquiry Graph Visualizat ion Lars Wieneke, CVCE, Luxembourg 12 Image Indexation Media harvest ing and upload Face detect ion Face ident ifi cat ion Clickworkers Crowd Face posit ion validat ion Copyright aware crawler Provenance checker License checker Content provider tools Metadata Ent ity extract ion Identity reconciliation Ent ity verifi cat ion & annotat ion Ent itypedia Integrat ion CROWD pre-fi ltering Text Indexation Connect ion to the CVCE collect ion Ent ity anntat ion and extract ion Expert Crowd Expert CROWD verifi cat ion Ent itypedia Integrat ion CROWD Research Inquieries Expert Crowd Graph Visualizat ion Analysis of the social graph Graph Query (old: Query for Entities) Graph Visualizat ion Query for ent it ies Context Expander Expansion through documents Expansion through videos Expansion through images Expansion through related ent it ies Social Graph construction Social Graph Creat ion Content Analysis and Enrichment Querying Feedback acquisition and processing Y3 component Query for spat ial constraints EXP through SIMILAR images WP Event detect ion
  • 14. histoGraph demo Time for a Demo! Lars Wieneke, CVCE, Luxembourg 13
  • 15. Pipelining the CUbRIK components: Human input from click-workers Great choice for simple tasks:  Face detection: false positives, false negatives  Monetary motivation, via www.microtask.com Poor performance on complex tasks:  Low resolution images  Different angles etc.  Actors recurring over time Lars Wieneke, CVCE, Luxembourg 14
  • 16. Pipelining the CUbRIK components: Human input from experts Capable of complex tasks:  In-depth knowledge of key actors  Context knowledge allows inferences But: Different motivational models!  Public goods  Reputation Lars Wieneke, CVCE, Luxembourg 15
  • 17. Usage for historians  No one truth in history but interpretation, context and discussion  Therefore need to represent ambivalence, contradictions and discussion  Close ties between data representation (Social graph) and their original context (primary sources) Lars Wieneke, CVCE, Luxembourg 16
  • 18. Conclusion  Challenges  What is truth? Humanities vs. Computer Science  Gathering requirements for tools that haven‘t been developed yet  Engaging crowds  Image copyrights  Scientific value?  Refinement of the application  Additional datasources  Improvement of the interface  Integration of the new components Lars Wieneke, CVCE, Luxembourg 17
  • 21. Outlook histoGraph CUbRIK Presentation 20
  • 22. Visit us on WWW.CUBRIKPROJECT.EU @CUBRIKPROJECT CUbRIK Presentation 21 Or follow us on Twitter

Editor's Notes

  • #2: Ok, let’s start with a joke to relax you guys a bit: How can you tell that a computer scientist received communications training?
  • #11: Kern des Problems: keine automatische Funktionalität, daher CUbRIK Ansatz mit Humans in the loop Dann Bullet: open toolbox Integration verschiedener Modelle Auf Grundherausforderung beziehen Nicht nur auf Personen beziehen, allgemeiner auf Events/Orten Auf Grundlage der CUbRIK Entwicklungen… Weiterentwicklung! Neue Ansätze durch Dritte
  • #12: Kern des Problems: keine automatische Funktionalität, daher CUbRIK Ansatz mit Humans in the loop Dann Bullet: open toolbox Integration verschiedener Modelle Auf Grundherausforderung beziehen Nicht nur auf Personen beziehen, allgemeiner auf Events/Orten Auf Grundlage der CUbRIK Entwicklungen… Weiterentwicklung! Neue Ansätze durch Dritte
  • #13: Technical challenges: Vielleicht doch nach Bedeutung sortieren! Auf das gesamte Bild beziehen, Ausblick integrieren. Noch zu stark auf Identities ausgerichtet, was kann man damit machen, wenn man diese hat? Visual knowledge discovery/exploration