SlideShare a Scribd company logo
The Information Workbench
 Interacting with the Web of Data




    Peter Haase,
    Andreas Eberhart          Thanh Tran
    Ulrich Walter         Günter Ladwig
    Sebastian Godelet    Andreas Wagner
    Tobias Mathäß              Lei Zhang
    Claudiu Dragulin         Rudi Studer
The Information Workbench
• Addressing the entire lifecycle of interacting with the Web of Data
    –   Integration of data sources
    –   Content generation by the end user
    –   Provenance
    –   Search and Exploration
    –   Visualization
    –   Publishing

• Integrated management of heterogeneous                    User-         Wikipedia
  data sources                                            generated
                                                                          DBpedia, Yago
    –   Structured and unstructured
    –   Published and user-generated                                      Earthquake (Data.gov)

    –   Static and dynamic
    –   Open domain                          Structured               Dynamic
Data Sources in the Application
• Entire English Wikipedia

• Data from Linked Open Data
   – DBpedia
   – YAGO
   –…

• Data from Data.gov (US Government)
   – E.g. live data about earthquakes

• Many more
Semantic Search
• Hybrid Search: Structured queries combined
  with keywords across structured and
  unstructured data sources

• Query interpretation: Translation of keywords
  into hybrid queries

• Keyword search combined with faceted
  search: Iterative refinement process based on
  keywords and operations on facets
Living UI
• Continuous, seamless and personal user experience across domains

• Widget-based user interface

• Multiple paradigms for interaction: browsing, visualization,
   editing, knowledge acquisition

• Mashups with external sources


• Automated selection of widgets based on available data

• Customization and personalization
Demo
• http://iwb.fluidops.com/
Conclusions
• The meaning of data has to play a central role.
    – Integrated management of unstructured and structured / semantic data
    – Semantics exploited throughout the complete lifecycle of interaction with the
      data

• Various, heterogeneous information sources
    – Management of real-life data from various sources, provenance
    – Heterogenous in: structured/unstructured, static / dynamic, published / user-
      generated

• The application has to be an end-user application, i.e. an application that
  provides a practical value to domain experts.
    – Open world, open domain, provides value to end users across domains
    – Can be tailored to specific domains
    – Also applicable to enterprise scenarios: E.g. in Data Center Management
Conclusions
•   The application provides an attractive and functional Web interface
     •   Widget-based, living UI
     •   Web 2.0-like interaction


•   Functionality goes beyond pure information retrieval.
    The results should be as accurate as possible.
    •    Addressing the complete lifecycle of the interaction with the data
    •    Novel paradigms for search, enabling precise answers to complex information needs against
         hybrid data

•   There is a use of dynamic data
    •    Integration of real-time, live-data sources


•   Multi-media documents are used in some way
    •    Web 2.0-syle mashups with external sources, such as Youtube, Twitter


•   Scalability
    •    Large unstructured corpus (incl. Wikipedia), large subset of LOD
Thank You!
Platform for
                Application Building
•   Custom providers for legacy and enterprise data sources
•   Easily extensible from the backend to the UI
•   Deployment on Cloud Infrastructures
•   Open Source release planned

More Related Content

KEY
Intro to Info Arch
PDF
Sakai09 Repo Case Study
PPTX
Digital libraries
PPTX
Delivering biodiversity knowledge in the information age
PPTX
Jyoti singh
PPTX
Next Steps for IMLS's National Digital Platform
PDF
An ontology-based context aware system for Selective Dissemination of Informa...
PPTX
Scratchpads: the Virtual Research Environment for biodiversity data
Intro to Info Arch
Sakai09 Repo Case Study
Digital libraries
Delivering biodiversity knowledge in the information age
Jyoti singh
Next Steps for IMLS's National Digital Platform
An ontology-based context aware system for Selective Dissemination of Informa...
Scratchpads: the Virtual Research Environment for biodiversity data

What's hot (20)

PPTX
ECS2019 - Managing Content Types in the Modern World
PPTX
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
PPTX
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
PPTX
Introduction to the CTDA
PPT
From Records to Data with Recollection
PPTX
GBIF: An infrastructure for infrastructures
PDF
OpenMinteD Project - building a TDM infrastructure
PDF
Social Feed Manager presentation at WASAPI Symposium
PPTX
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
PPT
From Records to Data
PDF
Digital Library Initiatives in India : An Overview
PPT
Fuller Disclosure: Getting More Collections into the Network Flow
PPTX
Towards long-term preservation of linked data - the PRELIDA project
PDF
Networking Systems in Libraries
PPTX
Levels of Service for Digital Libraries
PPTX
National Digital Library
PDF
A distributed network of digital heritage information by Enno Meijers - Europ...
PPTX
Open archives initiatives(final)
PPT
non-slides-Thatcamp
ECS2019 - Managing Content Types in the Modern World
Foundations to Actions: Extending Innovations to Digital Libraries in Partner...
WORLDMAP: A SPATIAL INFRASTRUCTURE TO SUPPORT TEACHING AND RESEARCH (BROWN BA...
Introduction to the CTDA
From Records to Data with Recollection
GBIF: An infrastructure for infrastructures
OpenMinteD Project - building a TDM infrastructure
Social Feed Manager presentation at WASAPI Symposium
Exposing Library Content with the NISO Metasearch XML Gateway Protocol
From Records to Data
Digital Library Initiatives in India : An Overview
Fuller Disclosure: Getting More Collections into the Network Flow
Towards long-term preservation of linked data - the PRELIDA project
Networking Systems in Libraries
Levels of Service for Digital Libraries
National Digital Library
A distributed network of digital heritage information by Enno Meijers - Europ...
Open archives initiatives(final)
non-slides-Thatcamp
Ad

Similar to The Information Workbench - (20)

PDF
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
PDF
Linked (Open) Data
PPTX
Are you ready for BIG DATA?
PDF
The Web of Data: The W3C Semantic Web Initiative
PPTX
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
PPTX
Data commons bonazzi bd2 k fundamentals of science feb 2017
PPTX
Unit-I- Introduction- Traits of Big Data-Final.pptx
PPTX
PPT 1.1.2.pptx ehhllo hi hwi bdfhd dbdhu
PPTX
g-Social - Enhancing e-Science Tools with Social Networking Functionality
PDF
Web-Scale Discovery: Post Implementation
PDF
Bertenthal
PPTX
Breaking Down Walls in Enterprise with Social Semantics
PDF
Ircdl damico del-bimbo-meoni
PPTX
A Year in Review - Building a Comprehensive Data Management Program
PPTX
FAIRDOM data management support for ERACoBioTech Proposals
PPTX
Data.gov Overview, August 2012
PPTX
IntrO To Management Chapter 1 and 2 slid
PPTX
ISWC 2012 Keynote
PPTX
Web Engineering Process Models- An introduction.pptx
PPTX
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Sirris innovate2011 - Smart Products with smart data - introduction, Dr. Elen...
Linked (Open) Data
Are you ready for BIG DATA?
The Web of Data: The W3C Semantic Web Initiative
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Data commons bonazzi bd2 k fundamentals of science feb 2017
Unit-I- Introduction- Traits of Big Data-Final.pptx
PPT 1.1.2.pptx ehhllo hi hwi bdfhd dbdhu
g-Social - Enhancing e-Science Tools with Social Networking Functionality
Web-Scale Discovery: Post Implementation
Bertenthal
Breaking Down Walls in Enterprise with Social Semantics
Ircdl damico del-bimbo-meoni
A Year in Review - Building a Comprehensive Data Management Program
FAIRDOM data management support for ERACoBioTech Proposals
Data.gov Overview, August 2012
IntrO To Management Chapter 1 and 2 slid
ISWC 2012 Keynote
Web Engineering Process Models- An introduction.pptx
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Ad

Recently uploaded (20)

PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Cloud computing and distributed systems.
PDF
Electronic commerce courselecture one. Pdf
PDF
KodekX | Application Modernization Development
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Review of recent advances in non-invasive hemoglobin estimation
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Cloud computing and distributed systems.
Electronic commerce courselecture one. Pdf
KodekX | Application Modernization Development
“AI and Expert System Decision Support & Business Intelligence Systems”
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
cuic standard and advanced reporting.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Understanding_Digital_Forensics_Presentation.pptx
NewMind AI Monthly Chronicles - July 2025
Encapsulation_ Review paper, used for researhc scholars
20250228 LYD VKU AI Blended-Learning.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Diabetes mellitus diagnosis method based random forest with bat algorithm

The Information Workbench -

  • 1. The Information Workbench Interacting with the Web of Data Peter Haase, Andreas Eberhart Thanh Tran Ulrich Walter Günter Ladwig Sebastian Godelet Andreas Wagner Tobias Mathäß Lei Zhang Claudiu Dragulin Rudi Studer
  • 2. The Information Workbench • Addressing the entire lifecycle of interacting with the Web of Data – Integration of data sources – Content generation by the end user – Provenance – Search and Exploration – Visualization – Publishing • Integrated management of heterogeneous User- Wikipedia data sources generated DBpedia, Yago – Structured and unstructured – Published and user-generated Earthquake (Data.gov) – Static and dynamic – Open domain Structured Dynamic
  • 3. Data Sources in the Application • Entire English Wikipedia • Data from Linked Open Data – DBpedia – YAGO –… • Data from Data.gov (US Government) – E.g. live data about earthquakes • Many more
  • 4. Semantic Search • Hybrid Search: Structured queries combined with keywords across structured and unstructured data sources • Query interpretation: Translation of keywords into hybrid queries • Keyword search combined with faceted search: Iterative refinement process based on keywords and operations on facets
  • 5. Living UI • Continuous, seamless and personal user experience across domains • Widget-based user interface • Multiple paradigms for interaction: browsing, visualization, editing, knowledge acquisition • Mashups with external sources • Automated selection of widgets based on available data • Customization and personalization
  • 7. Conclusions • The meaning of data has to play a central role. – Integrated management of unstructured and structured / semantic data – Semantics exploited throughout the complete lifecycle of interaction with the data • Various, heterogeneous information sources – Management of real-life data from various sources, provenance – Heterogenous in: structured/unstructured, static / dynamic, published / user- generated • The application has to be an end-user application, i.e. an application that provides a practical value to domain experts. – Open world, open domain, provides value to end users across domains – Can be tailored to specific domains – Also applicable to enterprise scenarios: E.g. in Data Center Management
  • 8. Conclusions • The application provides an attractive and functional Web interface • Widget-based, living UI • Web 2.0-like interaction • Functionality goes beyond pure information retrieval. The results should be as accurate as possible. • Addressing the complete lifecycle of the interaction with the data • Novel paradigms for search, enabling precise answers to complex information needs against hybrid data • There is a use of dynamic data • Integration of real-time, live-data sources • Multi-media documents are used in some way • Web 2.0-syle mashups with external sources, such as Youtube, Twitter • Scalability • Large unstructured corpus (incl. Wikipedia), large subset of LOD
  • 10. Platform for Application Building • Custom providers for legacy and enterprise data sources • Easily extensible from the backend to the UI • Deployment on Cloud Infrastructures • Open Source release planned