Linked Metadata by Design
Presented by: Shirley Bacso
Data Architect at Ingka Digital, Netherlands
A few words
about me
• Data Architect at Ingka Digital, Netherlands
• Specialize in metadata
• Love problem-solving & learning
• Graph enthusiast since June 2021
My journey
June 2021
Oct 2021
May 2022
Implementing
Business
Glossary
Ingka Graph
Event with
Neo4j
Feb 2023
Knowledge Graphs
2023, Prof. Dr.
Harald Sack
Oct 2023
DKG MVP
NeoDash
Feb 2024
Learn graph
technology
Exam case
studies
Research metadata
management
best practices
Study organization
realities
Observe people
Hunt for metadata
Hunt for metadata
Hunt for metadata
Observe people
Study organization
realities
Hunt for metadata
Observe people
Metadata – the knowledge of our data Dec 2023
Rationale
What types of questions should be answered by
the knowledge graph?
Do all the data
columns/fields have
business names?
Are the data compliant
to (retention) rules?
What are the gaps
between the data quality
dimension goals and the
reality?
Do all the databases
connected to the
software systems?
What systems does a
certain digital domain
have?
What type of personal
data does a software
system have?
Find the most relevant
dataset
(recommendation
engine) and request
access.
Which data products are
overlapping? (Think
about the Principle of
Data as a Product)
How was the data
transformed throughout
the lifetime?
(Provenance and
operational lineage)
…
Which knowledge areas should be covered ?
Collectively capture
the broader knowledge
about data
arrows.app
What concepts and relationships should be in
the knowledge graph?
The Ontology Development Process
Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
Ontology development in practice is an iterative process that repeats
continuously and improves the ontology.
What types of questions
should be answered by the
knowledge graph? Which knowledge areas
should be covered by the
knowledge graph? What concepts and
relationships should be in
the knowledge graph?
Pain points of “Metadata as a by-product” and relieves
DAMA DMBOK has pointed out:
“Metadata is often created as a
by-product of application
processing rather than as an
end product (i.e.. it is not
created with consumption in
mind). As with other forms of
data, there is a lot of work in
preparing Metadata before it
can be integrated.”
What is linked
metadata by
design?
• Linked metadata represents the integration
of outcomes from human collaboration. It
collectively forms the knowledge about
data, which is captured in the data
knowledge graph.
• By design emphasizes that the capture and
integration of metadata should start from
the conceptual level and continue through
the logical level, as human collaboration
begins at the design phase of data product
development.
Solution
Architecture
Various
metadata
sources
L0 L1 L2 L3
“Sources of knowledge” “Data knowledge graph”
What has been done
Column
Table
Schema
System group
Database
Digital domain
Digital unit
System
Architectural area
The current version of the
DKG has close to 1 million of
nodes and 1 million of edges
and it will continue to grow.
This is a snapshot from Neo4j
Bloom. Only 10K nodes (1%)
are displayed here.
Ingka Data Knowledge Graph (DKG)
NeoDash (UI)
Query
Visualize
Post changes
Ingka Data
Knowledge Graph
(DB)
https://github.com/neo4j-labs/neodash
NeoDash - Neo4j Dashboard Builder
NeoDash is an open source tool for
visualizing your Neo4j data. It lets you group
visualizations together as dashboards, and
allow for interactions between reports.
Explicit, formal and shareable
knowledge about data/ data products
stored in a graph database
Catalogued
Systems
Quick Insights
Intuitive Visuals
Coherent Understanding
Ø Systems are categorized
• by relations to other asset
types:
• Architectural Area
• System Group
• Digital Unit | Digital Domain
• by categorical attributes
Ø Explore details of the
system in interest
Catalogued
RDBs
Ø Datasources
are categorized by relations to
other asset types:
System à Digital Unità Digital Domain
Ø Explore details of the
datasource in interest and
its hierarchical assets
Column à Table à Schema à
Datasource
Quick Insights
Intuitive Visuals
Coherent Understanding
Consumer -
Universal
Search
Universal and
exploratory search for
various asset types, e.g.
System, Datasource…
Find | Inspect | Connect | Verify
Without the need to build and
maintain UIs
Link Data Concept to:
• Business Term
• Data Entities
• …
Link Data Attribute/Entity to:
• Business Term
• Code List
• Rules
• Policies
• …
Link Data Attribute to:
• Columns, Fields...
Provider –
Link Metadata
Current development and future
Column
Table
Schema
System group
Database
Digital domain
Digital unit
System
Architectural area
Near future potentials
• Knowledge graph completion: Link prediction with Word
Embeddings and KG Embeddings
Near future potentials
• Knowledge graph completion: Link prediction with Word
Embeddings and KG Embeddings
Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
Near future potentials
• Better search: Use combined capabilities of semantic and
structural search
• Refinement enables more precise or more complete search
results.
Query String
• Enables complementing search results with additional
associated or similar information.
Cross Referencing
• Enables the determination of nearby results and results related
by content.
Fuzzy Search
• Enables visualization and navigation of the search space.
Exploratory Search via
Linked Data
• Enables complementing search results with implicitly given
information.
Reasoning
Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
Near future potentials
• Leverage LLM + KG RAG
Tomaz Bratanic
Learn | Share | Collaborate
Ask DKG
anything about
your data
Near future potentials
• Build fast apps in OpenSource and InnerSource
Knowledge Building Community
o Terminology, ontology and modeling
o Data governance
o Data engineering
o Graph data science
o UX design and UI building
o …
• Iteration of curating and extracting knowledge
• Following the architecture of building and
consuming the knowledge graph
• By the many people, for the many people
INGKA DIGITAL: Linked Metadata by Design
Thank you!
Reach out on Shirley Bacso

More Related Content

PDF
Ingka Digital: Linked Metadata by Design
PDF
Neo4j – The Fastest Path to Scalable Real-Time Analytics
PDF
Unlock Your Data for ML & AI using Data Virtualization
PDF
Neo4j on Microsoft Azure
PDF
Big Data Evolution
PDF
data-science-roadmap Mục tiêu hướng tới Data Science
PDF
This is ChatGPT Book Data Science Roadmap.pdf
PDF
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Ingka Digital: Linked Metadata by Design
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Unlock Your Data for ML & AI using Data Virtualization
Neo4j on Microsoft Azure
Big Data Evolution
data-science-roadmap Mục tiêu hướng tới Data Science
This is ChatGPT Book Data Science Roadmap.pdf
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02

Similar to INGKA DIGITAL: Linked Metadata by Design (20)

PDF
How to build your own Delve: combining machine learning, big data and SharePoint
PPTX
GraphTalks Rome - Selecting the right Technology
PPTX
Getting started with with SharePoint Syntex
PPTX
NDC Oslo : A Practical Introduction to Data Science
PPTX
Delivering a Linked Data warehouse and realising the power of graphs
PPTX
Chapter 1 Introduction to Data Science (Computing)
PDF
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
PPTX
SegmentOfOne
PDF
Enterprise Data Marketplace: A Centralized Portal for All Your Data Assets
PDF
Big data and oracle
PDF
Team Data Science Process Presentation (TDSP), Aug 29, 2017
PDF
Predictions for the Future of Graph Database
PDF
GraphTour 2020 - Neo4j: What's New?
PDF
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
PPTX
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
PPTX
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
PPTX
Data Mesh in Azure using Cloud Scale Analytics (WAF)
PPTX
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
PDF
La strada verso il successo con i database a grafo, la Graph Data Science e l...
PDF
K7 Ultimate Security Crack FREE latest version 2025
How to build your own Delve: combining machine learning, big data and SharePoint
GraphTalks Rome - Selecting the right Technology
Getting started with with SharePoint Syntex
NDC Oslo : A Practical Introduction to Data Science
Delivering a Linked Data warehouse and realising the power of graphs
Chapter 1 Introduction to Data Science (Computing)
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
SegmentOfOne
Enterprise Data Marketplace: A Centralized Portal for All Your Data Assets
Big data and oracle
Team Data Science Process Presentation (TDSP), Aug 29, 2017
Predictions for the Future of Graph Database
GraphTour 2020 - Neo4j: What's New?
Neo4j GraphTalk Düsseldorf - Building intelligent solutions with Graphs
Get Started with the Most Advanced Edition Yet of Neo4j Graph Data Science
Ben Gardner | Delivering a Linked Data warehouse and integrating across the w...
Data Mesh in Azure using Cloud Scale Analytics (WAF)
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
La strada verso il successo con i database a grafo, la Graph Data Science e l...
K7 Ultimate Security Crack FREE latest version 2025
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
Ad

Recently uploaded (20)

PDF
Salesforce Agentforce AI Implementation.pdf
PDF
BoxLang Dynamic AWS Lambda - Japan Edition
PDF
Visual explanation of Dijkstra's Algorithm using Python
DOC
UTEP毕业证学历认证,宾夕法尼亚克拉里恩大学毕业证未毕业
DOCX
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
PPTX
Tech Workshop Escape Room Tech Workshop
PDF
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
PPTX
Computer Software - Technology and Livelihood Education
PDF
Guide to Food Delivery App Development.pdf
PDF
Topaz Photo AI Crack New Download (Latest 2025)
PDF
Practical Indispensable Project Management Tips for Delivering Successful Exp...
PPTX
MLforCyber_MLDataSetsandFeatures_Presentation.pptx
PDF
novaPDF Pro 11.9.482 Crack + License Key [Latest 2025]
PDF
Introduction to Ragic - #1 No Code Tool For Digitalizing Your Business Proces...
PDF
Microsoft Office 365 Crack Download Free
PDF
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
PDF
How Tridens DevSecOps Ensures Compliance, Security, and Agility
PDF
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
PDF
The Dynamic Duo Transforming Financial Accounting Systems Through Modern Expe...
PDF
DNT Brochure 2025 – ISV Solutions @ D365
Salesforce Agentforce AI Implementation.pdf
BoxLang Dynamic AWS Lambda - Japan Edition
Visual explanation of Dijkstra's Algorithm using Python
UTEP毕业证学历认证,宾夕法尼亚克拉里恩大学毕业证未毕业
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
Tech Workshop Escape Room Tech Workshop
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
Computer Software - Technology and Livelihood Education
Guide to Food Delivery App Development.pdf
Topaz Photo AI Crack New Download (Latest 2025)
Practical Indispensable Project Management Tips for Delivering Successful Exp...
MLforCyber_MLDataSetsandFeatures_Presentation.pptx
novaPDF Pro 11.9.482 Crack + License Key [Latest 2025]
Introduction to Ragic - #1 No Code Tool For Digitalizing Your Business Proces...
Microsoft Office 365 Crack Download Free
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
How Tridens DevSecOps Ensures Compliance, Security, and Agility
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
The Dynamic Duo Transforming Financial Accounting Systems Through Modern Expe...
DNT Brochure 2025 – ISV Solutions @ D365

INGKA DIGITAL: Linked Metadata by Design

  • 1. Linked Metadata by Design Presented by: Shirley Bacso Data Architect at Ingka Digital, Netherlands
  • 2. A few words about me • Data Architect at Ingka Digital, Netherlands • Specialize in metadata • Love problem-solving & learning • Graph enthusiast since June 2021
  • 3. My journey June 2021 Oct 2021 May 2022 Implementing Business Glossary Ingka Graph Event with Neo4j Feb 2023 Knowledge Graphs 2023, Prof. Dr. Harald Sack Oct 2023 DKG MVP NeoDash Feb 2024 Learn graph technology Exam case studies Research metadata management best practices Study organization realities Observe people Hunt for metadata Hunt for metadata Hunt for metadata Observe people Study organization realities Hunt for metadata Observe people Metadata – the knowledge of our data Dec 2023
  • 5. What types of questions should be answered by the knowledge graph? Do all the data columns/fields have business names? Are the data compliant to (retention) rules? What are the gaps between the data quality dimension goals and the reality? Do all the databases connected to the software systems? What systems does a certain digital domain have? What type of personal data does a software system have? Find the most relevant dataset (recommendation engine) and request access. Which data products are overlapping? (Think about the Principle of Data as a Product) How was the data transformed throughout the lifetime? (Provenance and operational lineage) …
  • 6. Which knowledge areas should be covered ? Collectively capture the broader knowledge about data
  • 7. arrows.app What concepts and relationships should be in the knowledge graph?
  • 8. The Ontology Development Process Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology Ontology development in practice is an iterative process that repeats continuously and improves the ontology. What types of questions should be answered by the knowledge graph? Which knowledge areas should be covered by the knowledge graph? What concepts and relationships should be in the knowledge graph?
  • 9. Pain points of “Metadata as a by-product” and relieves DAMA DMBOK has pointed out: “Metadata is often created as a by-product of application processing rather than as an end product (i.e.. it is not created with consumption in mind). As with other forms of data, there is a lot of work in preparing Metadata before it can be integrated.”
  • 10. What is linked metadata by design? • Linked metadata represents the integration of outcomes from human collaboration. It collectively forms the knowledge about data, which is captured in the data knowledge graph. • By design emphasizes that the capture and integration of metadata should start from the conceptual level and continue through the logical level, as human collaboration begins at the design phase of data product development.
  • 12. Architecture Various metadata sources L0 L1 L2 L3 “Sources of knowledge” “Data knowledge graph”
  • 14. Column Table Schema System group Database Digital domain Digital unit System Architectural area The current version of the DKG has close to 1 million of nodes and 1 million of edges and it will continue to grow. This is a snapshot from Neo4j Bloom. Only 10K nodes (1%) are displayed here. Ingka Data Knowledge Graph (DKG)
  • 15. NeoDash (UI) Query Visualize Post changes Ingka Data Knowledge Graph (DB) https://github.com/neo4j-labs/neodash NeoDash - Neo4j Dashboard Builder NeoDash is an open source tool for visualizing your Neo4j data. It lets you group visualizations together as dashboards, and allow for interactions between reports. Explicit, formal and shareable knowledge about data/ data products stored in a graph database
  • 16. Catalogued Systems Quick Insights Intuitive Visuals Coherent Understanding Ø Systems are categorized • by relations to other asset types: • Architectural Area • System Group • Digital Unit | Digital Domain • by categorical attributes Ø Explore details of the system in interest
  • 17. Catalogued RDBs Ø Datasources are categorized by relations to other asset types: System à Digital Unità Digital Domain Ø Explore details of the datasource in interest and its hierarchical assets Column à Table à Schema à Datasource Quick Insights Intuitive Visuals Coherent Understanding
  • 18. Consumer - Universal Search Universal and exploratory search for various asset types, e.g. System, Datasource…
  • 19. Find | Inspect | Connect | Verify Without the need to build and maintain UIs Link Data Concept to: • Business Term • Data Entities • … Link Data Attribute/Entity to: • Business Term • Code List • Rules • Policies • … Link Data Attribute to: • Columns, Fields... Provider – Link Metadata
  • 22. Near future potentials • Knowledge graph completion: Link prediction with Word Embeddings and KG Embeddings
  • 23. Near future potentials • Knowledge graph completion: Link prediction with Word Embeddings and KG Embeddings Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
  • 24. Near future potentials • Better search: Use combined capabilities of semantic and structural search • Refinement enables more precise or more complete search results. Query String • Enables complementing search results with additional associated or similar information. Cross Referencing • Enables the determination of nearby results and results related by content. Fuzzy Search • Enables visualization and navigation of the search space. Exploratory Search via Linked Data • Enables complementing search results with implicitly given information. Reasoning Knowledge Graphs 2023, Prof. Dr. Harald Sack, FIZ Karlsruhe – Leibniz Institute for Information Infrastructure & Karlsruhe Institute of Technology
  • 25. Near future potentials • Leverage LLM + KG RAG Tomaz Bratanic Learn | Share | Collaborate Ask DKG anything about your data
  • 26. Near future potentials • Build fast apps in OpenSource and InnerSource
  • 27. Knowledge Building Community o Terminology, ontology and modeling o Data governance o Data engineering o Graph data science o UX design and UI building o … • Iteration of curating and extracting knowledge • Following the architecture of building and consuming the knowledge graph • By the many people, for the many people
  • 29. Thank you! Reach out on Shirley Bacso