SlideShare a Scribd company logo
Graph Usage
for
Fraud Detection
and Bias Mitigation
Danish Business Authority (DBA)
●
Business registrations
●
Central Business Registry (CVR)
●
Fiscal report audits
●
Business support schemes
(eg: Covid-19, IT-security...etc)
●
Legal oversight & control
ML-Lab IKP
3
Why Machine Learning
• Make it easy to be a law-abiding company
AND: Make it hard to swindle
●
~800.000 companies in Denmark – impossible to check
everything by hand
• Focus efforts where most needed
• Requires data, infrastructure and software tools
4
Intelligent Kubernetes Platform
Four Main Components:​
●
Kubernilla: Vanilla version of Kubernetes, highly opinionated​
●
RaceTrack: Deployment system (www.github.com/theracetrack)​
●
CatWalk: Evaluation component​
●
RecordKeeper: Platform wide system event logger​
Plus: Data Warehouse (postgreSQL, Neo4J)
Development:​
Idempotent system design
Infrastructure as Code
One source of truth​
Knowledge Graph (postgreSQL → Neo4j)
●
CVR (Businesses, people,
addresses … etc)
●
DBA Cases
●
Fiscal Reports
...and much more ...
●
Labels: 50
●
Relationship types: 41
●
Node Properties: 237
●
Nodes: 445 mio
●
Edges: 688 mio
→ Forms basis for ML efforts
6
Example: Meta Graph
Apoc.meta.graph()
7
data
Registry data + metadata + observations
8
data
Registry data + metadata + observations
9
data
metadata
10
data
metadata
11
data
metadata
12
data
metadata
13
data
metadata
Machine learning
Group
Shared Client
14
data
metadata
Machine learning
Group
Shared Client
●
Automatic control of new data
●
Exploits what we already know
●
Uses machine insights
15
●
All Decision made by humans
– ML in supporting role
ML at the Business Authority
16
Pitfalls
●
ML: It is easy to do something:
→ but also extremely easy to do
it wrong
●
Any ML model reflects its training
data
●
ML is only as strong as the data
17
Doing it wrong: Unethical AI
United States: Repeat criminal offenders
●
Guided prison sentence lengths
●
Biased towards colored people
Netherlands: Child care benefits fraud
●
10.000s families effected
●
Many low-income families
●
Many pushed into poverty
●
Several suicides
●
Government resigned
18
Motivation / Bias
●
Build fair & ethical models
●
EU: Artificial Intelligence Act
(https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A52021PC0206)
●
EU: GDPR
(https://eur-lex.europa.eu/eli/reg/2016/679/oj)
Data ‘landscape’
Used data
known unknown
Unknown unknown
19
Motivation / Bias
●
Build fair & ethical models
●
EU: Artificial Intelligence Act
(https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A52021PC0206)
●
EU: GDPR
(https://eur-lex.europa.eu/eli/reg/2016/679/oj)
●
Challenge: Follow data trail, explain origin of
knowledge and conclusions
●
Our Answers: RecordKeeper & X-Rai framework
[Transparent, Responsible, Explainable AI:
https://pure.itu.dk/en/publications/x-rai-a-framework-for-the-transparent-responsible-and-accurate-us]
Data ‘landscape’
Used data
known population
Un-known population
20
ML at the Business Authority
●
Need for complete traceability
Traceability need
21
Flow
●
Describes quantity
traversing network
●
e.g: Traffic, Railways,
Water pipes
●
Knowledge graph:
Springs, pipes and sinks
https://yoshuabengio.org/2022/03/05/generative-flow-networks/
22
Example: Meta Graph
Apoc.meta.graph()
23
Example: Meta Graph
Apoc.meta.graph()
Capture causal flow, eg
ML-Model
Query
Output
24
RecordKeeper: System Event Logger
●
Server / Client system, Python
●
Passive component: Listening only
●
Platform Event Message (PEM):
●
One action on the cluster, Unique ID
●
Emitter ID
●
Predecessor ID known
●
Artifacts: Data references
●
Builds graph of PEMs and Artifacts
-> Facilitates explainability on the cluster
25
PEM Directed Acyclic Graphs (DAG)
●
Each event creates a PEM
●
PEMs can create or reference artifacts
Data ingest
Data
Warehouse
Model
Training
PEM
1
PEM
2
PEM
3
Components:
(Emitters)
DAG:
Artifacts: References Main knowledge Graph
26
RK Graph
27
Flow Networks
●
Edges as ‘action paths’
●
Probability representations
●
Inspired by Bangio et al.: [https://arxiv.org/abs/2106.04399v2]
[Flow software package: https://github.com/GFNOrg/gflownet]
28
●
Trace out data usage
●
PageRank for node importance
●
Bias Detection
– at training and runtime
– sink scores
Explainability & Bias detection
user
ML-models
data
Ss=∑ F(s ,a')−∑ F(s ,a)
29
●
Trace out data usage
●
PageRank for node importance
●
Bias Detection
– at training and runtime
– sink scores
●
Data driven insights for
explainability,
model retirement or
re-training
Explainability & Bias detection
service
Consumer
ML-models
data
ML-score
ML-score
30
●
Reward: ML-Score
●
Train Graph Neural Network
●
Learn flow structure
●
Meta Tensor Model across
data, actions and scores
Idea: Meta Model
user
ML-models
data
ML-score
ML-score
31
Closing Remarks
●
Knowledge Graphs facilitate ML-efforts at Danish Business Authority
●
Focus on Transparent, Responsible and Explainable AI (X-Rai)
●
RecordKeeper generates Causal knowledge graphs
(explainability, bias mitigation, Flow tensor models)
Open Sourcing main components
RaceTrack, adaptable launch system already publicly available at:
http:github.com/theracetrack
32
Flow across data example
33
●
Creates artifacts
●
RK plugin
●
Model calls
34
Graph Test
Example
●
Unused Nodes
DBA Data Journey
Doing it wrong
●
Great Britain: Student Grade Assignment
●
100.000s students affected
●
Lower grades prevented education admission

More Related Content

PDF
The three layers of a knowledge graph and what it means for authoring, storag...
PPTX
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
PPTX
EY: Why graph technology makes sense for fraud detection and customer 360 pro...
PDF
Fraud Detection with Graphs at the Danish Business Authority
PPTX
ENEL Electricity Grids on Neo4j Graph DB
PDF
Bertelsmann: BeTrend – Building a Trend Aggregation Machine.pdf
PPTX
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
PPTX
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...
The three layers of a knowledge graph and what it means for authoring, storag...
Banking Circle: Money Laundering Beware: A Modern Approach to AML with Machin...
EY: Why graph technology makes sense for fraud detection and customer 360 pro...
Fraud Detection with Graphs at the Danish Business Authority
ENEL Electricity Grids on Neo4j Graph DB
Bertelsmann: BeTrend – Building a Trend Aggregation Machine.pdf
Volvo Cars - Retrieving Safety Insights using Graphs (GraphSummit Stockholm 2...
EY + Neo4j: Why graph technology makes sense for fraud detection and customer...

What's hot (20)

PDF
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
PDF
Building a modern data stack to maintain an efficient and safe electrical grid
PDF
The Knowledge Graph Explosion
PDF
Optimizing Your Supply Chain with the Neo4j Graph
PDF
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
PDF
Modern Data Challenges require Modern Graph Technology
PDF
8 Steps to Creating a Data Strategy
PDF
Enterprise Architecture vs. Data Architecture
PDF
Data-centric design and the knowledge graph
PDF
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
PPTX
Elsevier: Empowering Knowledge Discovery in Research with Graphs
PDF
Workshop Introduction to Neo4j
PDF
Improving Data Literacy Around Data Architecture
PPTX
GraphAware: Insights Discovery with KGs: Bringing Archives to Life (GraphSumm...
PDF
Graph Databases for Master Data Management
PDF
Neo4j : Graphes de Connaissance, IA et LLMs
PPTX
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
PDF
How Will Knowledge Graphs Improve Clinical Reporting Workflows
PDF
Amsterdam - The Neo4j Graph Data Platform Today & Tomorrow
PDF
Intro to Cypher
Pourquoi Leroy Merlin a besoin d'un Knowledge Graph ?
Building a modern data stack to maintain an efficient and safe electrical grid
The Knowledge Graph Explosion
Optimizing Your Supply Chain with the Neo4j Graph
Optimizing the Supply Chain with Knowledge Graphs, IoT and Digital Twins_Moor...
Modern Data Challenges require Modern Graph Technology
8 Steps to Creating a Data Strategy
Enterprise Architecture vs. Data Architecture
Data-centric design and the knowledge graph
Knowledge Graphs for Transformation: Dynamic Context for the Intelligent Ente...
Elsevier: Empowering Knowledge Discovery in Research with Graphs
Workshop Introduction to Neo4j
Improving Data Literacy Around Data Architecture
GraphAware: Insights Discovery with KGs: Bringing Archives to Life (GraphSumm...
Graph Databases for Master Data Management
Neo4j : Graphes de Connaissance, IA et LLMs
Neo4j GraphSummit London - The Path To Success With Graph Database and Data S...
How Will Knowledge Graphs Improve Clinical Reporting Workflows
Amsterdam - The Neo4j Graph Data Platform Today & Tomorrow
Intro to Cypher
Ad

Similar to Danish Business Authority: Explainability and causality in relation to ML Ops (20)

PPTX
GraphTalk Copenhagen - Fraud Detection with Graphs
PPTX
Graph AI Industrial Applications: From Explainability To Discovery
PDF
Beyond-Accuracy Perspectives: Explainability and Fairness
PDF
ML & Graph algorithms to prevent financial crime in digital payments
PPTX
Apache Spark GraphX & GraphFrame Synthetic ID Fraud Use Case
PDF
Neo4j - Responsible AI
PDF
Leveraging Graphs for Better AI
PDF
TigerGraph UI Toolkits Financial Crimes
PDF
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
PPTX
Ethical AI - Open Compliance Summit 2020
PPTX
Responsible AI
PDF
How Graphs Enhance AI
PDF
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
PDF
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
PDF
How Graph Technology is Changing AI
PDF
Metadata and the Power of Pattern-Finding
PDF
Spark, GraphX, and Blockchains: Building a Behavioral Analytics Platform for ...
PPTX
Explainable AI in Industry (AAAI 2020 Tutorial)
PPTX
Data Con LA 2022 - Who Owns That Yacht? How Graphs Are Used to Identify Asset...
PDF
Leveraging Graphs for Better AI
GraphTalk Copenhagen - Fraud Detection with Graphs
Graph AI Industrial Applications: From Explainability To Discovery
Beyond-Accuracy Perspectives: Explainability and Fairness
ML & Graph algorithms to prevent financial crime in digital payments
Apache Spark GraphX & GraphFrame Synthetic ID Fraud Use Case
Neo4j - Responsible AI
Leveraging Graphs for Better AI
TigerGraph UI Toolkits Financial Crimes
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
Ethical AI - Open Compliance Summit 2020
Responsible AI
How Graphs Enhance AI
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
How Graph Technology is Changing AI
Metadata and the Power of Pattern-Finding
Spark, GraphX, and Blockchains: Building a Behavioral Analytics Platform for ...
Explainable AI in Industry (AAAI 2020 Tutorial)
Data Con LA 2022 - Who Owns That Yacht? How Graphs Are Used to Identify Asset...
Leveraging Graphs for Better AI
Ad

More from Neo4j (20)

PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
PDF
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
PDF
GraphSummit Singapore Master Deck - May 20, 2025
PPTX
Graphs & GraphRAG - Essential Ingredients for GenAI
PPTX
Neo4j Knowledge for Customer Experience.pptx
PPTX
GraphTalk New Zealand - The Art of The Possible.pptx
PDF
Neo4j: The Art of the Possible with Graph
PDF
Smarter Knowledge Graphs For Public Sector
PDF
GraphRAG and Knowledge Graphs Exploring AI's Future
PDF
Matinée GenAI & GraphRAG Paris - Décembre 24
PDF
ANZ Presentation: GraphSummit Melbourne 2024
PDF
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
PDF
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
PDF
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
PDF
Démonstration Digital Twin Building Wire Management
PDF
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
PDF
Démonstration Supply Chain - GraphTalk Paris
PDF
The Art of Possible - GraphTalk Paris Opening Session
PPTX
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
PDF
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Jin Foo - Prospa GraphSummit Sydney Presentation.pdf
GraphSummit Singapore Master Deck - May 20, 2025
Graphs & GraphRAG - Essential Ingredients for GenAI
Neo4j Knowledge for Customer Experience.pptx
GraphTalk New Zealand - The Art of The Possible.pptx
Neo4j: The Art of the Possible with Graph
Smarter Knowledge Graphs For Public Sector
GraphRAG and Knowledge Graphs Exploring AI's Future
Matinée GenAI & GraphRAG Paris - Décembre 24
ANZ Presentation: GraphSummit Melbourne 2024
Google Cloud Presentation GraphSummit Melbourne 2024: Building Generative AI ...
Telstra Presentation GraphSummit Melbourne: Optimising Business Outcomes with...
Hands-On GraphRAG Workshop: GraphSummit Melbourne 2024
Démonstration Digital Twin Building Wire Management
Swiss Life - Les graphes au service de la détection de fraude dans le domaine...
Démonstration Supply Chain - GraphTalk Paris
The Art of Possible - GraphTalk Paris Opening Session
How Siemens bolstered supply chain resilience with graph-powered AI insights ...
Knowledge Graphs for AI-Ready Data and Enterprise Deployment - Gartner IT Sym...

Recently uploaded (20)

PDF
PPT Items # 6&7 - 900 Cambridge Oval Right-of-Way
PPT
generalgeologygroundwaterchapt11-181117073208.ppt
PDF
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
PPTX
AMO Pune Complete information and work profile
PDF
Item # 2 - 934 Patterson Specific Use Permit (SUP)
DOC
LU毕业证学历认证,赫尔大学毕业证硕士的学历和学位
PDF
The Detrimental Impacts of Hydraulic Fracturing for Oil and Gas_ A Researched...
PDF
buyers sellers meeting of mangoes in mahabubnagar.pdf
PPTX
PCCR-ROTC-UNIT-ORGANIZATIONAL-STRUCTURE-pptx-Copy (1).pptx
PDF
Item # 3 - 934 Patterson Final Review.pdf
PPTX
Social_Medias_Parents_Education_PPT.pptx
PDF
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
PDF
26.1.2025 venugopal K Awarded with commendation certificate.pdf
PPTX
Portland FPDR Oregon Legislature 2025.pptx
PDF
ISO-9001-2015-internal-audit-checklist2-sample.pdf
PDF
Creating Memorable Moments_ Personalized Plant Gifts.pdf
PPTX
Omnibus rules on leave administration.pptx
DOCX
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
PPTX
sepsis.pptxMNGHGBDHSB KJHDGBSHVCJB KJDCGHBYUHFB SDJKFHDUJ
PDF
Item # 5 - 5307 Broadway St final review
PPT Items # 6&7 - 900 Cambridge Oval Right-of-Way
generalgeologygroundwaterchapt11-181117073208.ppt
PPT Item #s 2&3 - 934 Patterson SUP & Final Review
AMO Pune Complete information and work profile
Item # 2 - 934 Patterson Specific Use Permit (SUP)
LU毕业证学历认证,赫尔大学毕业证硕士的学历和学位
The Detrimental Impacts of Hydraulic Fracturing for Oil and Gas_ A Researched...
buyers sellers meeting of mangoes in mahabubnagar.pdf
PCCR-ROTC-UNIT-ORGANIZATIONAL-STRUCTURE-pptx-Copy (1).pptx
Item # 3 - 934 Patterson Final Review.pdf
Social_Medias_Parents_Education_PPT.pptx
2025 Shadow report on Ukraine's progression regarding Chapter 29 of the acquis
26.1.2025 venugopal K Awarded with commendation certificate.pdf
Portland FPDR Oregon Legislature 2025.pptx
ISO-9001-2015-internal-audit-checklist2-sample.pdf
Creating Memorable Moments_ Personalized Plant Gifts.pdf
Omnibus rules on leave administration.pptx
EAPP.docxdffgythjyuikuuiluikluikiukuuuuuu
sepsis.pptxMNGHGBDHSB KJHDGBSHVCJB KJDCGHBYUHFB SDJKFHDUJ
Item # 5 - 5307 Broadway St final review

Danish Business Authority: Explainability and causality in relation to ML Ops

  • 2. Danish Business Authority (DBA) ● Business registrations ● Central Business Registry (CVR) ● Fiscal report audits ● Business support schemes (eg: Covid-19, IT-security...etc) ● Legal oversight & control ML-Lab IKP
  • 3. 3 Why Machine Learning • Make it easy to be a law-abiding company AND: Make it hard to swindle ● ~800.000 companies in Denmark – impossible to check everything by hand • Focus efforts where most needed • Requires data, infrastructure and software tools
  • 4. 4 Intelligent Kubernetes Platform Four Main Components:​ ● Kubernilla: Vanilla version of Kubernetes, highly opinionated​ ● RaceTrack: Deployment system (www.github.com/theracetrack)​ ● CatWalk: Evaluation component​ ● RecordKeeper: Platform wide system event logger​ Plus: Data Warehouse (postgreSQL, Neo4J) Development:​ Idempotent system design Infrastructure as Code One source of truth​
  • 5. Knowledge Graph (postgreSQL → Neo4j) ● CVR (Businesses, people, addresses … etc) ● DBA Cases ● Fiscal Reports ...and much more ... ● Labels: 50 ● Relationship types: 41 ● Node Properties: 237 ● Nodes: 445 mio ● Edges: 688 mio → Forms basis for ML efforts
  • 7. 7 data Registry data + metadata + observations
  • 8. 8 data Registry data + metadata + observations
  • 14. 14 data metadata Machine learning Group Shared Client ● Automatic control of new data ● Exploits what we already know ● Uses machine insights
  • 15. 15 ● All Decision made by humans – ML in supporting role ML at the Business Authority
  • 16. 16 Pitfalls ● ML: It is easy to do something: → but also extremely easy to do it wrong ● Any ML model reflects its training data ● ML is only as strong as the data
  • 17. 17 Doing it wrong: Unethical AI United States: Repeat criminal offenders ● Guided prison sentence lengths ● Biased towards colored people Netherlands: Child care benefits fraud ● 10.000s families effected ● Many low-income families ● Many pushed into poverty ● Several suicides ● Government resigned
  • 18. 18 Motivation / Bias ● Build fair & ethical models ● EU: Artificial Intelligence Act (https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A52021PC0206) ● EU: GDPR (https://eur-lex.europa.eu/eli/reg/2016/679/oj) Data ‘landscape’ Used data known unknown Unknown unknown
  • 19. 19 Motivation / Bias ● Build fair & ethical models ● EU: Artificial Intelligence Act (https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=celex%3A52021PC0206) ● EU: GDPR (https://eur-lex.europa.eu/eli/reg/2016/679/oj) ● Challenge: Follow data trail, explain origin of knowledge and conclusions ● Our Answers: RecordKeeper & X-Rai framework [Transparent, Responsible, Explainable AI: https://pure.itu.dk/en/publications/x-rai-a-framework-for-the-transparent-responsible-and-accurate-us] Data ‘landscape’ Used data known population Un-known population
  • 20. 20 ML at the Business Authority ● Need for complete traceability Traceability need
  • 21. 21 Flow ● Describes quantity traversing network ● e.g: Traffic, Railways, Water pipes ● Knowledge graph: Springs, pipes and sinks https://yoshuabengio.org/2022/03/05/generative-flow-networks/
  • 23. 23 Example: Meta Graph Apoc.meta.graph() Capture causal flow, eg ML-Model Query Output
  • 24. 24 RecordKeeper: System Event Logger ● Server / Client system, Python ● Passive component: Listening only ● Platform Event Message (PEM): ● One action on the cluster, Unique ID ● Emitter ID ● Predecessor ID known ● Artifacts: Data references ● Builds graph of PEMs and Artifacts -> Facilitates explainability on the cluster
  • 25. 25 PEM Directed Acyclic Graphs (DAG) ● Each event creates a PEM ● PEMs can create or reference artifacts Data ingest Data Warehouse Model Training PEM 1 PEM 2 PEM 3 Components: (Emitters) DAG: Artifacts: References Main knowledge Graph
  • 27. 27 Flow Networks ● Edges as ‘action paths’ ● Probability representations ● Inspired by Bangio et al.: [https://arxiv.org/abs/2106.04399v2] [Flow software package: https://github.com/GFNOrg/gflownet]
  • 28. 28 ● Trace out data usage ● PageRank for node importance ● Bias Detection – at training and runtime – sink scores Explainability & Bias detection user ML-models data Ss=∑ F(s ,a')−∑ F(s ,a)
  • 29. 29 ● Trace out data usage ● PageRank for node importance ● Bias Detection – at training and runtime – sink scores ● Data driven insights for explainability, model retirement or re-training Explainability & Bias detection service Consumer ML-models data ML-score ML-score
  • 30. 30 ● Reward: ML-Score ● Train Graph Neural Network ● Learn flow structure ● Meta Tensor Model across data, actions and scores Idea: Meta Model user ML-models data ML-score ML-score
  • 31. 31 Closing Remarks ● Knowledge Graphs facilitate ML-efforts at Danish Business Authority ● Focus on Transparent, Responsible and Explainable AI (X-Rai) ● RecordKeeper generates Causal knowledge graphs (explainability, bias mitigation, Flow tensor models) Open Sourcing main components RaceTrack, adaptable launch system already publicly available at: http:github.com/theracetrack
  • 36. Doing it wrong ● Great Britain: Student Grade Assignment ● 100.000s students affected ● Lower grades prevented education admission