SlideShare a Scribd company logo
10
Most read
18
Most read
!
!
!
SATYAM AGARWALA
DEVELOPER
DATA
ANONYMIZATION
Why do we need data?
What is data anonymization?
Why anonymize data?
Data anonymization
Data anonymization
How do we anonymize data?
https://github.com/sunitparekh/data-anonymization
Sunit Parekh
Satyam Agarwala
You choose which
attributes to
anonymize
!
!
first name
last name
address
zipcode
handphone
birth date
!
!
: Satyam
: Agarwala
: 87B Amoy Street
: 069906
: 8765 4321
: 01/01/1945
BLACKLIST
WHITELIST
You choose which
attributes NOT to
anonymize
!
!
first name
last name
address
zipcode
handphone
birth date
!
!
: Satyam
: Woodward
: 10 Downing Street
: 123456
: 8765 4321
: 01/01/1945
Show me!
Script
(DSL, strategies, parallelization)
ORM
(RDBMS, NoSQL)
source DB destination DB
SUMMARY
GOTCHAS
FK CONSTRAINTS
Disable foreign key checks globally before running the script.
!
UNIQUE CONSTRAINTS
Whitelist or ensure a sequential non-random strategy for attributes that need to
be unique.
Are there other ways to anonymize
data?
FORMAL APPROACH
k-anonymity
!
l-diversity
!
t-closeness
!
δ-presence
ALTERNATIVE TOOLS
Arx
https://github.com/arx-
deidentifier/arx
THANK YOU!

More Related Content

PDF
Data Privacy: Anonymization & Re-Identification
PDF
What is Differential Privacy?
PDF
Data Privatisation, Data Anonymisation, Data Pseudonymisation and Differentia...
PDF
Data Protection Predictions for 2023.pdf
PPT
Data Protection Act
PPTX
Cryptography - Block cipher & stream cipher
PPTX
Creating a Data Management Plan
PPTX
Double DES & Triple DES
Data Privacy: Anonymization & Re-Identification
What is Differential Privacy?
Data Privatisation, Data Anonymisation, Data Pseudonymisation and Differentia...
Data Protection Predictions for 2023.pdf
Data Protection Act
Cryptography - Block cipher & stream cipher
Creating a Data Management Plan
Double DES & Triple DES

What's hot (20)

PDF
An overview of methods for data anonymization
PDF
RGPD / GDPR : Principes, Démarche, Outils
PPTX
Presentation on GDPR
PPT
Hash crypto
PPTX
Protect your Database with Data Masking & Enforced Version Control
PPTX
PDF
The Definitive Guide to Data Loss Prevention
PPTX
Trible data encryption standard (3DES)
ODP
Encryption presentation final
PPTX
Data protection ppt
PDF
Data sharing: How, what and why?
PDF
Overview on data privacy
PPTX
Data mining
PDF
Place of Service-1.pdf
PDF
2. public key cryptography and RSA
PPTX
Machine Learning in Big Data
PDF
Hierarchical Clustering
PPTX
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
PDF
Healthcare fraud detection
PPTX
MD5 ALGORITHM.pptx
An overview of methods for data anonymization
RGPD / GDPR : Principes, Démarche, Outils
Presentation on GDPR
Hash crypto
Protect your Database with Data Masking & Enforced Version Control
The Definitive Guide to Data Loss Prevention
Trible data encryption standard (3DES)
Encryption presentation final
Data protection ppt
Data sharing: How, what and why?
Overview on data privacy
Data mining
Place of Service-1.pdf
2. public key cryptography and RSA
Machine Learning in Big Data
Hierarchical Clustering
Hierarchical Clustering | Hierarchical Clustering in R |Hierarchical Clusteri...
Healthcare fraud detection
MD5 ALGORITHM.pptx
Ad

Viewers also liked (20)

PDF
ARX - a comprehensive tool for anonymizing / de-identifying biomedical data
PDF
Engineering data privacy - The ARX data anonymization tool
PDF
Data Privacy and Anonymization
PDF
ARX - a comprehensive tool for anonymizing / de-identifying biomedical data
PDF
International Journal of Engineering Research and Development
PPTX
Anonymizing Health Data
DOCX
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
PPTX
Protecting patients privacy slide presentation
PPT
slides
PPT
Data Privacy in India and data theft
PDF
Matinée Découverte Big Data & Data Science - 24012017
PDF
Privacy Preserving Data Mining
PDF
A Review Study on the Privacy Preserving Data Mining Techniques and Approaches
PPT
Data mining and privacy preserving in data mining
PPTX
Privacy act
PDF
Privacy Protectin Models and Defamation caused by k-anonymity
PPTX
Privacy in India: Legal issues
PPTX
Presentation on Information Privacy
PPTX
Privacy , Security and Ethics Presentation
PPT
Privacy preserving dm_ppt
ARX - a comprehensive tool for anonymizing / de-identifying biomedical data
Engineering data privacy - The ARX data anonymization tool
Data Privacy and Anonymization
ARX - a comprehensive tool for anonymizing / de-identifying biomedical data
International Journal of Engineering Research and Development
Anonymizing Health Data
Closeness through-microaggregation-strict-privacy-with-enhanced-utility-prese...
Protecting patients privacy slide presentation
slides
Data Privacy in India and data theft
Matinée Découverte Big Data & Data Science - 24012017
Privacy Preserving Data Mining
A Review Study on the Privacy Preserving Data Mining Techniques and Approaches
Data mining and privacy preserving in data mining
Privacy act
Privacy Protectin Models and Defamation caused by k-anonymity
Privacy in India: Legal issues
Presentation on Information Privacy
Privacy , Security and Ethics Presentation
Privacy preserving dm_ppt
Ad

Similar to Data anonymization (20)

PPTX
Automatski - The Internet of Things - Privacy in IoT
PDF
SFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdf
PDF
Data Anonymization For Better Software Testing
PPTX
Amnesia: Data anonymization made easy (8th OpenAIRE workshop)
PPTX
From Re-Identification Risk to Compliance: A Guide to Data Anonymization
PPTX
Data explosion
PPTX
Lions, zebras and Big Data Anonymization
PDF
Data Anonymization Process Challenges and Context Missions
PDF
Data Anonymization Process Challenges and Context Missions
PPTX
Micah Altman NISO privacy in library systems
PDF
Anonymisation theory and practice
PPTX
Understanding Data Anonymization- Protecting Privacy in the Age of Informatio...
PPTX
datamining-lect2 - What is data The data mining pipeline. Preprocessing and ...
PDF
Data Security & Data Privacy: Data Anonymization
PPTX
Functional anonymisation - risk management in a data environment
PDF
Problems in Technology to Use Anonymized Personal Data
PPTX
Distinct l diversity anonymization of set valued data
PDF
Computational privacy: balancing privacy and utility in the digital era
DOCX
M privacy for collaborative data publishing
PPTX
Privacy Protection Technologies: Introductory Overview
Automatski - The Internet of Things - Privacy in IoT
SFScon 22 - Paolo Pinto - Real Life Data Anonymization.pdf
Data Anonymization For Better Software Testing
Amnesia: Data anonymization made easy (8th OpenAIRE workshop)
From Re-Identification Risk to Compliance: A Guide to Data Anonymization
Data explosion
Lions, zebras and Big Data Anonymization
Data Anonymization Process Challenges and Context Missions
Data Anonymization Process Challenges and Context Missions
Micah Altman NISO privacy in library systems
Anonymisation theory and practice
Understanding Data Anonymization- Protecting Privacy in the Age of Informatio...
datamining-lect2 - What is data The data mining pipeline. Preprocessing and ...
Data Security & Data Privacy: Data Anonymization
Functional anonymisation - risk management in a data environment
Problems in Technology to Use Anonymized Personal Data
Distinct l diversity anonymization of set valued data
Computational privacy: balancing privacy and utility in the digital era
M privacy for collaborative data publishing
Privacy Protection Technologies: Introductory Overview

Recently uploaded (20)

PPTX
Spectroscopy.pptx food analysis technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Electronic commerce courselecture one. Pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Spectroscopy.pptx food analysis technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
MYSQL Presentation for SQL database connectivity
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Machine learning based COVID-19 study performance prediction
Electronic commerce courselecture one. Pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Chapter 3 Spatial Domain Image Processing.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Review of recent advances in non-invasive hemoglobin estimation
20250228 LYD VKU AI Blended-Learning.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Big Data Technologies - Introduction.pptx
MIND Revenue Release Quarter 2 2025 Press Release

Data anonymization