SlideShare a Scribd company logo
3
Most read
12
Most read
23
Most read
OpenMetadata Community Forum August
Data Cataloging and Governance
Using OpenMetadata at Thndr
Data Engineer (Data Platform Engineer)
● 5+ years of experience in Data
Engineering
● Worked on ETL, Data Quality,
Data Governance, platform &
infrastructure deployments.
● Analyze optimizations in the
data platform and analytics
Fizza Abid
Agenda
● What does Thndr do?
● What is the OpenMetadata Architecture at thndr?
● What were the Metadata Challenges at Thndr?
● Why did we choose OpenMetadata?
● What are use cases for using OpenMetadata?
● How did we implement OpenMetadata at Thndr?
What does Thndr do?
Push on Education
No Barriers
Investment
Supermarket Relevant & Intuitive
Users can open and
manage accounts without
visiting branches or access
to restrictive capital and
can fund their accounts
easily
Providing access to all
relevant investment
products whether local
or abroad
Equipping out users
with everything they
need for a successful
investing journey
Relevant interface and
hand-holding focused
experience
Why invest on Thndr?
What has Thndr Achieved?
● Buy and Sell Stocks
● Invest in Mutual Funds
● Invest in Gold
● Instant Top Ups and withdrawals
Features of Thndr
● Automated Data Quality Alerts
● Visibility about metadata (freshness,
volume, schema changes, etc.)
● Manual detection of PII columns
● Business or Technical Glossary
● Centralized metadata repository
Metadata
Challenges
at Thndr
Lineage
Data
Quality
Metadata
Visibility
Manual Pll
Detection
Why did we choose OpenMetadata?
Open Source
OpenMetadata is
open-source and it can
be deployed on EKS or
docker easily
1
Features
The features that they’re
offering are better than
paid solutions and in a
single place, we can find
metadata management,
lineage, data quality, etc.
2
Community Support
OpenMetadata
community is support
on reaching out on
slack, they answer your
questions instantly
3
OpenMetadata Architecture at Thndr
● Deployed OpenMetadata on
EC2 machines using Docker
image on AWS
● We used EC2 Machines on AWS
● Rotate JWT tokens when
deploying OpenMetadata
● SSO deployment for an added
layer of security
Deployment
● We have small Team of 2 Data Engineers, one data platform
engineer, and Team of 3 Data Analyst
● Business Users can know what particular column in data
products means using OpenMetadata Glossary Feature
● Data Quality checks are implemented by Data Engineers and
Data Analyst
● We have small number of data quality test yet but we’ll be
putting it on more tables.
● Most of the checks are done for Batch ETL like checking
duplicates, freshness of data, etc.
OpenMetadata Usage at Thndr
OpenMetadata Usage at Thndr
● Search in the explorer based on table
name, column name, schema name
● inbuilt data cataloging feature with many
different datasets and tables, each with
varying access levels for different users
● Data catalog provides a single, unified
user experience for quickly discovering
these datasets.
● Metadata, DDLs, views, and stored
procedures will be available and can be
explored through data catalog.
Data Discovery
Data Cataloging
Data Classification
● OpenMetadata uses machine learning to auto classify PII columns into sensitive and
non-sensitive.
● Also provide custom tagging into personal or special category
Data Classification
● Data Lineage in OpenMetadata
is available on either
observability layer or column
layer
● Data lineage shows the
complete journey and
transformation of data from its
origin to its final destination.
● Export data lineage in excel
Data Lineage
Data Quality
● Data Quality Feature in
OpenMetadata allows to run
custom test cases based on the
table or columns.
● Custom SQL Query for data quality
testing like if you want to compare
values in multiple tables or have
any dynamic logic.
● Data quality test include match
column name to certain regex,
value in between, not null, etc.
Data Quality
Data Quality
Data Quality
● Role based access control
team wise, department wise,
and roles include Data
Consumer, Data Steward, etc.
● Attribute based access control
● Granular level of access control
on resources
Data Governance

More Related Content

PDF
OpenMetadata Community Meeting - 7th August 2024
PDF
Using neo4j for enterprise metadata requirements
PDF
OpenMetadata Community Meeting - 14 Dec. 2023
PPTX
JOSA TechTalk: Metadata Management
in Big Data
PDF
OpenMetadata Spotlight - OpenMetadata @ Carrefour Brazil
PDF
OpenMetadata Community Meeting - 19th February 2025
PDF
OpenMetadata Spotlight - OpenMetadata @ Loggi by Erica Bertan
PDF
OpenMetadata Community Meeting - 18th September 2024
OpenMetadata Community Meeting - 7th August 2024
Using neo4j for enterprise metadata requirements
OpenMetadata Community Meeting - 14 Dec. 2023
JOSA TechTalk: Metadata Management
in Big Data
OpenMetadata Spotlight - OpenMetadata @ Carrefour Brazil
OpenMetadata Community Meeting - 19th February 2025
OpenMetadata Spotlight - OpenMetadata @ Loggi by Erica Bertan
OpenMetadata Community Meeting - 18th September 2024

Similar to OpenMetadata Spotlight - OpenMetadata @ Thndr by Fizza Abid (20)

PDF
OpenMetadata Community Meeting - 8th May 2024
PDF
Introduction to metadata management
PDF
DataGraft: Data-as-a-Service for Open Data
PDF
OpenMetadata Community Meeting - 5th June 2024
PPTX
OpenDataForge - SledgeHammer EDDI 2013 presentation
PDF
OpenMetadata Community Meeting - 18th December 2024
PDF
What is Metadata_ Unlocking Smarter Archiving Solutions.pdf
PDF
OpenMetadata Community Meeting - 15th January 2025
PPTX
2025_07_23 - OpenMetadata Community Meeting.pptx
PDF
Data Discovery and Metadata
PPTX
Benefits of Data
PDF
2025_06_18 - OpenMetadata Community Meeting.pdf
PPTX
Platform Deep Dive
PDF
Data Discovery & Trust through Metadata
PDF
OpenMetadata Community Meeting - 4th April, 2024
PPTX
Optier presentation for open analytics event
PPTX
metadata.pptx
PPTX
OpenMetadata Spotlight - OpenMetadata @ Talentys Data.pptx
PPTX
Group Project 1.pptx
PPTX
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
OpenMetadata Community Meeting - 8th May 2024
Introduction to metadata management
DataGraft: Data-as-a-Service for Open Data
OpenMetadata Community Meeting - 5th June 2024
OpenDataForge - SledgeHammer EDDI 2013 presentation
OpenMetadata Community Meeting - 18th December 2024
What is Metadata_ Unlocking Smarter Archiving Solutions.pdf
OpenMetadata Community Meeting - 15th January 2025
2025_07_23 - OpenMetadata Community Meeting.pptx
Data Discovery and Metadata
Benefits of Data
2025_06_18 - OpenMetadata Community Meeting.pdf
Platform Deep Dive
Data Discovery & Trust through Metadata
OpenMetadata Community Meeting - 4th April, 2024
Optier presentation for open analytics event
metadata.pptx
OpenMetadata Spotlight - OpenMetadata @ Talentys Data.pptx
Group Project 1.pptx
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
Ad

More from OpenMetadata (11)

PDF
OpenMetadata Spotlight - OpenMetadata @ EDNON
PPTX
OpenMetadata Community Meeting - 21st May 2025
PDF
OpenMetadata Community Meeting - 16th April 2025
PDF
OpenMetadata Spotlight - OpenMetadata @ Gorgias
PDF
OpenMetadata Community Meeting - 19th March 2025
PDF
OpenMetadata Community Meeting - 20th November 2024
PDF
OpenMetadata Community Meeting - 16th October 2024
PDF
OpenMetadata Spotlight - OpenMetadata @ Aspire by Vinol Joy Dsouza
PDF
OpenMetadata Community Spotlight - Jürgen Zornig from ms.GIS
PDF
OpenMetadata Webinar on Custom Connectors
PDF
OpenMetadata Community Meeting - 1 Feb. 2024
OpenMetadata Spotlight - OpenMetadata @ EDNON
OpenMetadata Community Meeting - 21st May 2025
OpenMetadata Community Meeting - 16th April 2025
OpenMetadata Spotlight - OpenMetadata @ Gorgias
OpenMetadata Community Meeting - 19th March 2025
OpenMetadata Community Meeting - 20th November 2024
OpenMetadata Community Meeting - 16th October 2024
OpenMetadata Spotlight - OpenMetadata @ Aspire by Vinol Joy Dsouza
OpenMetadata Community Spotlight - Jürgen Zornig from ms.GIS
OpenMetadata Webinar on Custom Connectors
OpenMetadata Community Meeting - 1 Feb. 2024
Ad

Recently uploaded (20)

PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Lecture1 pattern recognition............
PDF
Introduction to the R Programming Language
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
Managing Community Partner Relationships
PDF
annual-report-2024-2025 original latest.
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Business Analytics and business intelligence.pdf
Data_Analytics_and_PowerBI_Presentation.pptx
Lecture1 pattern recognition............
Introduction to the R Programming Language
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Reliability_Chapter_ presentation 1221.5784
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Supervised vs unsupervised machine learning algorithms
Managing Community Partner Relationships
annual-report-2024-2025 original latest.
STUDY DESIGN details- Lt Col Maksud (21).pptx
Qualitative Qantitative and Mixed Methods.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
[EN] Industrial Machine Downtime Prediction
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Quality review (1)_presentation of this 21
Introduction-to-Cloud-ComputingFinal.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx

OpenMetadata Spotlight - OpenMetadata @ Thndr by Fizza Abid

  • 1. OpenMetadata Community Forum August Data Cataloging and Governance Using OpenMetadata at Thndr
  • 2. Data Engineer (Data Platform Engineer) ● 5+ years of experience in Data Engineering ● Worked on ETL, Data Quality, Data Governance, platform & infrastructure deployments. ● Analyze optimizations in the data platform and analytics Fizza Abid
  • 3. Agenda ● What does Thndr do? ● What is the OpenMetadata Architecture at thndr? ● What were the Metadata Challenges at Thndr? ● Why did we choose OpenMetadata? ● What are use cases for using OpenMetadata? ● How did we implement OpenMetadata at Thndr?
  • 4. What does Thndr do? Push on Education No Barriers Investment Supermarket Relevant & Intuitive Users can open and manage accounts without visiting branches or access to restrictive capital and can fund their accounts easily Providing access to all relevant investment products whether local or abroad Equipping out users with everything they need for a successful investing journey Relevant interface and hand-holding focused experience
  • 5. Why invest on Thndr?
  • 6. What has Thndr Achieved?
  • 7. ● Buy and Sell Stocks ● Invest in Mutual Funds ● Invest in Gold ● Instant Top Ups and withdrawals Features of Thndr
  • 8. ● Automated Data Quality Alerts ● Visibility about metadata (freshness, volume, schema changes, etc.) ● Manual detection of PII columns ● Business or Technical Glossary ● Centralized metadata repository Metadata Challenges at Thndr Lineage Data Quality Metadata Visibility Manual Pll Detection
  • 9. Why did we choose OpenMetadata? Open Source OpenMetadata is open-source and it can be deployed on EKS or docker easily 1 Features The features that they’re offering are better than paid solutions and in a single place, we can find metadata management, lineage, data quality, etc. 2 Community Support OpenMetadata community is support on reaching out on slack, they answer your questions instantly 3
  • 11. ● Deployed OpenMetadata on EC2 machines using Docker image on AWS ● We used EC2 Machines on AWS ● Rotate JWT tokens when deploying OpenMetadata ● SSO deployment for an added layer of security Deployment
  • 12. ● We have small Team of 2 Data Engineers, one data platform engineer, and Team of 3 Data Analyst ● Business Users can know what particular column in data products means using OpenMetadata Glossary Feature ● Data Quality checks are implemented by Data Engineers and Data Analyst ● We have small number of data quality test yet but we’ll be putting it on more tables. ● Most of the checks are done for Batch ETL like checking duplicates, freshness of data, etc. OpenMetadata Usage at Thndr
  • 14. ● Search in the explorer based on table name, column name, schema name ● inbuilt data cataloging feature with many different datasets and tables, each with varying access levels for different users ● Data catalog provides a single, unified user experience for quickly discovering these datasets. ● Metadata, DDLs, views, and stored procedures will be available and can be explored through data catalog. Data Discovery
  • 16. Data Classification ● OpenMetadata uses machine learning to auto classify PII columns into sensitive and non-sensitive. ● Also provide custom tagging into personal or special category
  • 18. ● Data Lineage in OpenMetadata is available on either observability layer or column layer ● Data lineage shows the complete journey and transformation of data from its origin to its final destination. ● Export data lineage in excel Data Lineage
  • 19. Data Quality ● Data Quality Feature in OpenMetadata allows to run custom test cases based on the table or columns. ● Custom SQL Query for data quality testing like if you want to compare values in multiple tables or have any dynamic logic. ● Data quality test include match column name to certain regex, value in between, not null, etc.
  • 23. ● Role based access control team wise, department wise, and roles include Data Consumer, Data Steward, etc. ● Attribute based access control ● Granular level of access control on resources Data Governance