SlideShare a Scribd company logo
7
Most read
14
Most read
18
Most read
1
September 2024 / Data Management
Erica Bertan
How Open Metadata
helps us in the Data
Governance at Loggi
8+ years of experience in data and software engineering teams
Software Engineering: microservices, unit tests, spark, docker
Data: analytics, visualization tools, data modelling, data quality
Strategy, communication and leadership
Erica Bertan
Analytics Engineering Manager
about Loggi
Brazilian logistic company
About Loggi
● 10+ years of experience delivering packages
● 300,000 of packages / day
● Hubs and distribution centers in the 27 federal states of the
country
● US$ 1 billion of investments in 7 years (SoftBank, Microsoft, GGV
Capital, Monashees, Kaszkek and others)
● 2019: a brazilian unicorn
Challenges
● Continental country, several modus operandi to delivery
package
● Complex logistic chain
● More than 2 thousand of workers spread across the country
● Every federal state with particularities
● how we are doing Data Governance at Loggi at the
moment
● what works for us
● how Open Metadata fits in the Data Governance
marathon
Goals
● deep dive about tools
● technical aspects of our infrastructure
● the pros and cons of the stack
Not goals
the problems
Problem 1: Communication and
definition of responsibilities
"Who can I ask about the business context of this model?"
Problem 2: Data organization
"Where can I find the correct data in order to produce insights?"
"How can I even start?"
Problem 3: Data reporting
inconsistencies
"Which metric is the correct one?"
● 18,000 of dashboards and looks
● 50 looker models
Problem 4: Complex
structures
"Which table should I use?"
● package_events versus package_register
data: our big numbers
~100
ETL Jobs
The midnight job processes
almost 500 hundred tables in 8
hours
data: our big numbers
776
Looker users
42TB DL
100TB DW
~1,8k
Looker Dashboards
200 GB
new data
daily volume
9 million
new records of
package's tracking/day
2,5 hours/day
Average daily usage
9,4k
tables
Storage
how Open Metadata fits?
how Open Metadata fits?
Definition of Ownership
"Who can I ask about the business context of this model?"
how Open Metadata fits?
Data Lineage
"What's the impact of this model new release?"
how Open Metadata fits?
Deletion of dated dashboards
"This dashboard is not used anymore" - from 18 thousand to 1,5 thousand
The process occurred in the following sequence:
1. We listed all the dashboards.
2. We informed the company.
3. People marked what they used.
4. We deleted everything that wasn't marked.
how Open Metadata fits?
Catalog
"Whats the meaning of this table/column table?"
how Open Metadata fits?
Data Quality
"Can we add robustness to these models?"
how Open Metadata fits?
Alerts
Proactiveness and observability building trust
timeline
Jun-Dec
2023
Cleaning the house
✅Ownerships
✅model refactoring: midnight
job 30% faster
✅deletion of unused/dated
dashboards: from 18 -> 1.5
thousand
Jan-Jun
2024
Gold metrics and data
quality
✅building trust: developing
17+ test cases of data quality
on top of important models
✅catalog: documenting 250+
of our data sources
Jun-Dec
2024
Governance
✅ Deletion of unused/dated
tables: recovery of U$D
2000/month
🔄 Organization of models
and permissions on Looker
🔄 Organization: ownerships,
metadata, data quality
Jan-Jun
2025
🎯work in progress
Thank you!
Obrigada!
erica.bertan@loggi.com
loggi.com
21

More Related Content

PDF
Data Engineer's Lunch #85: Designing a Modern Data Stack
PDF
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
PPTX
bigdata 2.pptx
PPTX
bigdata.pptx
PPTX
final oracle presentation
PPT
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
PDF
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
PPTX
Introduction to Big Data
Data Engineer's Lunch #85: Designing a Modern Data Stack
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
bigdata 2.pptx
bigdata.pptx
final oracle presentation
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Introduction to Big Data

Similar to OpenMetadata Spotlight - OpenMetadata @ Loggi by Erica Bertan (20)

DOC
Resume_Sita_Ramadas_akkineni
PDF
bigdata.pdf
PPTX
Big data by Mithlesh sadh
PDF
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
PDF
Big data issues and challenges
PPT
D.3.1: State of the Art - Linked Data and Digital Preservation
PDF
Hadoop and SAP BI
PDF
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
PPT
Counting Unique Users in Real-Time: Here's a Challenge for You!
PDF
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
PDF
The Role of Logical Data Fabric in a Unified Platform for Modern Analytics (A...
PPTX
[Rakuten TechConf2014] [A-4] Rakuten Ichiba
DOC
Shuchi_Agrawal
PDF
Report for internship
DOC
Amith_Mansingh_Ramanund's_Resume
DOCX
Resume (1)
DOCX
Resume (1)
DOCX
VamsiKrishna Maddiboina
PPTX
Northern New England Tableau User Group - September 2024 Meeting
Resume_Sita_Ramadas_akkineni
bigdata.pdf
Big data by Mithlesh sadh
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Big data issues and challenges
D.3.1: State of the Art - Linked Data and Digital Preservation
Hadoop and SAP BI
Data Science Salon: Quit Wasting Time – Case Studies in Production Machine Le...
Counting Unique Users in Real-Time: Here's a Challenge for You!
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of Logical Data Fabric in a Unified Platform for Modern Analytics (A...
[Rakuten TechConf2014] [A-4] Rakuten Ichiba
Shuchi_Agrawal
Report for internship
Amith_Mansingh_Ramanund's_Resume
Resume (1)
Resume (1)
VamsiKrishna Maddiboina
Northern New England Tableau User Group - September 2024 Meeting
Ad

More from OpenMetadata (20)

PPTX
OpenMetadata Spotlight - OpenMetadata @ Talentys Data.pptx
PPTX
2025_07_23 - OpenMetadata Community Meeting.pptx
PDF
2025_06_18 - OpenMetadata Community Meeting.pdf
PDF
OpenMetadata Spotlight - OpenMetadata @ EDNON
PPTX
OpenMetadata Community Meeting - 21st May 2025
PDF
OpenMetadata Community Meeting - 16th April 2025
PDF
OpenMetadata Spotlight - OpenMetadata @ Gorgias
PDF
OpenMetadata Community Meeting - 19th March 2025
PDF
OpenMetadata Community Meeting - 19th February 2025
PDF
OpenMetadata Spotlight - OpenMetadata @ Carrefour Brazil
PDF
OpenMetadata Community Meeting - 15th January 2025
PDF
OpenMetadata Community Meeting - 18th December 2024
PDF
OpenMetadata Community Meeting - 20th November 2024
PDF
OpenMetadata Community Meeting - 16th October 2024
PDF
OpenMetadata Community Meeting - 18th September 2024
PDF
OpenMetadata Community Meeting - 7th August 2024
PDF
OpenMetadata Spotlight - OpenMetadata @ Thndr by Fizza Abid
PDF
OpenMetadata Spotlight - OpenMetadata @ Aspire by Vinol Joy Dsouza
PDF
OpenMetadata Community Meeting - 5th June 2024
PDF
OpenMetadata Community Meeting - 8th May 2024
OpenMetadata Spotlight - OpenMetadata @ Talentys Data.pptx
2025_07_23 - OpenMetadata Community Meeting.pptx
2025_06_18 - OpenMetadata Community Meeting.pdf
OpenMetadata Spotlight - OpenMetadata @ EDNON
OpenMetadata Community Meeting - 21st May 2025
OpenMetadata Community Meeting - 16th April 2025
OpenMetadata Spotlight - OpenMetadata @ Gorgias
OpenMetadata Community Meeting - 19th March 2025
OpenMetadata Community Meeting - 19th February 2025
OpenMetadata Spotlight - OpenMetadata @ Carrefour Brazil
OpenMetadata Community Meeting - 15th January 2025
OpenMetadata Community Meeting - 18th December 2024
OpenMetadata Community Meeting - 20th November 2024
OpenMetadata Community Meeting - 16th October 2024
OpenMetadata Community Meeting - 18th September 2024
OpenMetadata Community Meeting - 7th August 2024
OpenMetadata Spotlight - OpenMetadata @ Thndr by Fizza Abid
OpenMetadata Spotlight - OpenMetadata @ Aspire by Vinol Joy Dsouza
OpenMetadata Community Meeting - 5th June 2024
OpenMetadata Community Meeting - 8th May 2024
Ad

Recently uploaded (20)

PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Computer network topology notes for revision
PPTX
Introduction to machine learning and Linear Models
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Introduction to the R Programming Language
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
annual-report-2024-2025 original latest.
Introduction to Knowledge Engineering Part 1
Qualitative Qantitative and Mixed Methods.pptx
Supervised vs unsupervised machine learning algorithms
STERILIZATION AND DISINFECTION-1.ppthhhbx
Fluorescence-microscope_Botany_detailed content
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Computer network topology notes for revision
Introduction to machine learning and Linear Models
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
[EN] Industrial Machine Downtime Prediction
Business Ppt On Nestle.pptx huunnnhhgfvu
Clinical guidelines as a resource for EBP(1).pdf
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Introduction to the R Programming Language
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
SAP 2 completion done . PRESENTATION.pptx
climate analysis of Dhaka ,Banglades.pptx
annual-report-2024-2025 original latest.

OpenMetadata Spotlight - OpenMetadata @ Loggi by Erica Bertan

  • 1. 1 September 2024 / Data Management Erica Bertan How Open Metadata helps us in the Data Governance at Loggi
  • 2. 8+ years of experience in data and software engineering teams Software Engineering: microservices, unit tests, spark, docker Data: analytics, visualization tools, data modelling, data quality Strategy, communication and leadership Erica Bertan Analytics Engineering Manager
  • 4. Brazilian logistic company About Loggi ● 10+ years of experience delivering packages ● 300,000 of packages / day ● Hubs and distribution centers in the 27 federal states of the country ● US$ 1 billion of investments in 7 years (SoftBank, Microsoft, GGV Capital, Monashees, Kaszkek and others) ● 2019: a brazilian unicorn Challenges ● Continental country, several modus operandi to delivery package ● Complex logistic chain ● More than 2 thousand of workers spread across the country ● Every federal state with particularities
  • 5. ● how we are doing Data Governance at Loggi at the moment ● what works for us ● how Open Metadata fits in the Data Governance marathon Goals ● deep dive about tools ● technical aspects of our infrastructure ● the pros and cons of the stack Not goals
  • 7. Problem 1: Communication and definition of responsibilities "Who can I ask about the business context of this model?"
  • 8. Problem 2: Data organization "Where can I find the correct data in order to produce insights?" "How can I even start?"
  • 9. Problem 3: Data reporting inconsistencies "Which metric is the correct one?" ● 18,000 of dashboards and looks ● 50 looker models
  • 10. Problem 4: Complex structures "Which table should I use?" ● package_events versus package_register
  • 11. data: our big numbers
  • 12. ~100 ETL Jobs The midnight job processes almost 500 hundred tables in 8 hours data: our big numbers 776 Looker users 42TB DL 100TB DW ~1,8k Looker Dashboards 200 GB new data daily volume 9 million new records of package's tracking/day 2,5 hours/day Average daily usage 9,4k tables Storage
  • 14. how Open Metadata fits? Definition of Ownership "Who can I ask about the business context of this model?"
  • 15. how Open Metadata fits? Data Lineage "What's the impact of this model new release?"
  • 16. how Open Metadata fits? Deletion of dated dashboards "This dashboard is not used anymore" - from 18 thousand to 1,5 thousand The process occurred in the following sequence: 1. We listed all the dashboards. 2. We informed the company. 3. People marked what they used. 4. We deleted everything that wasn't marked.
  • 17. how Open Metadata fits? Catalog "Whats the meaning of this table/column table?"
  • 18. how Open Metadata fits? Data Quality "Can we add robustness to these models?"
  • 19. how Open Metadata fits? Alerts Proactiveness and observability building trust
  • 20. timeline Jun-Dec 2023 Cleaning the house ✅Ownerships ✅model refactoring: midnight job 30% faster ✅deletion of unused/dated dashboards: from 18 -> 1.5 thousand Jan-Jun 2024 Gold metrics and data quality ✅building trust: developing 17+ test cases of data quality on top of important models ✅catalog: documenting 250+ of our data sources Jun-Dec 2024 Governance ✅ Deletion of unused/dated tables: recovery of U$D 2000/month 🔄 Organization of models and permissions on Looker 🔄 Organization: ownerships, metadata, data quality Jan-Jun 2025 🎯work in progress