SlideShare a Scribd company logo
HBase System Tables / Metadata Info
Wellington Chevreuil
Overview
● General metadata files
● Table level metadata files
● Meta Table
● Namespace Table
● ACLs Table
● Snapshot metadata files
● Hands-on labs
General Metadata Files
● Simple files defining hbase unique identification and version, respectively
○ /hbase/hbase.id
■ Created once the cluster is being set up, or when Master is restarted and
no id file is found
○ /hbase/hbase.version
■ Created only upon cluster install or upgrade
● Written in protobuff format
○ For implementation details: org.apache.hadoop.hbase.ClusterId,
org.apache.hadoop.hbase.util.FSUtils
● Most issues will be related with corruption
● Duplicate hbase.id may prevent Replication from start (often occurs if the whole
"/hbase" is copied between clusters when bootstrapping backup cluster)
Table Level Metadata Files
● /hbase/data/default/test-1/.tabledesc/.tableinfo.0000000001
○ This is where several table structure information such as CFs, coprocessors,
encoding, etc, are stored.
○ This is same info shown by hbase shell describe command.
● /hbase/data/default/test-1/40f1ab8c352a23f54b343de5e3125f4e/.regioninfo
○ Keeps info such as region name, start and end keys
● /hbase/data/default/test-1/40f1ab8c352a23f54b343de5e3125f4e/recovered.edits/1
27.seqid
○ Empty file, named with the region initialisation sequence id
● Most issues related to HDFS corruption
○ HBCK provides -fixHdfsOrphans and -fixTableOrphans to regenerate those files.
More info on the HBCK session
HBase Meta
● Records regions assignments;
● First table to be onlined upon initialisation (HBase fails if it can't bring it online);
● Is queried by clients when these try to access specific rowkeys for the 1st time;
○ Once the specific region assignment is known, client keeps a cache
● Has same hdfs structure from user tables:
It only has a single region,
with a different naming
convention, then user
tables.
HBase Meta
● It's normally updated by Master's AssignmentManager class (more details on HBase
RIT session)
● Reflects current regions assignments state:
● How many regions are online for this test-1 table?
Namespace Table
● Records available namespaces:
● Must be online, so that hbase is operational.
● Has same hdfs structure from user tables:
Namespace Table
● Most common issue involving namespace table is HBASE-14190:
○ Currently, namespace table does not have assignment priority over user tables
○ On cluster with too many regions, namespace table can get relegated for more
than 5 minutes to get online
○ Causes Master startup to crash, due to timeout determined by
hbase.master.namespace.init.timeout (default 5 minutes)
● Other problems would be caused by any sort of table hdfs file corruption (much rare)
ACLs Table
● Table where HBase internal ACLs are stored.
● Only created once hbase.security.authorization is set.
● Queried by hbase shell user_permission command, or scanning it directly:
● ACLs are also stored on ZK under /hbase/acl znode. When updating ACLs, HBase will
also try to edit this znode.
Snapshot metadata files
● Created under /hbase/.hbase-snapshot/SNAPSHOT_NAME hdfs dir. Example:
○ /hbase/.hbase-snapshot/snap-1/.snapshotinfo
○ /hbase/.hbase-snapshot/snap-1/data.manifest
● Store snapshots details such as table schema, hfiles containing data for the time
snapshot has been taken and general snapshot information.
● These files are also serialised within protobuff. Contents can be viewed with
snapshotinfo command:
HBase Meta - Possible issues
● HDFS files corruption
○ Would require meta reconstruction, HBCK/HBCK2 would be useful
● Meta not consistent with hdfs table/regions structure
○ Usually due uncompleted or broken region splits;
● Region not assigned to any RS
○ Caused by some failed previous region assignment

More Related Content

PDF
Administration and Management with UltraESB
PDF
UltraESB - Installation and Configuration
PPTX
PXF HAWQ Unmanaged Data
PPTX
Meet hbase 2.0
PPTX
HBase Incremental Backup
PPTX
PXF BDAM 2016
PDF
Analyze corefile and backtraces with GDB for Mysql/MariaDB on Linux - Nilanda...
PPTX
Hawq Hcatalog Integration
Administration and Management with UltraESB
UltraESB - Installation and Configuration
PXF HAWQ Unmanaged Data
Meet hbase 2.0
HBase Incremental Backup
PXF BDAM 2016
Analyze corefile and backtraces with GDB for Mysql/MariaDB on Linux - Nilanda...
Hawq Hcatalog Integration

What's hot (16)

PDF
MySQL database replication
PDF
Highly efficient backups with percona xtrabackup
PDF
What's New in PostgreSQL 9.6
 
PPT
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
PPT
8a. How To Setup HBase with Docker
PPT
8b. Column Oriented Databases Lab
PDF
HBase Blockcache 101
PPT
Zing Database
PPTX
PostgreSQL Hangout Parameter Tuning
DOC
My sql technical reference manual
PDF
PostgreSQL Replication High Availability Methods
PPTX
Monitoring multiple ultra esb instances with u console
PDF
Dumitru Enache - Bacula
PDF
Apache HDFS - Lab Assignment
DOCX
Inno db datafiles backup and retore
PDF
Introduction to Drupal - Installation, Anatomy, Terminologies
MySQL database replication
Highly efficient backups with percona xtrabackup
What's New in PostgreSQL 9.6
 
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
8a. How To Setup HBase with Docker
8b. Column Oriented Databases Lab
HBase Blockcache 101
Zing Database
PostgreSQL Hangout Parameter Tuning
My sql technical reference manual
PostgreSQL Replication High Availability Methods
Monitoring multiple ultra esb instances with u console
Dumitru Enache - Bacula
Apache HDFS - Lab Assignment
Inno db datafiles backup and retore
Introduction to Drupal - Installation, Anatomy, Terminologies
Ad

Similar to HBase System Tables / Metadata Info (20)

PPTX
HBaseCon 2013: Apache HBase Table Snapshots
PPTX
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
PPTX
Hbase.pptx
PPT
Hbase introduction
PDF
004 architecture andadvanceduse
PDF
Apachecon Europe 2012: Operating HBase - Things you need to know
ODP
HBase introduction talk
PDF
Apache HBase: Where We've Been and What's Upcoming
DOCX
Hbase Quick Review Guide for Interviews
PPTX
Hbasepreso 111116185419-phpapp02
PPTX
Introduction to Apache HBase
PDF
Apache HBase 1.0 Release
PDF
PPT
Chicago Data Summit: Apache HBase: An Introduction
ODP
Apache hadoop hbase
PDF
HBaseConAsia2018 Track3-4: HBase and OpenTSDB practice at Huawei
PPTX
PPTX
Apache HBase Internals you hoped you Never Needed to Understand
PDF
Facebook keynote-nicolas-qcon
PDF
Facebook Messages & HBase
HBaseCon 2013: Apache HBase Table Snapshots
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
Hbase.pptx
Hbase introduction
004 architecture andadvanceduse
Apachecon Europe 2012: Operating HBase - Things you need to know
HBase introduction talk
Apache HBase: Where We've Been and What's Upcoming
Hbase Quick Review Guide for Interviews
Hbasepreso 111116185419-phpapp02
Introduction to Apache HBase
Apache HBase 1.0 Release
Chicago Data Summit: Apache HBase: An Introduction
Apache hadoop hbase
HBaseConAsia2018 Track3-4: HBase and OpenTSDB practice at Huawei
Apache HBase Internals you hoped you Never Needed to Understand
Facebook keynote-nicolas-qcon
Facebook Messages & HBase
Ad

More from wchevreuil (10)

PDF
Cloudera Enabling Native Integration of NoSQL HBase with Cloud Providers.pdf
PDF
HDFS client write/read implementation details
PDF
HBase RITs
PDF
HBase tales from the trenches
PPTX
Hbasecon2019 hbck2 (1)
PDF
Web hdfs and httpfs
PDF
HBase replication
PPT
Hadoop tuning
PPT
I nd t_bigdata(1)
PDF
Hadoop - TDC 2012
Cloudera Enabling Native Integration of NoSQL HBase with Cloud Providers.pdf
HDFS client write/read implementation details
HBase RITs
HBase tales from the trenches
Hbasecon2019 hbck2 (1)
Web hdfs and httpfs
HBase replication
Hadoop tuning
I nd t_bigdata(1)
Hadoop - TDC 2012

Recently uploaded (20)

PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PPTX
L1 - Introduction to python Backend.pptx
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
CHAPTER 2 - PM Management and IT Context
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Understanding Forklifts - TECH EHS Solution
PDF
Digital Strategies for Manufacturing Companies
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
Nekopoi APK 2025 free lastest update
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PPTX
Introduction to Artificial Intelligence
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
PTS Company Brochure 2025 (1).pdf.......
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
Navsoft: AI-Powered Business Solutions & Custom Software Development
L1 - Introduction to python Backend.pptx
Reimagine Home Health with the Power of Agentic AI​
Adobe Illustrator 28.6 Crack My Vision of Vector Design
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Design an Analysis of Algorithms I-SECS-1021-03
CHAPTER 2 - PM Management and IT Context
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Understanding Forklifts - TECH EHS Solution
Digital Strategies for Manufacturing Companies
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Odoo Companies in India – Driving Business Transformation.pdf
Nekopoi APK 2025 free lastest update
Which alternative to Crystal Reports is best for small or large businesses.pdf
How to Choose the Right IT Partner for Your Business in Malaysia
Introduction to Artificial Intelligence
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PTS Company Brochure 2025 (1).pdf.......

HBase System Tables / Metadata Info

  • 1. HBase System Tables / Metadata Info Wellington Chevreuil
  • 2. Overview ● General metadata files ● Table level metadata files ● Meta Table ● Namespace Table ● ACLs Table ● Snapshot metadata files ● Hands-on labs
  • 3. General Metadata Files ● Simple files defining hbase unique identification and version, respectively ○ /hbase/hbase.id ■ Created once the cluster is being set up, or when Master is restarted and no id file is found ○ /hbase/hbase.version ■ Created only upon cluster install or upgrade ● Written in protobuff format ○ For implementation details: org.apache.hadoop.hbase.ClusterId, org.apache.hadoop.hbase.util.FSUtils ● Most issues will be related with corruption ● Duplicate hbase.id may prevent Replication from start (often occurs if the whole "/hbase" is copied between clusters when bootstrapping backup cluster)
  • 4. Table Level Metadata Files ● /hbase/data/default/test-1/.tabledesc/.tableinfo.0000000001 ○ This is where several table structure information such as CFs, coprocessors, encoding, etc, are stored. ○ This is same info shown by hbase shell describe command. ● /hbase/data/default/test-1/40f1ab8c352a23f54b343de5e3125f4e/.regioninfo ○ Keeps info such as region name, start and end keys ● /hbase/data/default/test-1/40f1ab8c352a23f54b343de5e3125f4e/recovered.edits/1 27.seqid ○ Empty file, named with the region initialisation sequence id ● Most issues related to HDFS corruption ○ HBCK provides -fixHdfsOrphans and -fixTableOrphans to regenerate those files. More info on the HBCK session
  • 5. HBase Meta ● Records regions assignments; ● First table to be onlined upon initialisation (HBase fails if it can't bring it online); ● Is queried by clients when these try to access specific rowkeys for the 1st time; ○ Once the specific region assignment is known, client keeps a cache ● Has same hdfs structure from user tables: It only has a single region, with a different naming convention, then user tables.
  • 6. HBase Meta ● It's normally updated by Master's AssignmentManager class (more details on HBase RIT session) ● Reflects current regions assignments state: ● How many regions are online for this test-1 table?
  • 7. Namespace Table ● Records available namespaces: ● Must be online, so that hbase is operational. ● Has same hdfs structure from user tables:
  • 8. Namespace Table ● Most common issue involving namespace table is HBASE-14190: ○ Currently, namespace table does not have assignment priority over user tables ○ On cluster with too many regions, namespace table can get relegated for more than 5 minutes to get online ○ Causes Master startup to crash, due to timeout determined by hbase.master.namespace.init.timeout (default 5 minutes) ● Other problems would be caused by any sort of table hdfs file corruption (much rare)
  • 9. ACLs Table ● Table where HBase internal ACLs are stored. ● Only created once hbase.security.authorization is set. ● Queried by hbase shell user_permission command, or scanning it directly: ● ACLs are also stored on ZK under /hbase/acl znode. When updating ACLs, HBase will also try to edit this znode.
  • 10. Snapshot metadata files ● Created under /hbase/.hbase-snapshot/SNAPSHOT_NAME hdfs dir. Example: ○ /hbase/.hbase-snapshot/snap-1/.snapshotinfo ○ /hbase/.hbase-snapshot/snap-1/data.manifest ● Store snapshots details such as table schema, hfiles containing data for the time snapshot has been taken and general snapshot information. ● These files are also serialised within protobuff. Contents can be viewed with snapshotinfo command:
  • 11. HBase Meta - Possible issues ● HDFS files corruption ○ Would require meta reconstruction, HBCK/HBCK2 would be useful ● Meta not consistent with hdfs table/regions structure ○ Usually due uncompleted or broken region splits; ● Region not assigned to any RS ○ Caused by some failed previous region assignment