DATA ORCHESTRATION SUMMIT
2020
How to build a new Under Filesystem in Alluxio
Apache Ozone as an example
Baolong Mao | Sr. System Engineer
Alluxio PMC
Apache Ozone Committer
Contributions from Tencent
● Ozone & COSN UFS
● Support using an IP address as the connect host
● Make PermissionChecker configurable
● Prometheus + Grafana monitoring dashboard template & docs
● Mount table web UI
● Generic tarball script to generate customized Hadoop
● More metrics: lock pool size and block remover metrics
● Add ETag header to S3 API proxy
● ……………………….
Agenda
● Alluxio global namespace
● Apache Ozone
● Alluxio UFS framework
● How to implement the Ozone UFS
Alluxio global namespace introduction
How the unified namespace works in Alluxio with different under file systems
Alluxio Overview
Alluxio global namespace
Apache Ozone Introduction
A brief introduction to Apache Ozone
Apache Ozone
Alluxio UFS framework
An introduction to the Alluxio UFS
How Alluxio UFS works
• ./bin/alluxio fs mount \
    --option alluxio.underfs.hdfs.configuration=<DIR>/ozone-site.xml:<DIR>/core-site.xml \
    /ozone o3fs://<OZONE_BUCKET>.<OZONE_VOLUME>/
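To make the two arguments of the mount command more concrete, here is a minimal Java sketch of how they could be taken apart. It assumes the option value is a colon-separated list of Hadoop configuration files and that the UFS address has the form o3fs://<bucket>.<volume>/ shown above. The class name, method names and sample values are hypothetical and are not part of Alluxio or Ozone.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.List;

// Illustrative only: none of these names come from Alluxio or Ozone.
class MountArgsSketch {

    /** Splits a colon-separated list of configuration file paths. */
    static List<String> configFiles(String optionValue) {
        return Arrays.asList(optionValue.split(":"));
    }

    /** Returns {bucket, volume} from an o3fs://<bucket>.<volume>/ address. */
    static String[] bucketAndVolume(String ufsUri) {
        String authority = URI.create(ufsUri).getAuthority();
        if (authority == null) {
            throw new IllegalArgumentException("No authority in " + ufsUri);
        }
        // Limit of 3 keeps anything after the volume (if present) in one piece.
        String[] parts = authority.split("\\.", 3);
        if (parts.length < 2) {
            throw new IllegalArgumentException("Expected <bucket>.<volume> in " + ufsUri);
        }
        return new String[] {parts[0], parts[1]};
    }

    public static void main(String[] args) {
        // Hypothetical values standing in for <DIR>, <OZONE_BUCKET>, <OZONE_VOLUME>.
        String option = "/opt/conf/ozone-site.xml:/opt/conf/core-site.xml";
        String ufsUri = "o3fs://bucket1.volume1/";

        System.out.println(configFiles(option));
        String[] bv = bucketAndVolume(ufsUri);
        System.out.println("bucket=" + bv[0] + ", volume=" + bv[1]);
    }
}
```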
Service Discovery
● Under storage modules are loaded dynamically
● Discovery uses the Java ServiceLoader; each module implements the alluxio.underfs.UnderFileSystemFactory interface
● The implementation class is declared in META-INF/services/alluxio.underfs.UnderFileSystemFactory
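The discovery step can be sketched in plain Java as follows. UfsFactory is a simplified, hypothetical stand-in for alluxio.underfs.UnderFileSystemFactory, and the selection loop is an assumption about how a matching factory might be picked, not Alluxio's actual code. Only java.util.ServiceLoader is a real API here.

```java
import java.util.Optional;
import java.util.ServiceLoader;

/** Hypothetical, reduced stand-in for alluxio.underfs.UnderFileSystemFactory. */
interface UfsFactory {
    boolean supportsPath(String path);
    String name();
}

class DiscoverySketch {

    /** Returns the first factory that claims support for the given path. */
    static Optional<UfsFactory> find(Iterable<UfsFactory> factories, String path) {
        for (UfsFactory factory : factories) {
            if (factory.supportsPath(path)) {
                return Optional.of(factory);
            }
        }
        return Optional.empty();
    }

    /**
     * Looks up factories through the ServiceLoader. Providers are found only if a
     * jar on the classpath lists them in a META-INF/services file named after the
     * fully qualified interface name.
     */
    static Optional<UfsFactory> discover(String path) {
        return find(ServiceLoader.load(UfsFactory.class), path);
    }

    /** Helper for the demo: a factory that matches paths by URI prefix. */
    static UfsFactory prefixFactory(String name, String prefix) {
        return new UfsFactory() {
            @Override public boolean supportsPath(String path) {
                return path != null && path.startsWith(prefix);
            }
            @Override public String name() {
                return name;
            }
        };
    }

    public static void main(String[] args) {
        // No provider is registered in this single-file demo, so this is empty.
        System.out.println(discover("o3fs://bucket1.volume1/").isPresent());

        // The same selection logic applied to factories created by hand.
        UfsFactory ozone = prefixFactory("ozone", "o3fs://");
        UfsFactory hdfs = prefixFactory("hdfs", "hdfs://");
        System.out.println(
            find(java.util.List.of(hdfs, ozone), "o3fs://bucket1.volume1/")
                .map(UfsFactory::name)
                .orElse("none"));
    }
}
```

Because ServiceLoader is itself an Iterable, the same find method serves both the real lookup and a hand-built list, which keeps the selection logic easy to exercise without packaging a jar.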
Dependency Management
● Fat jar
● Shaded jar
● Isolated classloading
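The idea behind isolated classloading can be illustrated with a small, generic Java sketch. It builds a URLClassLoader whose parent is the bootstrap loader, so classes from the application classpath are not visible through it. This is a demonstration of the general technique only. It is not Alluxio's extension class loader, and the class and method names are hypothetical.

```java
import java.net.URL;
import java.net.URLClassLoader;

// Illustrative only: a generic isolation demo, not Alluxio's class loader.
class IsolationSketch {

    /** Returns true if the loader can resolve the class name. */
    static boolean canLoad(ClassLoader loader, String className) {
        try {
            Class.forName(className, false, loader);
            return true;
        } catch (ClassNotFoundException e) {
            return false;
        }
    }

    /**
     * Creates a loader that sees only JDK core classes plus the given jars.
     * Passing null as the parent skips the application classpath.
     */
    static URLClassLoader isolatedLoader(URL... jars) {
        return new URLClassLoader(jars, null);
    }

    public static void main(String[] args) throws Exception {
        try (URLClassLoader isolated = isolatedLoader()) {
            // Core JDK classes remain reachable through the bootstrap loader.
            System.out.println(canLoad(isolated, "java.lang.String"));
            // Application classes are hidden from the isolated loader.
            System.out.println(canLoad(isolated, IsolationSketch.class.getName()));
        }
    }
}
```

In a real extension the jar URLs would point at the fat or shaded under storage jar, so that its bundled dependencies cannot clash with versions used elsewhere in the process.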
Implementing an Under Storage Extension
•Implementing the required under storage interface
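As a rough illustration of this step, the sketch below implements a pair of reduced interfaces. MiniUfs and MiniUfsFactory are hypothetical stand-ins with only a few methods; the real Alluxio under storage interfaces are considerably larger and have different signatures. The in-memory backing store is also an assumption made so that the example runs on its own.

```java
import java.io.FileNotFoundException;
import java.io.UncheckedIOException;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical, heavily reduced stand-in for an under storage interface. */
interface MiniUfs {
    String getUnderFSType();
    boolean exists(String path);
    void writeFile(String path, byte[] data);
    byte[] readFile(String path);
}

/** Hypothetical stand-in for the factory that creates a UFS for a path. */
interface MiniUfsFactory {
    boolean supportsPath(String path);
    MiniUfs create(String path);
}

/** Keeps files in memory so the sketch runs without a real Ozone cluster. */
class InMemoryOzoneLikeUfs implements MiniUfs {
    private final Map<String, byte[]> files = new ConcurrentHashMap<>();

    @Override public String getUnderFSType() {
        return "ozone";
    }

    @Override public boolean exists(String path) {
        return files.containsKey(path);
    }

    @Override public void writeFile(String path, byte[] data) {
        files.put(path, data.clone());
    }

    @Override public byte[] readFile(String path) {
        byte[] data = files.get(path);
        if (data == null) {
            throw new UncheckedIOException(new FileNotFoundException(path));
        }
        return data.clone();
    }
}

/** Accepts only paths that use the o3fs scheme. */
class OzoneLikeFactory implements MiniUfsFactory {
    static final String SCHEME = "o3fs://";

    @Override public boolean supportsPath(String path) {
        return path != null && path.startsWith(SCHEME);
    }

    @Override public MiniUfs create(String path) {
        if (!supportsPath(path)) {
            throw new IllegalArgumentException("Unsupported path: " + path);
        }
        return new InMemoryOzoneLikeUfs();
    }
}

class UfsInterfaceSketch {
    public static void main(String[] args) {
        MiniUfsFactory factory = new OzoneLikeFactory();
        MiniUfs ufs = factory.create("o3fs://bucket1.volume1/");
        ufs.writeFile("o3fs://bucket1.volume1/B", "hello".getBytes());
        System.out.println(ufs.getUnderFSType());
        System.out.println(new String(ufs.readFile("o3fs://bucket1.volume1/B")));
    }
}
```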
• Declaring the service implementation
• Add Maven plugins to pom.xml to build the package
<build>
  <plugins>
    <plugin>
      <groupId>org.apache.maven.plugins</groupId>
      <artifactId>maven-shade-plugin</artifactId>
    </plugin>
    <plugin>
      <groupId>com.coderplus.maven.plugins</groupId>
      <artifactId>copy-rename-maven-plugin</artifactId>
    </plugin>
  </plugins>
</build>
How to implement the Ozone UFS
alluxio fs cat /ozone/B
https://github.com/Alluxio/alluxio/pull/11396
Hadoop Compatible File System (HCFS)
• https://docs.alluxio.io/os/user/edge/en/Overview.html
• https://docs.alluxio.io/os/user/edge/en/overview/Architecture.html
• https://docs.alluxio.io/os/user/edge/en/core-services/Unified-Namespace.html
• https://docs.alluxio.io/os/user/edge/en/ufs/Ozone.html
• https://docs.alluxio.io/os/user/edge/en/ufs/Ufs-Extensions.html
• https://docs.alluxio.io/os/user/edge/en/ufs/Ufs-Extension-API.html
How to Join Alluxio Community
Thank you!
alluxio.io/slack
