SlideShare a Scribd company logo
Easy Object Storage Import/Export
Using the S3 Storage Connector
Lee Liming (LLiming@uchicago.edu)
University of Chicago
funded by the National Science Foundation
Award #ACI-1445604
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
Cloud, specialized for the research community
Researchers browse VM images
saved by other researchers
– Web interface provided by
CyVerse
– Share fully-installed systems with
applications ready to go
– OpenStack API/CLI access for
advanced users
Images can be launched on VMs
scaled for research
– 1-44 vCPUs
– 2-120 GB RAM
– 8-480GB disk
Use is allocated by NSF/XSEDE
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
Jetstream’s object storage (Swift/Ceph)
• 1.25 PB usable storage
(If all of JS was object store, no VMs,
volumes, etc. Of course, it’s not…)
• Access via Swift APIs
(and any tools that use them)
– Horizon web app
– OpenStack CLI
• AWS S3-compatible
• A place to learn &
evaluate before the
commercial cloud
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
Research data starts & lives at home
• Campus systems or research facilities/labs
– Data often comes from
facilities/labs/observatories
– Results are often needed elsewhere
– Publishing requirements are becoming important
• Very often, available via Globus
• So…how do we get it from there into Jetstream’s
object store for use via Ceph, Swift, S3, etc?
– Must be: easy, fast, reliable
• (Jetstream to “next step location” is also
valuable…)
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
Fire-and-forget transfers into Jetstream buckets
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
So how does this work?
• Globus Connect Server (the
server version of Globus)
offers storage connectors to
subscribers, including an
AWS S3 connector.
• The AWS S3 connector works
equally well with OpenStack
Swift’s S3 dialect.
Setting it up
1. Create a VM w/floating IP.
2. Install GCSv5.2 w/S3 connector.
3. Create an S3 storage gateway
requiring an XSEDE ID.
4. Use Globus web app to create a
collection and supply S3-style
credentials.
5. Use the collection to transfer data
in or out.
6. Shelve VM until needed again.
Credential management
When the GCSv5 server admin
does this…
…it allows users to
do this.
funded by the National Science Foundation
Award #ACI-1445604
http://jetstream-cloud.org/
Load dataset from EBI into a Jetstream bucket

More Related Content

PPTX
Wikipedia Cloud Search Webinar
PPTX
Apache CloudStack 4.2: A First Look
PDF
Deploying Data Science with Docker and AWS
PPTX
Serverless Architecture - introduction + AWS demo
PPTX
AWS Elastic Container Service (ECS) with a CI Pipeline Overview
PDF
Intro to Serverless
PPTX
Running on Elastic Beanstalk
PPTX
Containerization in microsoft azure
Wikipedia Cloud Search Webinar
Apache CloudStack 4.2: A First Look
Deploying Data Science with Docker and AWS
Serverless Architecture - introduction + AWS demo
AWS Elastic Container Service (ECS) with a CI Pipeline Overview
Intro to Serverless
Running on Elastic Beanstalk
Containerization in microsoft azure

What's hot (20)

PPTX
104 meets cloud
PPTX
Greetings from AWS User Group Taiwan
PPTX
Developer workflow with docker
PPTX
Cooking Akka.net and Azure Service Fabric together
PDF
Virtualized Containers - How Good is it - Ananth - Siemens - CC18
PPTX
AWS Distilled
PPTX
Containerization with Microsoft Azure
PDF
Monitoring docker: from zero to Azure
PPTX
Container orchestration overview
PPTX
Kubernetes on OpenStack @eBay
PDF
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
PPTX
Deploying .net application using VSTS on ACS in kubernetes
PDF
ContainerDays NYC 2016: "Containers in Azure: Understanding the Microsoft Con...
PPTX
Microservices using .Net core
PDF
Demystifying the cloud
PPTX
Serverless Patterns
PPTX
Rubix - Serverless architecture
PPTX
What are clouds made from
PPTX
Introduction to container mangement
PPTX
Persistent, Portable Storage for Docker Containers and Microservices
104 meets cloud
Greetings from AWS User Group Taiwan
Developer workflow with docker
Cooking Akka.net and Azure Service Fabric together
Virtualized Containers - How Good is it - Ananth - Siemens - CC18
AWS Distilled
Containerization with Microsoft Azure
Monitoring docker: from zero to Azure
Container orchestration overview
Kubernetes on OpenStack @eBay
Must Know Azure Kubernetes Best Practices And Features For Better Resiliency ...
Deploying .net application using VSTS on ACS in kubernetes
ContainerDays NYC 2016: "Containers in Azure: Understanding the Microsoft Con...
Microservices using .Net core
Demystifying the cloud
Serverless Patterns
Rubix - Serverless architecture
What are clouds made from
Introduction to container mangement
Persistent, Portable Storage for Docker Containers and Microservices
Ad

Similar to Easy Object Storage Import/Export Using the S3 Connector on Jetstream (20)

PPTX
Jetstream: Accessible cloud computing for the national science and engineerin...
PPTX
On-Demand Cloud Computing for Life Sciences Research and Education
PPTX
Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure
PDF
Easy access to open stack object storage
PDF
Jetstream - Adding Cloud-based Computing to the National Cyberinfrastructure
PDF
Initial presentation of swift (for montreal user group)
PPTX
Integrating with Aws s3
PPTX
ABCs of AWS: S3
PDF
Everything you wanted to know about RadosGW - Orit Wasserman, Matt Benjamin
PPT
Build A Cloud Day London - Introduction
PPT
Cloud Computing Amazon
PDF
Not Your Father’s Web App: The Cloud-Native Architecture of images.nasa.gov
PPTX
Storage as a service OpenStack
DOCX
Research Paper
PPTX
OpenStack for AWS Architects - Similarities, differences and bridging the gap
ODP
Jetstream
PDF
PowerPoint Presentation.pdf
PDF
Amazon S3 Overview
PDF
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
PDF
[AWS DC Meetup] Not Your Father’s WebApp: The Cloud-Native Architecture of im...
Jetstream: Accessible cloud computing for the national science and engineerin...
On-Demand Cloud Computing for Life Sciences Research and Education
Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure
Easy access to open stack object storage
Jetstream - Adding Cloud-based Computing to the National Cyberinfrastructure
Initial presentation of swift (for montreal user group)
Integrating with Aws s3
ABCs of AWS: S3
Everything you wanted to know about RadosGW - Orit Wasserman, Matt Benjamin
Build A Cloud Day London - Introduction
Cloud Computing Amazon
Not Your Father’s Web App: The Cloud-Native Architecture of images.nasa.gov
Storage as a service OpenStack
Research Paper
OpenStack for AWS Architects - Similarities, differences and bridging the gap
Jetstream
PowerPoint Presentation.pdf
Amazon S3 Overview
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
[AWS DC Meetup] Not Your Father’s WebApp: The Cloud-Native Architecture of im...
Ad

More from Globus (20)

PDF
Globus Compute wth IRI Workflows - GlobusWorld 2024
PDF
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
PDF
Globus Compute Introduction - GlobusWorld 2024
PDF
Globus Connect Server Deep Dive - GlobusWorld 2024
PDF
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
PDF
First Steps with Globus Compute Multi-User Endpoints
PDF
Enhancing Research Orchestration Capabilities at ORNL.pdf
PDF
Understanding Globus Data Transfers with NetSage
PDF
How to Position Your Globus Data Portal for Success Ten Good Practices
PDF
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
PDF
Developing Distributed High-performance Computing Capabilities of an Open Sci...
PDF
The Department of Energy's Integrated Research Infrastructure (IRI)
PDF
GlobusWorld 2024 Opening Keynote session
PDF
Enhancing Performance with Globus and the Science DMZ
PDF
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
PDF
Globus at the United States Geological Survey
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
PDF
Globus Compute with Integrated Research Infrastructure (IRI) workflows
PDF
Reactive Documents and Computational Pipelines - Bridging the Gap
Globus Compute wth IRI Workflows - GlobusWorld 2024
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus Compute Introduction - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
First Steps with Globus Compute Multi-User Endpoints
Enhancing Research Orchestration Capabilities at ORNL.pdf
Understanding Globus Data Transfers with NetSage
How to Position Your Globus Data Portal for Success Ten Good Practices
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
The Department of Energy's Integrated Research Infrastructure (IRI)
GlobusWorld 2024 Opening Keynote session
Enhancing Performance with Globus and the Science DMZ
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Globus at the United States Geological Survey
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Reactive Documents and Computational Pipelines - Bridging the Gap

Recently uploaded (20)

PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
Cloud computing and distributed systems.
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
Spectroscopy.pptx food analysis technology
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Machine learning based COVID-19 study performance prediction
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Understanding_Digital_Forensics_Presentation.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Approach and Philosophy of On baking technology
Unlocking AI with Model Context Protocol (MCP)
Cloud computing and distributed systems.
Per capita expenditure prediction using model stacking based on satellite ima...
Spectroscopy.pptx food analysis technology
Diabetes mellitus diagnosis method based random forest with bat algorithm
Machine learning based COVID-19 study performance prediction
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Programs and apps: productivity, graphics, security and other tools
Dropbox Q2 2025 Financial Results & Investor Presentation
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Building Integrated photovoltaic BIPV_UPV.pdf
Electronic commerce courselecture one. Pdf
MIND Revenue Release Quarter 2 2025 Press Release
Build a system with the filesystem maintained by OSTree @ COSCUP 2025

Easy Object Storage Import/Export Using the S3 Connector on Jetstream

  • 1. Easy Object Storage Import/Export Using the S3 Storage Connector Lee Liming (LLiming@uchicago.edu) University of Chicago funded by the National Science Foundation Award #ACI-1445604
  • 2. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ Cloud, specialized for the research community Researchers browse VM images saved by other researchers – Web interface provided by CyVerse – Share fully-installed systems with applications ready to go – OpenStack API/CLI access for advanced users Images can be launched on VMs scaled for research – 1-44 vCPUs – 2-120 GB RAM – 8-480GB disk Use is allocated by NSF/XSEDE
  • 3. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ Jetstream’s object storage (Swift/Ceph) • 1.25 PB usable storage (If all of JS was object store, no VMs, volumes, etc. Of course, it’s not…) • Access via Swift APIs (and any tools that use them) – Horizon web app – OpenStack CLI • AWS S3-compatible • A place to learn & evaluate before the commercial cloud
  • 4. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ Research data starts & lives at home • Campus systems or research facilities/labs – Data often comes from facilities/labs/observatories – Results are often needed elsewhere – Publishing requirements are becoming important • Very often, available via Globus • So…how do we get it from there into Jetstream’s object store for use via Ceph, Swift, S3, etc? – Must be: easy, fast, reliable • (Jetstream to “next step location” is also valuable…)
  • 5. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ Fire-and-forget transfers into Jetstream buckets
  • 6. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ So how does this work? • Globus Connect Server (the server version of Globus) offers storage connectors to subscribers, including an AWS S3 connector. • The AWS S3 connector works equally well with OpenStack Swift’s S3 dialect.
  • 7. Setting it up 1. Create a VM w/floating IP. 2. Install GCSv5.2 w/S3 connector. 3. Create an S3 storage gateway requiring an XSEDE ID. 4. Use Globus web app to create a collection and supply S3-style credentials. 5. Use the collection to transfer data in or out. 6. Shelve VM until needed again.
  • 8. Credential management When the GCSv5 server admin does this… …it allows users to do this.
  • 9. funded by the National Science Foundation Award #ACI-1445604 http://jetstream-cloud.org/ Load dataset from EBI into a Jetstream bucket