Globus Introduction
Raj Kettimuthu
Steve Tuecke
Vas Vasiliadis
“I need a good place to store /
backup / archive my (big) research
data, at a reasonable price.”
Public Cloud Archive
Mass Store
Campus Store
“I need to easily, quickly, & reliably move or
mirror portions of my data to other places.”
Research Computing HPC Cluster
Lab Server
Campus Home Filesystem
Desktop Workstation
Personal Laptop
XSEDE Resource
Public Cloud
“I need to easily and securely share
my data with my colleagues at other
institutions.”
“I need to get data from a scientific
instrument to my analysis server.”
Next Gen Sequencer
Light Sheet Microscope
MRI
Advanced Light Source
Challenge: Manage research
data as easily as…
…our pictures
…home entertainment
…our e-mail
What is Globus?
Big data transfer, and sharing…
… delivered via SaaS …
… that is simple, secure, and fast…
… directly from your own storage
systems
Reliable, secure, high-performance
file transfer & synchronization
•  “Fire-and-forget” transfers
•  Automatic fault recovery
•  Seamless security integration
•  Powerful GUI and APIs
1.  User initiates transfer request
2.  Globus moves and syncs files from the data source to the data destination
3.  Globus notifies user
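The three-step flow on this slide can be pictured with a small simulation. This is only a sketch of the fire-and-forget idea with automatic retry; it does not call Globus, and `flaky_copy`, `submit_transfer`, the failure rate, and the retry limit are hypothetical names and values chosen for illustration.

```python
import random


def flaky_copy(name, rng, failure_rate=0.3):
    """Stand-in for moving one file; fails at random to mimic network faults."""
    if rng.random() < failure_rate:
        raise ConnectionError(f"transient fault while copying {name}")
    return name


def submit_transfer(files, rng, max_attempts=10):
    """Copy every file, retrying failures, and return a final report.

    The caller submits once and reads the report at the end, which is the
    "fire-and-forget" pattern: no manual restarts in between.
    """
    done, retries = [], 0
    for name in files:
        for _attempt in range(max_attempts):
            try:
                done.append(flaky_copy(name, rng))
                break
            except ConnectionError:
                retries += 1  # automatic fault recovery: try the same file again
        else:
            # Every attempt for this file failed; give up and report it.
            return {"status": "FAILED", "done": done, "retries": retries}
    return {"status": "SUCCEEDED", "done": done, "retries": retries}


if __name__ == "__main__":
    report = submit_transfer(["a.dat", "b.dat", "c.dat"], random.Random(0))
    print(report)  # step 3: the user is notified of the outcome
```

The report dictionary plays the role of the notification in step 3; a real service would deliver it by e-mail or through a status query rather than a return value.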
Simple, secure sharing off existing
storage systems
1.  User A selects file(s) on the data source to share, selects user or group, and sets permissions
2.  Globus tracks shared files; no need to move files to cloud storage!
3.  User B logs in to Globus and accesses shared file
•  Easily share large data with any user or group
•  No cloud storage required
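The sharing steps on this slide amount to attaching access rules to files that stay on their original storage. The sketch below models that idea in memory only; `SharedEndpoint`, `grant`, `can`, the path, and the principal names are all hypothetical and are not Globus API calls.

```python
from dataclasses import dataclass, field


@dataclass
class SharedEndpoint:
    """A folder on existing storage plus the access rules attached to it."""
    host_path: str
    acl: dict = field(default_factory=dict)  # principal -> set of permissions

    def grant(self, principal, permissions):
        """Step 1: the owner picks a user or group and sets permissions."""
        self.acl.setdefault(principal, set()).update(permissions)

    def can(self, user, groups, permission):
        """Step 3: True if the user, or any of the user's groups, holds the permission."""
        principals = [user, *groups]
        return any(permission in self.acl.get(p, set()) for p in principals)


if __name__ == "__main__":
    share = SharedEndpoint("/data/project-x")  # files stay where they are
    share.grant("user_b", {"read"})
    share.grant("group:lab", {"read", "write"})
    print(share.can("user_b", [], "read"))
    print(share.can("user_b", [], "write"))
    print(share.can("user_c", ["group:lab"], "write"))
```

Only the rules are stored by the service in this model; the data itself is never copied, which is the point of step 2.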
Globus is SaaS
•  Web, command line, and REST interfaces
•  Reduced IT operational costs
•  New features automatically available
•  Consolidated support & troubleshooting
•  Easy to add your laptop, server, cluster,
supercomputer, etc. with Globus Connect
8,000
active endpoints
(in the past year)
Introduction to Globus - XSEDE14 Tutorial
Globus increasingly used to build
campus-wide data services
Source: University of Nebraska
Holland Computing Center
Enable campus computing facilities to better utilize high performance network infrastructure
[Figure: Science DMZ network diagram. Labeled elements: WAN, Border Router, Science DMZ Switch/Router, Enterprise Border Router/Firewall, Site / Campus LAN, perfSONAR hosts, and a high-performance Data Transfer Node with high-speed storage, connected by 10GE links. Callouts: per-service security policy control points; clean, high-bandwidth WAN path; Site / Campus access to Science DMZ resources.]
Typical deployment: Science DMZ + Globus
Details at: fasterdata.es.net
Demonstration
Exercise 1: Account Signup
1.  Go to: globus.org/signup
2.  Create your Globus account
3.  Validate your e-mail address
4.  Optional: Log in with your campus/InCommon identity
Exercise 2: Transfer, Sharing, Group Management
1.  Install Globus Connect Personal
2.  Move file(s) from esnet#anl-diskpt1 to your laptop
3.  Sign up for a free Globus Plus trial
4.  Create a shared endpoint on your laptop
5.  Grant your neighbor permissions on your shared endpoint
6.  Access your neighbor’s shared endpoint
7.  Optional: Create a group, and grant it share access
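Endpoint names in this exercise, such as `esnet#anl-diskpt1`, appear to follow an `owner#name` pattern. The helper below is a minimal sketch under that assumption; `Endpoint` and `parse_endpoint` are hypothetical names, not part of any Globus library.

```python
from typing import NamedTuple


class Endpoint(NamedTuple):
    owner: str  # account that published the endpoint (assumed meaning)
    name: str   # endpoint name within that account


def parse_endpoint(text: str) -> Endpoint:
    """Split 'owner#name' into its two parts, rejecting malformed input."""
    owner, sep, name = text.partition("#")
    if not sep or not owner or not name:
        raise ValueError(f"expected 'owner#name', got {text!r}")
    return Endpoint(owner, name)


if __name__ == "__main__":
    print(parse_endpoint("esnet#anl-diskpt1"))
```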
Our challenge:
Sustainability
We are a non-profit, delivering a
production-grade service to the
non-profit research community
Globus Provider Subscriptions
•  Managed Endpoints
–  Priority support
–  Management console
–  Usage reports
–  Mass Storage System optimization
–  Host shared endpoints
–  Integration support
•  Branded Web Site
•  Alternate Identity Provider (InCommon is standard)
globus.org/provider-plans
NET+ Globus
•  Internet2 members get discounted
Globus Provider subscriptions
•  Completing “Service Validation” phase
– Sponsors: Cornell, U.Michigan, Yale,
U.Missouri, and U.Chicago
•  Available to “Early Adopters” soon
Globus Platform-as-a-Service
Identity, Group, Profile
Management Services
…
Sharing Service
Transfer Service
Globus Toolkit
Globus APIs
Globus Connect
Globus Genomics
Flexible, scalable, affordable genomics analysis for all biologists
Data management PaaS + Next-gen sequence analysis SaaS + Scalable IaaS
Globus is moving beyond
transfer and sharing to
data publication and
discovery
Globus Data Publication
(coming soon)
•  SaaS for publishing large research data
•  Bring your own storage
•  Extensible metadata
•  Publication and curation workflows
•  Public and restricted collections
•  Rich discovery model
Enables data to be easily…
•  Identified
•  Described
•  Curated
•  Verifiable
•  Accessible
•  Preserved
…and facilitates rich discovery
•  Search
•  Browse
•  Access
…across collections, endpoints
Globus’ view of data publishing
[Figure labels: Community; Collection, with Policies (Metadata, Access Control, License, Storage, Curation Workflow); three Datasets, each with Data and Metadata]
Exemplar Use Case
[Figure labels: Scientist; Argonne Curator; Shared Endpoint; Argonne Storage System; Univ. of Chicago, Argonne, IIT, UIUC]
1.  Publish Data
2.  Describe Submission
3.  Assemble Dataset (Transfer Data)
4.  Curate Dataset
5.  Search
6.  Download
Demonstration
Globus CLI
Exercise 3: Globus CLI
1.  Optional: Generate SSH key
2.  Go to: globus.org/account/ManageIdentities
3.  Add your SSH key to your Globus identity
4.  SSH to cli.globusonline.org
5.  Check on the status of your earlier transfer(s)
6.  Transfer a file using the scp command
Thank you to our sponsors!
U.S. Department of Energy
