SlideShare a Scribd company logo
Lessons from the Field:Azure for ScienceRob Gillengillenre@ornl.govrob.gillenfamily.net@argodev
AgendaIntroductionsWhy is ORNL looking at Cloud Computing
Azure in 5 minutesPost-Processing and Data Distribution in the Cloud Using Cloud Computing for Post-Processing
Data hosting/distributionLessons (being) Learned General Lessons
PerformanceOak Ridge National Laboratory is DOE’s largest science and energy labWorld’s most powerful open scientific computing facility
Nation’s largest concentrationof open source materials research
$1.6B budget
4,350 employees
3,900 researchguests annually
$350 million investedin modernization
Nation’s most diverse energy portfolio
Operating the world’s most intense pulsed neutron source
Managing the billion-dollar U.S. ITER projectDelivering science and technologyUltrascale computingEnergy technologiesBioenergyITERNeutron sciencesClimateMaterials at the nanoscaleNational securityNuclear energy
UltrascaleScientific ComputingLeadership Computing Facility:
World’s most powerful open scientific computing facility
Peak speed of 2.33 petaflops (> two thousand trillion calculations/sec)
18,688 nodes, 224,526 compute cores, 299 TB RAM, 10,000 TB Disk
4,352 ft2 floor space
Exascale system by the end of the next decade
Focus on computationally intensive projects of large scale and high scientific impact
Addressing key science and technology issues
Climate
Fusion
Materials
Bioenergy
Home of the 1st and 3rd fastest super computers in the world.The world’s most powerful system for open science
Then Why Look at Cloud Computing???Science Takes Different FormsTight Simulations
Data-Parallelized
Embarrassingly ParallelDearth of Mid-Range Assets256-1,000 cores
1 of many possible solutionsScaling IssuesPower Consumption
Programming Struggles
Fault-ToleranceForward-LookingNext-Generation Problems
Next-Generation ResearchersPrivate(On-Premise)Infrastructure(as a Service)Platform(as a Service)Types of CloudsYou manageApplicationsApplicationsApplicationsYou manageRuntimesRuntimesRuntimesSecurity & IntegrationSecurity & IntegrationSecurity & IntegrationManaged by vendorDatabasesDatabasesDatabasesYou manageServersServersServersManaged by vendorVirtualizationVirtualizationVirtualizationServer HWServer HWServer HWStorageStorageStorageNetworkingNetworkingNetworking
Private(On-Premise)Types of CloudsInfrastructure(as a Service)Platform(as a Service)
Application Services“Dublin”“Velocity”Frameworks“Geneva”SecurityAccess ControlProject “Sydney”ConnectivityService BusSQL Azure Data SyncDataComputeWindows Azure PlatformTable StorageBlob StorageQueueDriveContent Delivery NetworkStorage
Windows Azure ComputeDevelopment, service hosting, & management environment.NET, Java PHP, Python, Ruby, native code (C/C++, Win32, etc.)ASP.NET providers, FastCGI, memcached, MySQL, TomcatFull-trust – supports standard languages and APIsSecure certificate storeManagement API’s, and logging and diagnostics systemsMultiple roles – Web, Worker, Virtual Machine (VHD)Multiple VM sizes1.6 GHz CPU x64, 1.75GB RAM, 100Mbps network, 250GB volatile storageSmall (1X), Medium (2X), Large (4X), X-Large (8X)In-place rolling upgrades, organized by upgrade domainsWalk each upgrade  domain one at a timeCompute
Windows Azure DiagnosticsConfigurable trace, performance counter, Windows event log, IIS log & file bufferingLocal data buffering quota managementQuery & modify from the cloud and from the desktop per role instanceTransfer to storage scheduled & on-demandFilter by data type, verbosity & time rangeCompute
Windows Azure StorageRich data abstractions – tables, blobs, queues, drives, CDNCapacity (100TB), throughput (100MB/sec), transactions (1K req/sec)High accessibilitySupports geo-locationLanguage & platform agnostic REST APIsURL: http://<account>.<store>.core.windows.netClient libraries for .NET, Java, PHP, etc.High durability – data is replicated 3 times within a cluster, and (Feb 2010) across datacentersHigh scalability – data is automatically partitioned and load balanced across serversStorageStorage
Windows Azure Table StorageDesigned for structured data, not relational dataData definition is part of the applicationA Table is a set of Entities (records)An Entity is a set of Properties (fields)No fixed schemaEach property is stored as a <name, typed value> pairTwo entities within the same table can have different propertiesNo schema is enforcedTable Storage
Windows Azure Blob StorageStorage for  large, named files plus their metadataBlock Blob Targeted at streaming workloadsEach blob consists of a sequence of blocksEach block is identified by a Block IDSize limit 200GB per blobPage BlobTargeted at random read/write workloadsEach blob consists of an array of pagesEach page is identified by its offset from the start of the blobSize limit 1TB per blobBlob Storage
Windows Azure QueuePerformance efficient, highly available and provide reliable message deliveryAsynchronous work dispatchInter-role communication Polling based model; best-effort FIFO data structureQueue operationsCreate QueueDelete QueueList QueuesGet/Set Queue MetadataMessage operationsAdd MessageGet Message(s)Peek Message(s)Delete MessageQueue
Windows Azure DriveProvides a durable NTFS volume for Windows Azure applications to useUse existing NTFS APIs to access a durable driveDurability and survival of data on application failover Enables migrating existing NTFS applications to the cloudDrives can be up to 1TB; a VM can dynamically mount up to 8 drivesA Windows Azure Drive is a Page BlobExample, mount Page Blob as X:\http://<account>.blob.core.windows.net/<container>/<blob>All writes to drive are made durable to the Page BlobDrive made durable through standard Page Blob replicationDrive
Windows Azure Content Delivery NetworkProvides high-bandwidth global blob content delivery18 locations globally (US, Europe, Asia, Australia and South America), and growingBlob service URL vs. CDN URLBlob URL: http://<account>.blob.core.windows.net/CDN URL: http://<guid>.vo.msecnd.net/ Support for custom domain namesAccess detailsBlobs are cached in CDN until the TTL passesUse per-blob HTTP Cache-Control policy for TTL (new)CDN provides only anonymous HTTP accessContent Delivery Network
Tenants of Internet-Scale Application ArchitectureDesignHorizontal scaling
Service-oriented composition

More Related Content

PPTX
Windows Azure: Lessons From The Field
PPTX
NoSQL Introduction
PPTX
Mongodb - NoSql Database
PPT
Windows Azure and a little SQL Data Services
PDF
Microsoft R - Data Science at Scale
PPT
Lecture 40 1
PPTX
The CIOs Guide to NoSQL
PPT
NoSql Databases
Windows Azure: Lessons From The Field
NoSQL Introduction
Mongodb - NoSql Database
Windows Azure and a little SQL Data Services
Microsoft R - Data Science at Scale
Lecture 40 1
The CIOs Guide to NoSQL
NoSql Databases

What's hot (20)

PPT
NoSQL Slideshare Presentation
PDF
AWS Athena vs. Google BigQuery for interactive SQL Queries
PDF
CouchDB
PDF
A Rusty introduction to Apache Arrow and how it applies to a time series dat...
PPTX
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
PDF
From sql server to mongo db
PPTX
MongoDB Best Practices for Developers
PDF
Migrating to postgresql
PPTX
Google BigQuery 101 & What’s New
PDF
Imply at Apache Druid Meetup in London 1-15-20
PPTX
Getting Started with MongoDB Using the Microsoft Stack
PPTX
Couchbase Data Platform | Big Data Demystified
PDF
Hudi: Large-Scale, Near Real-Time Pipelines at Uber with Nishith Agarwal and ...
PPTX
Introduction to NoSQL
PDF
Engineering practices in big data storage and processing
PDF
Easy, scalable, fault tolerant stream processing with structured streaming - ...
PPTX
Introduction to Google BigQuery
PPTX
Building Spring Data with MongoDB
PPTX
EclipseCon 2021 NoSQL Endgame
PPTX
Best practices on Building a Big Data Analytics Solution (SQLBits 2018 Traini...
NoSQL Slideshare Presentation
AWS Athena vs. Google BigQuery for interactive SQL Queries
CouchDB
A Rusty introduction to Apache Arrow and how it applies to a time series dat...
IMC Summit 2016 Breakout - William Bain - Implementing Extensible Data Struct...
From sql server to mongo db
MongoDB Best Practices for Developers
Migrating to postgresql
Google BigQuery 101 & What’s New
Imply at Apache Druid Meetup in London 1-15-20
Getting Started with MongoDB Using the Microsoft Stack
Couchbase Data Platform | Big Data Demystified
Hudi: Large-Scale, Near Real-Time Pipelines at Uber with Nishith Agarwal and ...
Introduction to NoSQL
Engineering practices in big data storage and processing
Easy, scalable, fault tolerant stream processing with structured streaming - ...
Introduction to Google BigQuery
Building Spring Data with MongoDB
EclipseCon 2021 NoSQL Endgame
Best practices on Building a Big Data Analytics Solution (SQLBits 2018 Traini...
Ad

Viewers also liked (20)

PPTX
Último Cómputo - Persona 2011
DOCX
Revista
PPTX
الرحمة العالمية | إغاثة اليمن
PDF
Articlefunctional theory
PDF
Leveraging TFS for Driving Process Improvement using Lean Principles
PDF
SME Yalista Job title
PPT
Rider motors ppt
DOCX
Final Report Literature Review
PPTX
Responsivewebdesign part2
PPTX
150210111017
PDF
PRWEEK MSLGROUP Social Media Survey
PDF
MSLGROUP Hispanic Marketing White Paper
PDF
R cuda presentation_ib_features_120704
PPTX
Application Architecture Jumpstart
PPTX
4.john milton and his time
PDF
PTW Water Cube
PPT
Herramientas Web 2.0 y su aplicación en salud
PPTX
Chapter 8 transport in humans
PPT
Paravertebral Cevical Sympathetic Block
PDF
Q2 2010-mobile-video-ad-report final
Último Cómputo - Persona 2011
Revista
الرحمة العالمية | إغاثة اليمن
Articlefunctional theory
Leveraging TFS for Driving Process Improvement using Lean Principles
SME Yalista Job title
Rider motors ppt
Final Report Literature Review
Responsivewebdesign part2
150210111017
PRWEEK MSLGROUP Social Media Survey
MSLGROUP Hispanic Marketing White Paper
R cuda presentation_ib_features_120704
Application Architecture Jumpstart
4.john milton and his time
PTW Water Cube
Herramientas Web 2.0 y su aplicación en salud
Chapter 8 transport in humans
Paravertebral Cevical Sympathetic Block
Q2 2010-mobile-video-ad-report final
Ad

Similar to Azure: Lessons From The Field (20)

PPTX
Azure Platform
PPTX
A Lap Around Azure
PPTX
Understanding the Windows Azure Platform - Dec 2010
PPTX
Understanding The Azure Platform March 2010
PPTX
Azure platform for customers
PPTX
MSDN - Converting an existing ASP.NET application to Windows Azure
PPT
Building Cloud-Native Applications with Microsoft Windows Azure
PPTX
Azure, Cloud Computing & Services
PPTX
Windowsazureplatform Overviewlatest
PPTX
Understanding The Azure Platform Jan
PPTX
Windows Azure Platform Overview
PPTX
Creation of cloud application using microsoft azure by vaishali sahare [katkar]
PPTX
Sky High With Azure
PPTX
Microsoft Partner Roadshow - To the Cloud
PPTX
Microsoft Azure
PPTX
Understanding The Azure Platform November 09
PPTX
Building services using windows azure
PPTX
Windows azure camp - Kolkata
PDF
Microsoft azure platforms
PPT
ArcReady - Architecting For The Cloud
Azure Platform
A Lap Around Azure
Understanding the Windows Azure Platform - Dec 2010
Understanding The Azure Platform March 2010
Azure platform for customers
MSDN - Converting an existing ASP.NET application to Windows Azure
Building Cloud-Native Applications with Microsoft Windows Azure
Azure, Cloud Computing & Services
Windowsazureplatform Overviewlatest
Understanding The Azure Platform Jan
Windows Azure Platform Overview
Creation of cloud application using microsoft azure by vaishali sahare [katkar]
Sky High With Azure
Microsoft Partner Roadshow - To the Cloud
Microsoft Azure
Understanding The Azure Platform November 09
Building services using windows azure
Windows azure camp - Kolkata
Microsoft azure platforms
ArcReady - Architecting For The Cloud

More from Rob Gillen (20)

PDF
CodeStock14: Hiding in Plain Sight
PDF
What's in a password
PPTX
How well do you know your runtime
PPTX
Software defined radio and the hacker
PPTX
So whats in a password
PPTX
Hiding in plain sight
PPTX
ETCSS: Into the Mind of a Hacker
PPTX
DevLink - WiFu: You think your wireless is secure?
PPTX
You think your WiFi is safe?
PPTX
Anatomy of a Buffer Overflow Attack
PPTX
Intro to GPGPU with CUDA (DevLink)
PPTX
AWS vs. Azure
PPTX
A Comparison of AWS and Azure - Part2
PPTX
A Comparison of AWS and Azure - Part 1
PPTX
Intro to GPGPU Programming with Cuda
PPTX
Scaling Document Clustering in the Cloud
PPTX
Hands On with Amazon Web Services (StirTrek)
PPTX
Amazon Web Services for the .NET Developer
PPT
05561 Xfer Research 02
PPT
05561 Xfer Research 01
CodeStock14: Hiding in Plain Sight
What's in a password
How well do you know your runtime
Software defined radio and the hacker
So whats in a password
Hiding in plain sight
ETCSS: Into the Mind of a Hacker
DevLink - WiFu: You think your wireless is secure?
You think your WiFi is safe?
Anatomy of a Buffer Overflow Attack
Intro to GPGPU with CUDA (DevLink)
AWS vs. Azure
A Comparison of AWS and Azure - Part2
A Comparison of AWS and Azure - Part 1
Intro to GPGPU Programming with Cuda
Scaling Document Clustering in the Cloud
Hands On with Amazon Web Services (StirTrek)
Amazon Web Services for the .NET Developer
05561 Xfer Research 02
05561 Xfer Research 01

Recently uploaded (20)

PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Machine learning based COVID-19 study performance prediction
PDF
NewMind AI Monthly Chronicles - July 2025
PPTX
Cloud computing and distributed systems.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Modernizing your data center with Dell and AMD
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Chapter 3 Spatial Domain Image Processing.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
Machine learning based COVID-19 study performance prediction
NewMind AI Monthly Chronicles - July 2025
Cloud computing and distributed systems.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MYSQL Presentation for SQL database connectivity
20250228 LYD VKU AI Blended-Learning.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Modernizing your data center with Dell and AMD
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Review of recent advances in non-invasive hemoglobin estimation
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Digital-Transformation-Roadmap-for-Companies.pptx
“AI and Expert System Decision Support & Business Intelligence Systems”
Network Security Unit 5.pdf for BCA BBA.
Chapter 3 Spatial Domain Image Processing.pdf

Azure: Lessons From The Field