Accelerating your research with Microsoft Azure
$$\rho \frac{Dv}{Dt} = -\nabla p + \nabla \cdot \boldsymbol{T} + \boldsymbol{f}$$
Data-intensive Research (fourthparadigm.org)
• Data acquisition & modelling
• Collaboration and visualisation
• Analysis & data mining
• Dissemination & sharing
• Archiving and preserving
X-Info
The Generic Problems
• Data ingest
• Managing a petabyte
• Common schema
• How to organize it
• How to reorganize it
• How to share with others
• Query and Vis tools
• Building and executing models
• Integrating data and Literature
• Documenting experiments
• Curation and long-term preservation
[Figure: Experiments & Instruments, Simulations, Literature and Other Archives each supply facts; Questions go in, Answers come out.]
Gartner: http://t.co/Co3EK1ERfN
https://www.youtube.com/watch?v=TJTSEPpFZaw
A-series
• 1-16 cores
• 0.75-112 GB RAM
• 20-605 GB HDD
• Up to InfiniBand 40 Gbit/s RDMA network (MPI)
D-series
• 1-16 cores
• 3.5-112 GB RAM
• Up to 800 GB SSD
G-series
• 32 cores
• 468 GB RAM
• 6.5 TB SSD
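As a rough illustration of how the series limits listed above might be used when sizing a job, the Python sketch below filters the three series by required cores, RAM, local disk and RDMA. The names `Series`, `SERIES` and `candidates` are hypothetical, the RDMA flag is set only for the A-series because that is the only place the slide mentions it, and 6.5 TB is taken as 6500 GB. It is a sketch under those assumptions, not an Azure sizing tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Series:
    """Upper limits of one VM series, as listed on the slide."""
    name: str
    max_cores: int
    max_ram_gb: float
    max_disk_gb: float
    disk_type: str
    rdma: bool  # True only where the slide lists an RDMA network


# Figures copied from the slide; 6.5 TB is assumed to mean 6500 GB.
SERIES = [
    Series("A-series", 16, 112, 605, "HDD", True),
    Series("D-series", 16, 112, 800, "SSD", False),
    Series("G-series", 32, 468, 6500, "SSD", False),
]


def candidates(cores, ram_gb, disk_gb=0, need_rdma=False):
    """Return the names of series whose listed maximums cover the request."""
    return [
        s.name
        for s in SERIES
        if s.max_cores >= cores
        and s.max_ram_gb >= ram_gb
        and s.max_disk_gb >= disk_gb
        and (s.rdma or not need_rdma)
    ]


if __name__ == "__main__":
    # An MPI job that needs RDMA between 16-core nodes.
    print(candidates(cores=16, ram_gb=100, need_rdma=True))
    # A large-memory, single-node analysis.
    print(candidates(cores=32, ram_gb=400))
```

The filter only checks the largest size in each series; choosing a specific size within a series would need the full size list, which the slide does not give.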
Parker MacCready: Univ. of Washington
Rob Fatland, Wenming Ye, Nels Oscar: Microsoft Research
Modeling Workflow
• Forcing: Raw Forcing Data → Forcing Data Processed into Standard Format → Model-specific Forcing Files → MODEL → Output
• Observations: Raw Observational Data → Observations Processed into Standard Format → SKILL TEST → Skill Result
• ROMS cluster: 200 cores, 1 week/year
• 2 TB per model year
• Standard Post Processing: 1 week
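To put the figures on the workflow slide into more familiar units, the Python sketch below converts them to core-hours and output volume. It assumes that "1 week/year" means one week of wall-clock time on the 200-core cluster per simulated model year; the function names are hypothetical.

```python
HOURS_PER_WEEK = 7 * 24  # 168 wall-clock hours


def core_hours_per_model_year(cores=200, weeks_per_model_year=1.0):
    """Core-hours needed to simulate one model year.

    Defaults are the slide's figures: 200 cores running for 1 week
    per model year (assumed to be wall-clock time).
    """
    return cores * weeks_per_model_year * HOURS_PER_WEEK


def output_tb(model_years, tb_per_model_year=2.0):
    """Output volume in TB, using the slide's 2 TB per model year."""
    return model_years * tb_per_model_year


if __name__ == "__main__":
    print(core_hours_per_model_year())  # 33600.0
    print(output_tb(10))                # 20.0
```

Under that reading, one model year costs 200 × 168 = 33,600 core-hours, and ten model years produce 20 TB of output before post-processing.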
LiveOcean: Hybrid Architecture
Forcing inputs: Rivers (USGS), Atmosphere (UW WRF), Ocean (HYCOM)
HPC (linux, 150 cores): Forecast NetCDF files
LiveOcean Server
• Post Processing
• Pre-make .png “views”
• Archive NetCDF files
• API for web sites
• Admin.js
• Client.js
Blob Storage: Forecast Copy
Azure Table: Log Info
Science User: python
Admin Website
Client Website
http://mappable.azurewebsites.net/liveocean/
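The "Forecast Copy" and "Log Info" parts of this architecture could be prepared along the following lines. This Python sketch only builds the blob name and the table entity for one forecast file and makes no Azure calls. The naming scheme, the property names other than `PartitionKey` and `RowKey` (which Azure Table storage requires on every entity), and the functions `blob_name` and `log_entity` are all hypothetical, not taken from the LiveOcean code.

```python
from datetime import date, datetime, timezone
from pathlib import PurePosixPath


def blob_name(forecast_day: date, netcdf_path: str) -> str:
    """Hypothetical scheme: one virtual folder per forecast day."""
    return f"{forecast_day:%Y-%m-%d}/{PurePosixPath(netcdf_path).name}"


def log_entity(forecast_day, netcdf_path, size_bytes, when=None):
    """Build a dict shaped like an Azure Table entity for one copied file.

    PartitionKey and RowKey are required by Azure Table storage;
    the remaining properties are illustrative.
    """
    when = when or datetime.now(timezone.utc)
    return {
        "PartitionKey": f"{forecast_day:%Y-%m-%d}",   # group rows by forecast day
        "RowKey": PurePosixPath(netcdf_path).name,    # file name, unique within a day
        "BlobName": blob_name(forecast_day, netcdf_path),
        "SizeBytes": size_bytes,
        "CopiedAt": when.isoformat(),
    }


if __name__ == "__main__":
    entity = log_entity(date(2015, 6, 1), "/data/roms/ocean_his_0002.nc", 1024)
    print(entity["BlobName"])  # 2015-06-01/ocean_his_0002.nc
```

Partitioning by forecast day keeps all log rows for one forecast together, so an admin page can list a day's files with a single partition query. The actual upload and insert would go through an Azure storage client library and are deliberately left out here.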
James Williams, SLAC CIO
ConnectTheDots.io
“The Azure for Research programme has helped
the Marine Institute and our research partners
understand how cloud computing can be used to
advance collaborative marine research including
by making on-demand compute and advanced
analytical data services much more easily
available to virtual research teams.”
Eoin O’Grady, Information Services and Development Manager,
Marine Institute (Ireland)
British Library Labs cloud
analysis of digital catalogues,
including 19th Century books
scanned by Microsoft.
@MechCuratorBot
mechanicalcurator.tumblr.com
Cloud Services
• RaaS: Research collaboration and data lifecycle services
• SaaS: Data management, application services, collaboration tools
• PaaS: Programming abstractions, database support, runtime systems
• IaaS: Virtual machines, reliable storage, provisioning tools, network bandwidth
Research Marketplace
• RaaS: Analytics services and expert consulting
• SaaS: Domain specific applications and data access
• PaaS: Advanced development tools and libraries to SaaS developers
• IaaS: Specially configured virtual machine templates
www.azure4research.com
• Use laptops & desktop computers
• Overwhelmed by data
• Finding analysis ever more difficult; sharing even harder
Azure for Research Russia Special Awards
• 250,000 compute hours, 20 TB storage, machine learning, NoSQL and more…
• Apply by 15 Aug’15 at http://aka.ms/azureresearchrussia
