SlideShare a Scribd company logo
Mapping Informatics To the Cloud


                    2012 AIRI Petabyte Challenge
                                  Chris Dagdigian
                               chris@bioteam.net
I‟m Chris.

I‟m an infrastructure
geek.

I work for the
BioTeam.
The “C” Word.
When I say “cloud”
 I‟m talking IaaS.
Amazon AWS
      Is the IaaS cloud.
         Most others are fooling themselves.
(Has-beens, also-rans & delusional marketing zombies)
A message for the
  pretenders…
No APIs?
Not a cloud.
No self-service?
 Not a cloud.
I have to email a human?
       Not a cloud.
~50% failure rate when
provisioning new servers?
    Stupid cloud.
Block storage
and virtual servers only?
    (barely) a cloud;
Private Clouds: My $.02
Private Clouds in 2012:

• Hype vs. Reality ratio still wacky
• Sensible only for certain shops
 •   Have you seen what you have to do to your networks & gear?

• There are easier ways
Private Clouds: My Advice for „12

• Remain cynical (test vendor claims)
• Due Diligence still essential
• I personally would not deploy/buy
  anything that does not explicitly provide
  Amazon API compatibility
Private Clouds: My Advice for „12

• Most people are better off:
  • Adding VM platforms to existing HPC
    clusters & environments
  • Extending enterprise VM platforms to
    allow user self-service & server
    catalogs
Enough Bloviating. Advice time.
Tip #1
HPC & Clouds: Whole New World
• We have spent decades learning
  to tune research HPC systems
  for shared access & many users.

• The cloud upends this model
• Far more common to see …
 • Dedicated cloud resources
   spun up for each app or use case
 • Each system gets individually
   tuned & optimized
Tip #2
Hybrid Clouds & Cloud Bursting
• Lots of aggressive marketing
• Lots of carefully constructed “case
  studies” and prototypes
• The truth?
  • Less usable than you‟ve been told
  • Possible? Heck yeah.
  • Practical? Only sometimes.
• Advice
  • Be cynical
  • Demand proof
  • Test carefully
• Still want to do it?
  • Buy it, don‟t build it
   •   Cycle Computing
   •   Univa
   •   BrightComputing
   •   …
• Follow the crowd
• In the real world we see:
 • Separation between local
   and cloud HPC resources
 • Send your work to the
   system most suitable
Tip #3
You can‟t rewrite EVERYTHING.
• Salesfolk will just glibly tell
  you to rewrite your apps so
  you can use whatever big
  data analysis framework
  they happen to be selling
  today
• They have no clue.
• In life science informatics
  we have hundreds of codes
  that will never be rewritten.
• We‟ll be needing them for
  years to come.
• Advice:
 • MapReduceish methods are
   the future for big-data
   informatics
 • It will take years to get there
 • We still have to deal with
   legacy algorithms and codes
• You will need:
 • A process for figuring out
   when it‟s worthwhile to
   rewrite/re-architect
 • Tested cloud strategies for
   handling three use cases
You need 3 cloud
architectures:

 1. Legacy HPC
 2. “Cloudy” HPC
 3. Big Data HPC (Hadoop)
Legacy HPC on the cloud

•       MIT StarCluster
•       http://web.mit.edu/star/cluster/
• This is your baseline
    •     Extend as needed
“Cloudy” HPC

•   Use this method when …
•   It makes sense to rewrite or
    rearchitect an HPC workflow to
    better leverage modern cloud
    capabilities
“Cloudy” HPC, continued

•       Ditch the legacy compute farm model
•       Leverage elastic scale-out tools (***)
    •     Spot Instances for elastic & cheap compute
    •     SimpleDB for job statekeeping
    •     SQS for job queues & workrflow “glue”
    •     SNS for message passing & monitoring
    •     S3 for input & output data
    •     Etc.
Big Data HPC

•   It‟s gonna be a MapReduce world
•   Little need to roll your own
•   Ecosystem already healthy
•   Multiple providers today
•   Often a slam-dunk cloud use case
Tip #4
The Cloud was not designed for
            “us”
• HPC is an edge case for the
  hyperscale IaaS clouds
• We need to deal with this
  and engineer around it.
• Many examples
  • Eventual consistency
  • Networking & subnets
  • Latency
  • Node placement
• Advice
  • Manage expectations
  • Benchmark & test
  • Evangelize
   • (pester the cloud sales reps …)
Tip #5
Data Movement Is Still Hard
• Consistently getting easier
  • Amazon is not a
    bottleneck
  • AWS Import/Export
  • AWS Direct Connect
  • Aspera has some amazing
    stuff out right now
• Advice
 • AWS Import/Export works well
 • Size of pipe is not everything
 • Sweat the small stuff
   • Tracking, checksums, disk speed
   • Dedicated workstations
   • Secure media storage
Dedicated data movement station
„naked‟ Terabyte-scale data movement
Don‟t overlook media storage …
• Advice for 2012
 • BioTeam is dialing down our
    advocacy of physical data
    ingestion into the cloud
 • Why?
   • Operationally hard, expensive
      and no longer strictly needed
Real world cross-country
internet-based data movement




                        March
                        2012
700Mb/sec into Amazon, stress-free & zero tuning




                                    March
                                    2012
• People trying to move data via
  physical media quickly realize the
  operational difficulties
• Bandwidth is cheaper than hiring
  another body to manage physical
  data ingestion & movement
• In 2012 we strongly recommend
  network-based data movement
  when at all possible
u r doing it wrong
cool data movement, bro!
Tips #6 & 7
Cloud storage. Still slow.
Big shared storage. Still hard.
• Not much we can do except
  engineer around it
• AWS compute cluster
  instances are a huge step
  forward
• AWS competitors take note
• We are not database nerds
• We care about more than
   just random IO performance
• We need it all
  • Random I/O
  • Long sequential read/write
• Faster Storage Options
  • Software RAID on EBS
  • Various GlusterFS options
• Even if you optimize
   everything, the virtual NICs
   are still a bottleneck
• Big Shared Storage
  • 10GbE nodes and NFS
  • Software RAID sets
  • GlusterFS or similar
  • 2012: pNFS finally?
Tip #8
Things fail differently in the cloud.
• Stuff breaks
• It breaks in weird ways
• Transient/temporary issues
  more common than what we
  see “at home”
• Advice
  • Pessimism is good
  • Design for failure
  • Think hard about
    • How will you detect?
    • How will you respond?
• Advice
  • Remove humans from
    loop
  • Automate recovery
  • Automate your backups
Tip #9
Serial/batch computing at-scale
• Loosely coupled workflows
  are ideal
• Break the pipeline into
  discrete components
• Components should be able
  to scale up|down
  independently
• Component = Opportunity to:
 • … Make a scaling decision
   •   (# nodes in use)
 • … Make sizing decision
   •   (instance type in use)
Nirvana is …
… independent loosely
connected components that
can self-scale and
communicate asynchronously
Advice:
• Many people already doing
  this
• Best practices are well known
• Steal from the best:
  • RightScale, Opscode &
    Cycle Computing
Phew. Think I‟m done now.
Questions?
       Slides available at
http://slideshare.net/chrisdag/
End;
Backup Slides
Private Clouds: Pick Your Poison

• OpenStack - http://openstack.org
  • Pro: Super smart developers;
    significant mindshare; True
    Open Source
  • Con: Commitment to AWS API
    compatibility (?) & stability
Private Clouds: Pick Your Poison

• CloudStack- http://cloudstack.org
  • Pro: Explicit AWS API support;
    very recent move away from
    “open-core” model; usability
  • Con: Developer mindshare?
    Sudden switch to Apache
Private Clouds: Pick Your Poison

• Eucalyptus- http://eucalyptus.com
  • Pro: Direct AWS API
    compatibility; lots of hypervisor
    support
  • Con: Open-core model;
    mindshare; Recent ressurection

More Related Content

PDF
Multi-Tenant Pharma HPC Clusters
PDF
BioIT World 2016 - HPC Trends from the Trenches
PPTX
BioIT Trends - 2014 Internet2 Technology Exchange
PDF
2015 CDC Workshop on ScienceDMZ
PPTX
Taming Big Science Data Growth with Converged Infrastructure
PPTX
2015 Bio-IT Trends From the Trenches
PDF
2013: Trends from the Trenches
PDF
2014 BioIT World - Trends from the trenches - Annual presentation
Multi-Tenant Pharma HPC Clusters
BioIT World 2016 - HPC Trends from the Trenches
BioIT Trends - 2014 Internet2 Technology Exchange
2015 CDC Workshop on ScienceDMZ
Taming Big Science Data Growth with Converged Infrastructure
2015 Bio-IT Trends From the Trenches
2013: Trends from the Trenches
2014 BioIT World - Trends from the trenches - Annual presentation

What's hot (20)

PPTX
Cloud Security for Life Science R&D
PDF
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
PDF
Bio-IT & Cloud Sobriety: 2013 Beyond The Genome Meeting
PDF
Bio-IT for Core Facility Managers
PPTX
2015 04 bio it world
PPTX
Cloud Sobriety for Life Science IT Leadership (2018 Edition)
PPTX
Trends from the Trenches: 2019
PDF
Bio-IT Trends From The Trenches (digital edition)
PPTX
Practical Petabyte Pushing
PDF
Disruptive Innovation: how do you use these theories to manage your IT?
PDF
Big Data and Bad Analogies
PDF
Lean approach to IT development
PDF
Everything has changed except us
PPTX
2016 05 sanger
PDF
BioTeam Trends from the Trenches - NIH, April 2014
PDF
frog IoT Big Design IoT World Congress 2015
PDF
Mini-course "Practices of the Web Giants" at Global Code - São Paulo
PDF
Humans in a loop: Jupyter notebooks as a front-end for AI
KEY
Trends from the Trenches (Singapore Edition)
PPTX
Innovation med big data – chr. hansens erfaringer
Cloud Security for Life Science R&D
Facilitating Collaborative Life Science Research in Commercial & Enterprise E...
Bio-IT & Cloud Sobriety: 2013 Beyond The Genome Meeting
Bio-IT for Core Facility Managers
2015 04 bio it world
Cloud Sobriety for Life Science IT Leadership (2018 Edition)
Trends from the Trenches: 2019
Bio-IT Trends From The Trenches (digital edition)
Practical Petabyte Pushing
Disruptive Innovation: how do you use these theories to manage your IT?
Big Data and Bad Analogies
Lean approach to IT development
Everything has changed except us
2016 05 sanger
BioTeam Trends from the Trenches - NIH, April 2014
frog IoT Big Design IoT World Congress 2015
Mini-course "Practices of the Web Giants" at Global Code - São Paulo
Humans in a loop: Jupyter notebooks as a front-end for AI
Trends from the Trenches (Singapore Edition)
Innovation med big data – chr. hansens erfaringer
Ad

Similar to Mapping Life Science Informatics to the Cloud (20)

PPTX
4 C’s for Using Cloud to Support Scientific Research
PDF
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
PPTX
2019 BioIt World - Post cloud legacy edition
PPT
Cloud introduction2.ppt
PDF
Cloud Computing Berkeley.pdf
PDF
Bio-IT Asia 2013: Informatics & Cloud - Best Practices & Lessons Learned
PPTX
Building a Just-in-Time Application Stack for Analysts
PPTX
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
PDF
CloudCamp
PPT
Cloud Computing Realities - Getting past the hype and setting your cloud stra...
PDF
Cloud Computing .ppt
PDF
ppt2.pdf
KEY
2012: Trends from the Trenches
PDF
Migrating to Public Cloud
PPT
CloudCamp London 3 - 451 Group - William Fellows
PPT
Cloud introduction
PPTX
Making Sense of the Cloud
PDF
Scientific Cloud Computing: Present & Future
PPTX
Cloud computing and its job opportunities
PDF
Amazon Ec2 Application Design
4 C’s for Using Cloud to Support Scientific Research
RightScale Webinar: Operationalize Your Enterprise AWS Usage Through an IT Ve...
2019 BioIt World - Post cloud legacy edition
Cloud introduction2.ppt
Cloud Computing Berkeley.pdf
Bio-IT Asia 2013: Informatics & Cloud - Best Practices & Lessons Learned
Building a Just-in-Time Application Stack for Analysts
The Fast Path to Building a Private Cloud (With Guest Speaker from Forrester ...
CloudCamp
Cloud Computing Realities - Getting past the hype and setting your cloud stra...
Cloud Computing .ppt
ppt2.pdf
2012: Trends from the Trenches
Migrating to Public Cloud
CloudCamp London 3 - 451 Group - William Fellows
Cloud introduction
Making Sense of the Cloud
Scientific Cloud Computing: Present & Future
Cloud computing and its job opportunities
Amazon Ec2 Application Design
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Machine learning based COVID-19 study performance prediction
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Approach and Philosophy of On baking technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPT
Teaching material agriculture food technology
Cloud computing and distributed systems.
Review of recent advances in non-invasive hemoglobin estimation
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
NewMind AI Weekly Chronicles - August'25 Week I
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
MYSQL Presentation for SQL database connectivity
Machine learning based COVID-19 study performance prediction
Advanced Soft Computing BINUS July 2025.pdf
Electronic commerce courselecture one. Pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Approach and Philosophy of On baking technology
Unlocking AI with Model Context Protocol (MCP)
Per capita expenditure prediction using model stacking based on satellite ima...
Reach Out and Touch Someone: Haptics and Empathic Computing
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Teaching material agriculture food technology

Mapping Life Science Informatics to the Cloud

  • 1. Mapping Informatics To the Cloud 2012 AIRI Petabyte Challenge Chris Dagdigian chris@bioteam.net
  • 2. I‟m Chris. I‟m an infrastructure geek. I work for the BioTeam.
  • 4. When I say “cloud” I‟m talking IaaS.
  • 5. Amazon AWS Is the IaaS cloud. Most others are fooling themselves. (Has-beens, also-rans & delusional marketing zombies)
  • 6. A message for the pretenders…
  • 7. No APIs? Not a cloud.
  • 9. I have to email a human? Not a cloud.
  • 10. ~50% failure rate when provisioning new servers? Stupid cloud.
  • 11. Block storage and virtual servers only? (barely) a cloud;
  • 13. Private Clouds in 2012: • Hype vs. Reality ratio still wacky • Sensible only for certain shops • Have you seen what you have to do to your networks & gear? • There are easier ways
  • 14. Private Clouds: My Advice for „12 • Remain cynical (test vendor claims) • Due Diligence still essential • I personally would not deploy/buy anything that does not explicitly provide Amazon API compatibility
  • 15. Private Clouds: My Advice for „12 • Most people are better off: • Adding VM platforms to existing HPC clusters & environments • Extending enterprise VM platforms to allow user self-service & server catalogs
  • 18. HPC & Clouds: Whole New World
  • 19. • We have spent decades learning to tune research HPC systems for shared access & many users. • The cloud upends this model
  • 20. • Far more common to see … • Dedicated cloud resources spun up for each app or use case • Each system gets individually tuned & optimized
  • 22. Hybrid Clouds & Cloud Bursting
  • 23. • Lots of aggressive marketing • Lots of carefully constructed “case studies” and prototypes • The truth? • Less usable than you‟ve been told • Possible? Heck yeah. • Practical? Only sometimes.
  • 24. • Advice • Be cynical • Demand proof • Test carefully
  • 25. • Still want to do it? • Buy it, don‟t build it • Cycle Computing • Univa • BrightComputing • …
  • 26. • Follow the crowd • In the real world we see: • Separation between local and cloud HPC resources • Send your work to the system most suitable
  • 28. You can‟t rewrite EVERYTHING.
  • 29. • Salesfolk will just glibly tell you to rewrite your apps so you can use whatever big data analysis framework they happen to be selling today
  • 30. • They have no clue.
  • 31. • In life science informatics we have hundreds of codes that will never be rewritten. • We‟ll be needing them for years to come.
  • 32. • Advice: • MapReduceish methods are the future for big-data informatics • It will take years to get there • We still have to deal with legacy algorithms and codes
  • 33. • You will need: • A process for figuring out when it‟s worthwhile to rewrite/re-architect • Tested cloud strategies for handling three use cases
  • 34. You need 3 cloud architectures: 1. Legacy HPC 2. “Cloudy” HPC 3. Big Data HPC (Hadoop)
  • 35. Legacy HPC on the cloud • MIT StarCluster • http://web.mit.edu/star/cluster/ • This is your baseline • Extend as needed
  • 36. “Cloudy” HPC • Use this method when … • It makes sense to rewrite or rearchitect an HPC workflow to better leverage modern cloud capabilities
  • 37. “Cloudy” HPC, continued • Ditch the legacy compute farm model • Leverage elastic scale-out tools (***) • Spot Instances for elastic & cheap compute • SimpleDB for job statekeeping • SQS for job queues & workrflow “glue” • SNS for message passing & monitoring • S3 for input & output data • Etc.
  • 38. Big Data HPC • It‟s gonna be a MapReduce world • Little need to roll your own • Ecosystem already healthy • Multiple providers today • Often a slam-dunk cloud use case
  • 40. The Cloud was not designed for “us”
  • 41. • HPC is an edge case for the hyperscale IaaS clouds • We need to deal with this and engineer around it.
  • 42. • Many examples • Eventual consistency • Networking & subnets • Latency • Node placement
  • 43. • Advice • Manage expectations • Benchmark & test • Evangelize • (pester the cloud sales reps …)
  • 45. Data Movement Is Still Hard
  • 46. • Consistently getting easier • Amazon is not a bottleneck • AWS Import/Export • AWS Direct Connect • Aspera has some amazing stuff out right now
  • 47. • Advice • AWS Import/Export works well • Size of pipe is not everything • Sweat the small stuff • Tracking, checksums, disk speed • Dedicated workstations • Secure media storage
  • 50. Don‟t overlook media storage …
  • 51. • Advice for 2012 • BioTeam is dialing down our advocacy of physical data ingestion into the cloud • Why? • Operationally hard, expensive and no longer strictly needed
  • 52. Real world cross-country internet-based data movement March 2012
  • 53. 700Mb/sec into Amazon, stress-free & zero tuning March 2012
  • 54. • People trying to move data via physical media quickly realize the operational difficulties • Bandwidth is cheaper than hiring another body to manage physical data ingestion & movement • In 2012 we strongly recommend network-based data movement when at all possible
  • 55. u r doing it wrong
  • 59. Big shared storage. Still hard.
  • 60. • Not much we can do except engineer around it • AWS compute cluster instances are a huge step forward • AWS competitors take note
  • 61. • We are not database nerds • We care about more than just random IO performance • We need it all • Random I/O • Long sequential read/write
  • 62. • Faster Storage Options • Software RAID on EBS • Various GlusterFS options • Even if you optimize everything, the virtual NICs are still a bottleneck
  • 63. • Big Shared Storage • 10GbE nodes and NFS • Software RAID sets • GlusterFS or similar • 2012: pNFS finally?
  • 65. Things fail differently in the cloud.
  • 66. • Stuff breaks • It breaks in weird ways • Transient/temporary issues more common than what we see “at home”
  • 67. • Advice • Pessimism is good • Design for failure • Think hard about • How will you detect? • How will you respond?
  • 68. • Advice • Remove humans from loop • Automate recovery • Automate your backups
  • 71. • Loosely coupled workflows are ideal • Break the pipeline into discrete components • Components should be able to scale up|down independently
  • 72. • Component = Opportunity to: • … Make a scaling decision • (# nodes in use) • … Make sizing decision • (instance type in use)
  • 74. … independent loosely connected components that can self-scale and communicate asynchronously
  • 75. Advice: • Many people already doing this • Best practices are well known • Steal from the best: • RightScale, Opscode & Cycle Computing
  • 76. Phew. Think I‟m done now.
  • 77. Questions? Slides available at http://slideshare.net/chrisdag/
  • 78. End;
  • 80. Private Clouds: Pick Your Poison • OpenStack - http://openstack.org • Pro: Super smart developers; significant mindshare; True Open Source • Con: Commitment to AWS API compatibility (?) & stability
  • 81. Private Clouds: Pick Your Poison • CloudStack- http://cloudstack.org • Pro: Explicit AWS API support; very recent move away from “open-core” model; usability • Con: Developer mindshare? Sudden switch to Apache
  • 82. Private Clouds: Pick Your Poison • Eucalyptus- http://eucalyptus.com • Pro: Direct AWS API compatibility; lots of hypervisor support • Con: Open-core model; mindshare; Recent ressurection