SlideShare a Scribd company logo
Architecture and Performance of Runtime Environments for Data Intensive Scalable ComputingSC09 Doctoral Symposium,  Portland, 11/18/2009Student: Jaliya EkanayakeAdvisor: Prof. Geoffrey FoxCommunity Grids Laboratory, Digital Science CenterPervasive Technology InstituteIndiana University
Cloud Runtimes for Data/Compute Intensive ApplicationsCloud RuntimesMapReduce Dryad/DryadLINQSector/Sphere Moving Computation to  DataSimple communication topologiesMapReduceDirected Acyclic Graphs (DAG)sDistributed File SystemsFault ToleranceData/Compute intensive Applications
Represented as filter pipelines
Parallelizable filtersApplications using Hadoop and DryadLINQ (1)Input files (FASTA)CAP3 [1] - Expressed Sequence Tag assembly  to re-construct full-length mRNACAP3CAP3CAP3DryadLINQOutput files“Map only” operation in HadoopSingle “Select” operation in DryadLINQ[1] X. Huang, A. Madan, “CAP3: A DNA Sequence Assembly Program,” Genome Research, vol. 9, no. 9, pp. 868-877, 1999.
Applications using Hadoop and DryadLINQ (2)PhyloD [1]project from Microsoft ResearchDerive associations between HLA alleles and HIV codons and between codons themselvesDryadLINQ  implementation[1] Microsoft Computational Biology Web Tools, http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/
Applications using Hadoop and DryadLINQ (3)125 million distances4 hours & 46 minutesCalculate  Pairwise Distances (Smith Waterman Gotoh)Calculate pairwise distances for a collection of genes (used for clustering, MDS)Fine grained tasks in MPICoarse grained tasks in DryadLINQPerformed on 768 cores (Tempest Cluster)
Applications using Hadoop and DryadLINQ (4)High Energy Physics (HEP)
K-Means Clustering
Matrix Multiplication
Multi-Dimensional Scaling (MDS)MapReduce for Iterative ComputationsClassic MapReduce RuntimesGoogle, Apache Hadoop, Sector/Sphere, DryadLINQ (DAG based)Focus on Single Step MapReduce computations onlyIntermediate data is stored and accessed via file systemsBetter fault tolerance supportHigher latenciesIterative MapReduce computations uses new maps/reducesin each iterationFixed data is loaded again and againInefficient for many iterative computations to which the MapReduce technique could be appliedSolution: i-MapReduce
Applications & Different Interconnection PatternsInputmapiterationsInputInputmapmapOutputPijreducereduceMPIDomain of MapReduce and Iterative Extensions
i-MapReduceIn-memory MapReduce
Distinction on static data and variable data (data flow vs. δ flow)
Cacheable map/reduce tasks (long running tasks)
Combine operation
Support fast intermediate data transfersStaticdataConfigure()IterateUser Programδ flowMap(Key, Value)  Reduce (Key, List<Value>) Close()Combine (Key, List<Value>)Different synchronization and intercommunication mechanisms used by the parallel runtimes
i-MapReduceProgramming ModelrunMapReduce()  IterationsWorker NodesconfigureMaps()Local DiskconfigureReduce()Cacheable map/reduce taskswhile(condition){Can send <Key,Value> pairs directlyMap()Reduce()Combine() operationCommunications/data transfers via the pub-sub broker networkupdateCondition()Two configuration options :Using local disks (only for maps)Using pub-sub bus } //end whileclose()User program’s process space
i-MapReduceArchitecturePub/Sub Broker NetworkMap WorkerMWorker NodesReduce WorkerDMRDriverUserProgramDRMMMMMRDeamonDRRRRData Read/WriteFile SystemCommunicationData SplitStreaming based communication
Eliminates file based communication
Cacheable map/reduce tasks
Static data remains in memory

More Related Content

PPTX
Scalable Parallel Computing on Clouds
PPTX
High Performance Parallel Computing with Clouds and Cloud Technologies
PPTX
HPC with Clouds and Cloud Technologies
PPTX
Applications of PARALLEL PROCESSING
PPT
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
PPT
Chapter 1 pc
PDF
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
PDF
Accelerating Real Time Applications on Heterogeneous Platforms
Scalable Parallel Computing on Clouds
High Performance Parallel Computing with Clouds and Cloud Technologies
HPC with Clouds and Cloud Technologies
Applications of PARALLEL PROCESSING
Improving Efficiency of Machine Learning Algorithms using HPCC Systems
Chapter 1 pc
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
Accelerating Real Time Applications on Heterogeneous Platforms

What's hot (20)

PPT
Parallel Computing 2007: Bring your own parallel application
PPTX
Plenzogan technology
PDF
Efficient load rebalancing for distributed file system in Clouds
PPT
Migration To Multi Core - Parallel Programming Models
PDF
Energy-aware VM Allocation on An Opportunistic Cloud Infrastructure
PDF
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
PDF
Lecture 1 introduction to parallel and distributed computing
PDF
IRJET- Latin Square Computation of Order-3 using Open CL
PPT
Chap3 slides
PDF
Parallelization of Graceful Labeling Using Open MP
PDF
Image transmission in wireless sensor networks
PPTX
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
PDF
FrackingPaper
PDF
Spine net learning scale permuted backbone for recognition and localization
PDF
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
PPTX
Lecture 05 - Chapter 3 - Models of parallel computers and interconnections
PDF
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
PDF
Load Rebalancing for Distributed Hash Tables in Cloud Computing
PPTX
Lecture 04 chapter 2 - Parallel Programming Platforms
PDF
In datacenter performance analysis of a tensor processing unit
Parallel Computing 2007: Bring your own parallel application
Plenzogan technology
Efficient load rebalancing for distributed file system in Clouds
Migration To Multi Core - Parallel Programming Models
Energy-aware VM Allocation on An Opportunistic Cloud Infrastructure
High Performance & High Throughput Computing - EUDAT Summer School (Giuseppe ...
Lecture 1 introduction to parallel and distributed computing
IRJET- Latin Square Computation of Order-3 using Open CL
Chap3 slides
Parallelization of Graceful Labeling Using Open MP
Image transmission in wireless sensor networks
QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing...
FrackingPaper
Spine net learning scale permuted backbone for recognition and localization
Run-Time Adaptive Processor Allocation of Self-Configurable Intel IXP2400 Net...
Lecture 05 - Chapter 3 - Models of parallel computers and interconnections
Achieving Portability and Efficiency in a HPC Code Using Standard Message-pas...
Load Rebalancing for Distributed Hash Tables in Cloud Computing
Lecture 04 chapter 2 - Parallel Programming Platforms
In datacenter performance analysis of a tensor processing unit
Ad

Similar to Architecture and Performance of Runtime Environments for Data Intensive Scalable Computing (20)

PPTX
Qiu bosc2010
PPTX
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
PPTX
Slide 1
PPTX
Slide 1
PPTX
عصر کلان داده، چرا و چگونه؟
PDF
Big data processing systems research
PDF
Hadoop programming
PPTX
Cluster Computing with Dryad
PDF
Google Storage concepts and computing concepts.pdf
DOCX
Hadoop Seminar Report
PPTX
Cluster Computing with Dryad
PPT
Hadoop Tutorial.ppt
PPTX
Distributed computing poli
PPT
Hadoop tutorial
PDF
What is Distributed Computing, Why we use Apache Spark
PPT
Map reducecloudtech
PPTX
Hadoop An Introduction
PPT
Hadoop and Mapreduce Introduction
PDF
module4-cloudcomputing-180131071200.pdf
Qiu bosc2010
Apache Hadoop India Summit 2011 Keynote talk "Programming Abstractions for Sm...
Slide 1
Slide 1
عصر کلان داده، چرا و چگونه؟
Big data processing systems research
Hadoop programming
Cluster Computing with Dryad
Google Storage concepts and computing concepts.pdf
Hadoop Seminar Report
Cluster Computing with Dryad
Hadoop Tutorial.ppt
Distributed computing poli
Hadoop tutorial
What is Distributed Computing, Why we use Apache Spark
Map reducecloudtech
Hadoop An Introduction
Hadoop and Mapreduce Introduction
module4-cloudcomputing-180131071200.pdf
Ad

Recently uploaded (20)

PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Cloud computing and distributed systems.
PDF
Electronic commerce courselecture one. Pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Encapsulation theory and applications.pdf
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
KodekX | Application Modernization Development
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
20250228 LYD VKU AI Blended-Learning.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Chapter 3 Spatial Domain Image Processing.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Cloud computing and distributed systems.
Electronic commerce courselecture one. Pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
MYSQL Presentation for SQL database connectivity
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Unlocking AI with Model Context Protocol (MCP)
Building Integrated photovoltaic BIPV_UPV.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Encapsulation theory and applications.pdf
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Per capita expenditure prediction using model stacking based on satellite ima...
KodekX | Application Modernization Development
Profit Center Accounting in SAP S/4HANA, S4F28 Col11

Architecture and Performance of Runtime Environments for Data Intensive Scalable Computing

  • 1. Architecture and Performance of Runtime Environments for Data Intensive Scalable ComputingSC09 Doctoral Symposium, Portland, 11/18/2009Student: Jaliya EkanayakeAdvisor: Prof. Geoffrey FoxCommunity Grids Laboratory, Digital Science CenterPervasive Technology InstituteIndiana University
  • 2. Cloud Runtimes for Data/Compute Intensive ApplicationsCloud RuntimesMapReduce Dryad/DryadLINQSector/Sphere Moving Computation to DataSimple communication topologiesMapReduceDirected Acyclic Graphs (DAG)sDistributed File SystemsFault ToleranceData/Compute intensive Applications
  • 4. Parallelizable filtersApplications using Hadoop and DryadLINQ (1)Input files (FASTA)CAP3 [1] - Expressed Sequence Tag assembly to re-construct full-length mRNACAP3CAP3CAP3DryadLINQOutput files“Map only” operation in HadoopSingle “Select” operation in DryadLINQ[1] X. Huang, A. Madan, “CAP3: A DNA Sequence Assembly Program,” Genome Research, vol. 9, no. 9, pp. 868-877, 1999.
  • 5. Applications using Hadoop and DryadLINQ (2)PhyloD [1]project from Microsoft ResearchDerive associations between HLA alleles and HIV codons and between codons themselvesDryadLINQ implementation[1] Microsoft Computational Biology Web Tools, http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/
  • 6. Applications using Hadoop and DryadLINQ (3)125 million distances4 hours & 46 minutesCalculate Pairwise Distances (Smith Waterman Gotoh)Calculate pairwise distances for a collection of genes (used for clustering, MDS)Fine grained tasks in MPICoarse grained tasks in DryadLINQPerformed on 768 cores (Tempest Cluster)
  • 7. Applications using Hadoop and DryadLINQ (4)High Energy Physics (HEP)
  • 10. Multi-Dimensional Scaling (MDS)MapReduce for Iterative ComputationsClassic MapReduce RuntimesGoogle, Apache Hadoop, Sector/Sphere, DryadLINQ (DAG based)Focus on Single Step MapReduce computations onlyIntermediate data is stored and accessed via file systemsBetter fault tolerance supportHigher latenciesIterative MapReduce computations uses new maps/reducesin each iterationFixed data is loaded again and againInefficient for many iterative computations to which the MapReduce technique could be appliedSolution: i-MapReduce
  • 11. Applications & Different Interconnection PatternsInputmapiterationsInputInputmapmapOutputPijreducereduceMPIDomain of MapReduce and Iterative Extensions
  • 13. Distinction on static data and variable data (data flow vs. δ flow)
  • 14. Cacheable map/reduce tasks (long running tasks)
  • 16. Support fast intermediate data transfersStaticdataConfigure()IterateUser Programδ flowMap(Key, Value) Reduce (Key, List<Value>) Close()Combine (Key, List<Value>)Different synchronization and intercommunication mechanisms used by the parallel runtimes
  • 17. i-MapReduceProgramming ModelrunMapReduce() IterationsWorker NodesconfigureMaps()Local DiskconfigureReduce()Cacheable map/reduce taskswhile(condition){Can send <Key,Value> pairs directlyMap()Reduce()Combine() operationCommunications/data transfers via the pub-sub broker networkupdateCondition()Two configuration options :Using local disks (only for maps)Using pub-sub bus } //end whileclose()User program’s process space
  • 18. i-MapReduceArchitecturePub/Sub Broker NetworkMap WorkerMWorker NodesReduce WorkerDMRDriverUserProgramDRMMMMMRDeamonDRRRRData Read/WriteFile SystemCommunicationData SplitStreaming based communication
  • 19. Eliminates file based communication
  • 22. User Program is the composer of MapReduce computations
  • 23. Extends the MapReduce model to iterative computations
  • 25. Assume that static data fits in to distributed memory12/6/2009Jaliya Ekanayake11
  • 26. Applications – Pleasingly ParallelCAP3- Expressed Sequence TaggingInput files (FASTA)CAP3CAP3High Energy Physics (HEP) Data Analysis Output files
  • 27. Applications - IterativePerformance of K-MeansClusteringParallel Overhead of Matrix multiplication
  • 28. Current ResearchVirtualization OverheadApplications more susceptible to latencies (higher communication/computation ratio) => higher overheads under virtualizationHadoop shows 15% performance degradation on a private cloudLatency effect on i-MapReduceis lower compared to MPI due to the coarse grained tasks?Fault Tolerance for i-MapReduceReplicated dataSaving state after n iterations
  • 29. Related WorkGeneral MapReduce References:Google MapReduceApache HadoopMicrosoft DryadLINQPregel : Large-scale graph computing at GoogleSector/SphereAll-PairsSAGA: MapReduceDisco
  • 30. ContributionsProgramming model for iterative MapReduce computationsi-MapReduceimplementationMapReduce algorithms/implementations for a series of scientific applicationsApplicability of cloud runtimes to different classes of data/compute intensive applicationsComparison of cloud runtimes with MPI Virtualization overhead of HPC Applications and Cloud Runtimes
  • 31. PublicationsJaliya Ekanayake, (Advisor: Geoffrey Fox) Architecture and Performance of Runtime Environments for Data Intensive Scalable Computing, Accepted for the Doctoral Showcase, SuperComputing2009.Xiaohong Qiu, Jaliya Ekanayake, Scott Beason, Thilina Gunarathne, Geoffrey Fox, Roger Barga, Dennis Gannon, Cloud Technologies for Bioinformatics Applications, Accepted for publication in 2nd ACM Workshop on Many-Task Computing on Grids and Supercomputers, SuperComputing2009.Jaliya Ekanayake, Atilla Soner Balkir, Thilina Gunarathne, Geoffrey Fox, Christophe Poulain, Nelson Araujo, Roger Barga, DryadLINQ for Scientific Analyses, Accepted for publication in Fifth IEEE International Conference on e-Science (eScience2009), Oxford, UK.Jaliya Ekanayake and Geoffrey Fox, High Performance Parallel Computing with Clouds and Cloud Technologies, First International Conference on Cloud Computing (CloudComp2009), Munich, Germany. – An extended version of this paper goes to a book chapter.Geoffrey Fox, Seung-Hee Bae, Jaliya Ekanayake, Xiaohong Qiu, and Huapeng Yuan, Parallel Data Mining from Multicore to Cloudy Grids, High Performance Computing and Grids workshop, 2008. – An extended version of this paper goes to a book chapter.Jaliya Ekanayake, Shrideep Pallickara, Geoffrey Fox, MapReduce for Data Intensive Scientific Analyses, Fourth IEEE International Conference on eScience, 2008, pp.277-284.Jaliya Ekanayake, Shrideep Pallickara, and Geoffrey Fox, A collaborative framework for scientific data analysis and visualization, Collaborative Technologies and Systems(CTS08), 2008, pp. 339-346.Shrideep Pallickara, Jaliya Ekanayake and Geoffrey Fox, A Scalable Approach for the Secure and Authorized Tracking of the Availability of Entities in Distributed Systems, 21st IEEE International Parallel & Distributed Processing Symposium (IPDPS 2007).
  • 32. AcknowledgementsMy Ph.D. Committee: Prof. Geoffrey FoxProf. Andrew LumsdaineProf. Dennis GannonProf. David LeakeSALSA Team @ IUEspecially: Judy Qiu, Scott Beason, Thilina Gunarathne, Hui LiMicrosoft ResearchRoger BargeChristophe Poulain
  • 34. Parallel Runtimes – DryadLINQ vs. Hadoop

Editor's Notes

  • #12: Currently uses NaradaBrokering, but it is easily extensible to use any other pub/sub message infrastructure such as Apache ActiveMQ.