This document discusses using the DryadLINQ framework to perform data-intensive computing on Windows HPC Server. DryadLINQ allows developers to write LINQ queries over distributed datasets using a declarative programming model. It automatically parallelizes queries by generating execution plans that leverage both intra-node parallelism using PLINQ and inter-node parallelism using the Dryad distributed execution engine. DryadLINQ integrates with .NET and provides type safety while handling serialization, distribution, and failure recovery of queries across large clustered datasets.
Related topics: