How Voltron Data's Theseus solves data movement challenge

View profile for Craig Dunham

CEO Voltron Data | Data & AI Infrastructure | HPC on GPUs | Growth Operator | GTM Leader | Real Estate Investor | Kellogg MBA

Moving data at scale is the toughest challenge in accelerated computing. Nvidia is trying to solve. AMD (w/ROCm) is also. Voltron Data's Theseus... ... is architected for data movement. Theseus workers run four specialized, asynchronous executors: Compute, Memory, Pre-Load, and Network. This architecture allows I/O, spill/prefetch, and shuffle to happen in parallel with GPU compute. We prioritize data movement: - we guarantee places for data to live - we prefetch the exact bytes - we make host memory a fast transit lane - our network executor supports TCP and UCX/GPUDirect RDMA, with optional compression Voltron Data's Theseus has solved the data movement problem. [Link to Blog in Comments]

View organization page for Voltron Data

26,736 followers

The race to build a distributed GPU runtime is heating up. 🚀 As Mike Beaumont highlights in our recent blog post, the bottleneck at datacenter scale isn’t FLOPS, it’s data movement. That’s why we built Theseus with a data-movement first design, overlapping compute with I/O and prefetching exact byte ranges. In head-to-head, cost-matched cloud tests, Theseus outperforms Photon by up to 4X and completes 100TB TPC-H/DS with just two DGX A100 640 GB nodes. And it now runs on AMD via ROCm-DS/hipDF, bringing cross-platform performance into the conversation. Read more in our latest blog: https://lnkd.in/gRPGb-U7 #DistributedSystems #GPUs #DataEngineering #ApacheArrow #SQL #AI

  • No alternative text description for this image
Craig Dunham

CEO Voltron Data | Data & AI Infrastructure | HPC on GPUs | Growth Operator | GTM Leader | Real Estate Investor | Kellogg MBA

2w

Read more in our latest blog: https://lnkd.in/gRPGb-U7

Like
Reply

To view or add a comment, sign in

Explore content categories