The document proposes designs for cost-effective and scalable merge sorter trees on FPGAs. It summarizes existing work on merge sorter trees and single sort cell (SSC) architectures. It then proposes four designs: 1) a baseline SSC design, 2) a critical path optimized design for smaller trees, 3) a design using BRAMs for record management for larger trees, and 4) a combination of designs 1 and 3. An evaluation shows the combination design achieves 52.4x lower slice usage and 1.23x higher throughput compared to the baseline, enabling sorting of over 4,000 records on an FPGA with low resource usage.
Related topics: