This document summarizes the results of benchmarking and optimizing Altair HyperWorks RADIOSS simulation software on an HPC cluster. Key findings include:
- The EDR InfiniBand interconnect delivered better performance and scalability than Ethernet or other InfiniBand technologies.
- Increasing the number of CPU cores per node, increasing the simulation time, and enabling hybrid MPI/OpenMP parallelization each improved performance.
- Tuning the MPI configuration, such as the MPI_Allreduce algorithm, provided significant performance gains.
- Single-precision runs were 47% faster than double-precision runs. Higher CPU frequencies also increased performance.
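
Percentage figures such as the 47% above depend on how the comparison is defined. The sketch below shows one common convention, which treats "faster" as a gain in throughput (runs completed per unit time) derived from elapsed wall-clock times. The summary does not state which convention was used, so this is an assumption. The function names and all timing values are hypothetical placeholders, not measurements from the benchmark.

```python
def percent_faster(baseline_seconds: float, improved_seconds: float) -> float:
    """Throughput gain of the improved run over the baseline, in percent.

    Assumes "X% faster" means the improved run completes (1 + X/100) times
    as many jobs per unit time as the baseline.
    """
    if baseline_seconds <= 0 or improved_seconds <= 0:
        raise ValueError("elapsed times must be positive")
    return (baseline_seconds / improved_seconds - 1.0) * 100.0


def parallel_efficiency(base_nodes: int, base_seconds: float,
                        nodes: int, seconds: float) -> float:
    """Measured speedup divided by ideal (linear) speedup; 1.0 is perfect scaling."""
    if min(base_nodes, nodes) <= 0 or min(base_seconds, seconds) <= 0:
        raise ValueError("node counts and elapsed times must be positive")
    measured_speedup = base_seconds / seconds
    ideal_speedup = nodes / base_nodes
    return measured_speedup / ideal_speedup


if __name__ == "__main__":
    # Hypothetical elapsed times in seconds, chosen only to illustrate the arithmetic.
    double_precision_s = 1470.0
    single_precision_s = 1000.0
    gain = percent_faster(double_precision_s, single_precision_s)
    print(f"single precision is {gain:.1f}% faster than double precision")

    # Hypothetical scaling data: 1 node versus 4 nodes.
    eff = parallel_efficiency(base_nodes=1, base_seconds=1000.0,
                              nodes=4, seconds=300.0)
    print(f"parallel efficiency at 4 nodes: {eff:.2f}")
```

The same two helpers can be applied to any pair of configurations in the findings (interconnect type, cores per node, MPI tuning, CPU frequency), provided the elapsed times come from the same input deck.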