Eight AMD FirePro S10000s (16 GPUs) achieve 8 TFLOPS of real-world double-precision compute performance
The folks down in the basement lab at AMD have built the fastest multi-GPU compute server using eight of their flagship compute powerhouse boards, the new AMD FirePro S10000! The whole system consumes a total of 3.6 kilowatts.
This 16-GPU (eight S10000s) Exxact Computing Server provides more than 8 TFLOPS of real-world double-precision computing performance. Even with early drivers, that means you are seeing around 70% of the theoretical peak double-precision floating-point performance of 11.84 TFLOPS (peak single-precision performance is 47.28 TFLOPS!).
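For readers who want to check the arithmetic, here is a minimal Python sketch that works through the numbers quoted above. The function names are made up for illustration, the measured figure is taken as exactly 8.0 TFLOPS (the post only says "more than 8", so this is a lower bound), and the GFLOPS-per-watt line assumes the 3.6 kW figure applies during the DGEMM run, which the post does not state.

```python
# Figures quoted in the post.
NUM_BOARDS = 8               # FirePro S10000 boards (two GPUs each)
PEAK_DP_TFLOPS = 11.84       # theoretical peak, double precision, whole server
PEAK_SP_TFLOPS = 47.28       # theoretical peak, single precision, whole server
MEASURED_DP_TFLOPS = 8.0     # ASSUMPTION: lower bound, post says "more than 8"
POWER_KW = 3.6               # ASSUMPTION: treated as draw during the benchmark


def efficiency(measured_tflops, peak_tflops):
    """Fraction of theoretical peak actually achieved."""
    return measured_tflops / peak_tflops


def per_board(total_tflops, boards=NUM_BOARDS):
    """Split a whole-server figure evenly across the boards."""
    return total_tflops / boards


def gflops_per_watt(tflops, kilowatts):
    """Convert TFLOPS and kW to GFLOPS per watt."""
    return (tflops * 1000.0) / (kilowatts * 1000.0)


if __name__ == "__main__":
    eff = efficiency(MEASURED_DP_TFLOPS, PEAK_DP_TFLOPS)
    print(f"DP efficiency (lower bound): {eff:.1%}")
    print(f"Peak DP per board: {per_board(PEAK_DP_TFLOPS):.2f} TFLOPS")
    print(f"Peak SP per board: {per_board(PEAK_SP_TFLOPS):.2f} TFLOPS")
    print(f"DP GFLOPS per watt: {gflops_per_watt(MEASURED_DP_TFLOPS, POWER_KW):.2f}")
```

With exactly 8.0 TFLOPS the ratio comes out a little under 68%, so "around 70%" is consistent with a measured result somewhat above 8 TFLOPS.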
Here’s a look at the DGEMM result and a glimpse at what’s under the hood of the compute server: