From: GPU-Based FFT Computation for Multi-Gigabit WirelessHD Baseband Processing
Method | FFT time (s) |
---|---|
CuFFT with no enhancements | 7 |
CuFFT enhancement #1: | 4.15 |
Large Batch Size | |
CuFFT enhancement #2: | 2.83 |
Page-locked memory + radix-2 | |
CuFFT enhancement #3: | 2.54 |
Asynchronous concurrency (streaming) | |
CuFFT enhancement #4: | 1.34 |
Reduced accuracy (16-bit) | |
CuFFT enhancement #5: | 0.534 |
Reduced accuracy (8-bit) |