GPU Architecture
NVIDIA Ampere
CUDA Cores
1280
Tensor Cores
40 | Gen 3
RT Cores
10 | Gen 2
Peak FP32
4.5 TFLOPS
Peak TF32 Tensor Core
9 TFLOPS | 18 TFLOPS Sparsity
Peak FP16 Tensor Core
18 TFLOPS | 36 TFLOPS Sparsity
INT8
36 TOPS | 72 TOPS Sparsity
INT4
72 TOPS | 144 TOPS Sparsity
GPU Memory