In pre-Volta NVIDIA GPUs, the circuitry for moving data from the register file to the arithmetic ..., Sonic AI
“In pre-Volta NVIDIA GPUs, the circuitry for moving data from the register file to the arithmetic logic unit (ALU) was many times more expensive in terms of area than the ALU itself.”