CUDA semantics — PyTorch 1.10.1 documentation
pytorch.org › docs › stableTensorFloat-32(TF32) on Ampere devices¶. Starting in PyTorch 1.7, there is a new flag called allow_tf32 which defaults to true. This flag controls whether PyTorch is allowed to use the TensorFloat32 (TF32) tensor cores, available on new NVIDIA GPUs since Ampere, internally to compute matmul (matrix multiplies and batched matrix multiplies) and convolutions.