PyTorch data loader bottleneck - PyTorch Forums
discuss.pytorch.org › t › pytorch-data-loaderApr 02, 2021 · My model training is bottlenecked by IO, and I stream data from S3 using AWS wrangler. I only use 1 GPU for my model training. My machine has 8 GPUs, and I found when I run multiple training jobs, e.g. 4 jobs on my machine, the total IO of the machine increased 4 times. This demonstrated that the S3 throughput and network do not bottleneck my IO. It seems that a single data loader has a ...