Caching with Dataset - PyTorch Forums
https://discuss.pytorch.org/t/caching-with-dataset/7379519.03.2020 · First, my dataset class does not modify the data loaded (from HDF files, in this case). On the computers I run my models on, there is not enough ram to hold all of the dataset items. To speed up loading, I have been caching up to a specific count. Then, in get_item it just tests if the item is cached, and returns that, or loads the item from disk. However, this means …
Caching with Dataset - PyTorch Forums
discuss.pytorch.org › t › caching-with-datasetMar 19, 2020 · First, my dataset class does not modify the data loaded (from HDF files, in this case). On the computers I run my models on, there is not enough ram to hold all of the dataset items. To speed up loading, I have been caching up to a specific count. Then, in get_item it just tests if the item is cached, and returns that, or loads the item from disk. However, this means that it takes 10+ minutes ...
Dataloader caching on large datasets - PyTorch Forums
discuss.pytorch.org › t › dataloader-caching-onApr 04, 2021 · Hey, I’m training a standard resnet50 classifier on Imagenet dataset, which contains over 1M images and weights 150+ GB. I’m using my own training script, but it’s a basic code using my torch dataloader on top of my own costume dataset. My dataset is simple, in the init function it just saves the path to all the images, and in the getitem function it loads the image from the path (using ...