
The batch size would influence size of snapshot(.pkl)? #18

Open
landian60 opened this issue Apr 5, 2023 · 4 comments

Comments

@landian60

landian60 commented Apr 5, 2023

Hello, thanks for your great work.
I ran the experiment and found that different batch sizes produce checkpoints of different sizes. Does the _fourier_embs_cache item affect the snapshot size? And if so, do training and testing on the same snapshot need to use the same batch size?
Thanks.

@universome
Owner

universome commented Apr 18, 2023

Hi @landian60, could you please provide additional information (e.g., the sizes of the checkpoints)? The batch size could indeed influence the checkpoint size, since we cache the Fourier features, which likely leak into the model's checkpoint due to the persistence_class decorator.
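For illustration, here is a minimal toy sketch of the effect (not the repo's actual layer; the class and default sizes are made up, only the _fourier_embs_cache name comes from the codebase): any tensor stored on a module, whether a registered buffer or a plain attribute picked up by the persistence mechanism, is serialized together with the weights, so a cache shaped [batch, ...] inflates the snapshot in proportion to the training batch size.

```python
import io
import pickle

import torch
import torch.nn as nn


def pickled_size(module: nn.Module) -> int:
    """Size of the module in bytes once pickled (mimicking a snapshot dump)."""
    buf = io.BytesIO()
    pickle.dump(module, buf)
    return buf.getbuffer().nbytes


class ToyLayer(nn.Module):
    """Toy layer that caches per-batch 'Fourier features' on the module itself."""

    def __init__(self, batch_size: int, num_feats: int = 8, resolution: int = 64):
        super().__init__()
        cache = torch.randn(batch_size, num_feats, resolution, resolution)
        # Anything registered on the module is serialized with the weights,
        # so the cache's batch dimension directly scales the file size.
        self.register_buffer('_fourier_embs_cache', cache)


print(pickled_size(ToyLayer(batch_size=16)))  # smaller snapshot
print(pickled_size(ToyLayer(batch_size=24)))  # roughly 1.5x more cache bytes
```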

@landian60
Author

landian60 commented Apr 19, 2023

Thanks for your kind reply, even though the work is almost two years old.
Here are the sizes: if I train with a batch size of 16 per GPU, the checkpoint is 4.86 GB; with a batch size of 24 per GPU, it is 5.91 GB.
I changed the batch size to fully utilize the V100. So it turns out that the cached Fourier features occupy a lot of space, but they wouldn't affect the test process?
Also, could the checkpoint space be saved by caching just one group of Fourier features and repeating it batch-size times along a new dimension?
Thanks!

@landian60
Author

I have another question, about extrapolating outside of the image boundaries.
If I want to change the positional encoding coordinates from [0, 1] to [-0.3, 1.3], should I change the resolution of the logarithmic_basis? But if I do that, its size would no longer match the const_embs.

@universome
Owner

Hi @landian60, you are correct that the checkpoint space could be saved by caching just one group of Fourier features and repeating it along a new batch dimension. I guess my reasoning back then was to cache the Fourier features for the whole batch to avoid additional memory allocation (which I thought could be expensive). To be honest, I do not remember benchmarking this (I only remember benchmarking "caching" vs "no caching"), so you might try it. Also, back then I was not aware of the torch.expand function (which does not allocate new memory); it should be cheaper than torch.repeat in this scenario. That said, since we do a concatenation afterward, there would be new memory allocations/deallocations anyway, so it might not matter much whether you use torch.expand or torch.repeat.
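For reference, a rough sketch of the single-group caching idea (shapes and variable names here are illustrative, not the repo's actual code):

```python
import torch

# Hypothetical cached Fourier features for a single sample: [num_feats, H, W].
single_feats = torch.randn(512, 64, 64)
batch_size = 24

# expand() returns a broadcasted view over the new batch dimension without
# allocating new memory; repeat() physically copies the data batch_size times.
expanded = single_feats.unsqueeze(0).expand(batch_size, -1, -1, -1)
repeated = single_feats.unsqueeze(0).repeat(batch_size, 1, 1, 1)

# Once the features are concatenated with per-sample activations, a new tensor
# is allocated for the result either way, which is why the expand-vs-repeat
# choice may matter little for peak memory at this particular spot.
activations = torch.randn(batch_size, 32, 64, 64)
out = torch.cat([activations, expanded], dim=1)
```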

For extrapolation, you shouldn't change the basis. We didn't use const embeddings when training the generator on bedrooms to perform extrapolation afterwards.
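To illustrate the point, here is a minimal sketch with a generic log-spaced sin/cos basis (not the repo's logarithmic_basis; all names and sizes are assumptions): extrapolation only changes which coordinates the fixed basis is evaluated at, so the feature dimension, and hence compatibility with anything like const_embs, stays the same.

```python
import math

import torch

num_freqs = 8
freqs = 2.0 ** torch.arange(num_freqs)  # log-spaced frequencies, fixed after training


def fourier_encode(coords: torch.Tensor) -> torch.Tensor:
    """coords: [N] positions -> [N, 2 * num_freqs] sin/cos features."""
    angles = 2 * math.pi * coords[:, None] * freqs[None, :]
    return torch.cat([angles.sin(), angles.cos()], dim=1)


inside = fourier_encode(torch.linspace(0.0, 1.0, 256))    # [256, 16]
outside = fourier_encode(torch.linspace(-0.3, 1.3, 256))  # [256, 16], same feature size
```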
