GPU memory #7

DhavalTaunk08 · 2022-06-04T15:17:52Z

Hi, can you please tell, how much gpu memory is required while finetuning the model? i am trying to finetune it using the Nivida 2080Ti with a memory of 12GB. But I am getting Cuda out of memory error.

MarkusSagen · 2022-06-05T07:39:24Z

Hi @DhavalTaunk08,
I would you mind expanding on some of your settings a bit?

When fine-tuning, what is your dataset? How large is your batch size? What do you set as the sequence length? Do you run it with mixed precision (fp16) and does your gpu have support and installed NVIDIA apex?

DhavalTaunk08 · 2022-06-07T13:43:11Z

Hi @MarkusSagen, I am using my own custom built dataset having same format as Wikisum dataset. I tried different batch sizes varying from 2 to 16. I have tried sequence length from 2 to 4096. I am using mixed precision (fp16) and the gpu is also well setup.

MarkusSagen · 2022-12-11T09:45:49Z

I think it is unlikely to work on with those specs unfortunately. xlm-r, just to initialize the weights and train for a minimal text sample is likely to take up more then 18gb I would estimate

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU memory #7

GPU memory #7

DhavalTaunk08 commented Jun 4, 2022

MarkusSagen commented Jun 5, 2022

DhavalTaunk08 commented Jun 7, 2022

MarkusSagen commented Dec 11, 2022

GPU memory #7

GPU memory #7

Comments

DhavalTaunk08 commented Jun 4, 2022

MarkusSagen commented Jun 5, 2022

DhavalTaunk08 commented Jun 7, 2022

MarkusSagen commented Dec 11, 2022