How to fine-tune the pre-trained model on my dataset? #67

felixshing · 2024-07-09T02:04:37Z

Hello, I would like to ask how can I load the pre-trained model and fine-tune it on my self-collected dataset?

yerfor · 2024-07-09T05:50:36Z

Hi, you can use the init_from_ckpt option.

felixshing · 2024-07-09T06:09:32Z

Hi, you can use the init_from_ckpt option.

Thanks!

I have another question regarding the pre-trained models you provided. Specifically, you included "audio2secc_vae" and "secc2plane_torso_orig". However, in your training guidelines for audio, it is recommended to first train "audio_lm3d_syncnet" and then "audio2motion". Similarly, for motion, the guideline suggests first training "Img-to-Plane" followed by "Motion-to-Video", which includes "secc2plane_head" and "secc2plane_torso".

I am a bit confused about their relationships. Are "audio2secc_vae" equivalent to "audio2motion" and "secc2plane_torso_orig" equivalent to "secc2plane_torso"?

For audio training, should I:

Train "audio_lm3d_syncnet" myself, and then
When training "audio2motion", provide the checkpoints from both my trained "audio_lm3d_syncnet" and the provided "audio2secc_vae"?

Or, do I not have to train "audio_lm3d_syncnet" at all and just provide "audio2secc_vae" for fine-tuning?

Similarly, for Motion-to-Video training, should I:

Train "Img-to-Plane" myself
Train "secc2plane_head" myself, based on trained "Img-to-Plane"
When training "secc2plane_torso", provide the checkpoints from both my trained "secc2plane_head" and the provided "secc2plane_torso_orig"?

But seems we can only set one checkpoint for "init_from_ckp"?

Additionally, does "secc2plane_head" imply inferring only the head area without the torso?

Thank you so much for your help!

yerfor · 2024-07-09T07:50:48Z

Yes, "audio2secc_vae" equivalent to "audio2motion" and "secc2plane_torso_orig" equivalent to "secc2plane_torso"
For audio training, should I ==> Yes, you need to train a syncnet.
You can skip the image-to-plane pre-training, and go through the init_from_ckpt => secc2plane_head => secc2plane_torso.
does "secc2plane_head" imply inferring only the head area without the torso? ==> Yes

felixshing · 2024-07-09T14:04:30Z

Thank you so much for your response! I am still a bit confused about this step:

You can skip the image-to-plane pre-training, and go through the init_from_ckpt => secc2plane_head => secc2plane_torso.

Where can we get the pre-trained model for image-to-plane? It appears that currently, we only have the pre-trained models for "audio2motion" and "secc2plane_torso".

Additionally, I noticed that during evaluation, the human figure changes each time instead of using the one I provided. Where is this part of the setup, and how can we modify it to use my provided human figure?

Thank you for your time!

yerfor · 2024-07-09T14:17:54Z

you can use the provided pre-trained secc2plane_torso to initialize you own secc2plane_head model, just set strict=False.

For using your provided human figure, please modify the code in validation_steps

felixshing · 2024-07-09T16:22:13Z

you can use the provided pre-trained secc2plane_torso to initialize you own secc2plane_head model, just set strict=False.

For using your provided human figure, please modify the code in validation_steps

Thank you for your reply!

I have modified the training logic. However, when I tried to train the secc2plane_head model on my 4090 GPU, I encountered the OOM issue. Is there any way to reduce the GPU memory requirement during training? I tried to reduce "num_workers" but it did not work

yerfor · 2024-07-09T18:24:26Z

You can reduce the batch_size, or you can try amp=True

moliq1 · 2024-08-19T08:31:47Z

@yerfor Hi, Thank you so much for your wonderful work. I was wondering if you could also release a public avaliable model of the syncnet, so we can finetune on our dataset much easier?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to fine-tune the pre-trained model on my dataset? #67

How to fine-tune the pre-trained model on my dataset? #67

felixshing commented Jul 9, 2024

yerfor commented Jul 9, 2024

felixshing commented Jul 9, 2024 •

edited

Loading

yerfor commented Jul 9, 2024

felixshing commented Jul 9, 2024 •

edited

Loading

yerfor commented Jul 9, 2024 •

edited

Loading

felixshing commented Jul 9, 2024

yerfor commented Jul 9, 2024

moliq1 commented Aug 19, 2024

How to fine-tune the pre-trained model on my dataset? #67

How to fine-tune the pre-trained model on my dataset? #67

Comments

felixshing commented Jul 9, 2024

yerfor commented Jul 9, 2024

felixshing commented Jul 9, 2024 • edited Loading

yerfor commented Jul 9, 2024

felixshing commented Jul 9, 2024 • edited Loading

yerfor commented Jul 9, 2024 • edited Loading

felixshing commented Jul 9, 2024

yerfor commented Jul 9, 2024

moliq1 commented Aug 19, 2024

felixshing commented Jul 9, 2024 •

edited

Loading

felixshing commented Jul 9, 2024 •

edited

Loading

yerfor commented Jul 9, 2024 •

edited

Loading