new model #66

shunxing12345 · 2024-04-26T08:48:10Z

Hi i want to add a model that has a different architecture from the LLaMA model. BUT when I was trying
accelerate launch -m --mixed_precision=bf16 eagle.train.main --tmpdir [path of data]\ --cpdir [path of checkpoints] -- configpath [path of config file]
I got the following ERROR

The text was updated successfully, but these errors were encountered:

Liyuhui-12 · 2024-04-29T01:22:33Z

It seems that the name of the embedding in your model is not 'embed_tokens'. You can modify it to the name of the embedding layer in your model.

shunxing12345 · 2024-04-29T08:56:12Z

Thanks for your replay!
I got an other problem I am trying to train an LLM which structure differs from LLaMA and Mixtral,　should I change the code of cnet.py? It seems based on LLaMA

Liyuhui-12 · 2024-04-30T15:23:52Z

This is not necessary; EAGLE's structure is independent of the target model. You can use the same cnet.py, or you can try other structures as well.

shunxing12345 · 2024-05-06T10:41:40Z

Thanks!
I have a finetuned a 12B model, but I got the OOM ERROR in model, head, optimizer, train_loader, test_loader, scheduler = accelerator.prepare( model, head, optimizer, train_loader, test_loader, scheduler. I have 8 40G-A100.

this is my train_config

this is my config.json

Liyuhui-12 · 2024-05-06T13:55:01Z

I noticed that your "n_layers" is set to 38, which makes your draft model very large. In EAGLE, the draft model consists of only one layer.

shunxing12345 · 2024-05-07T10:37:17Z

Hi, I have successfully trained an Auto-regression Head, but I encountered the following error during inference.
https://github.com/SafeAILab/EAGLE/blob/main/eagle/modeling_eagle.py#L957

and here is the size of Tensor

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new model #66

new model #66

shunxing12345 commented Apr 26, 2024 •

edited

Loading

Liyuhui-12 commented Apr 29, 2024

shunxing12345 commented Apr 29, 2024

Liyuhui-12 commented Apr 30, 2024

shunxing12345 commented May 6, 2024

Liyuhui-12 commented May 6, 2024

shunxing12345 commented May 7, 2024

new model #66

new model #66

Comments

shunxing12345 commented Apr 26, 2024 • edited Loading

Liyuhui-12 commented Apr 29, 2024

shunxing12345 commented Apr 29, 2024

Liyuhui-12 commented Apr 30, 2024

shunxing12345 commented May 6, 2024

Liyuhui-12 commented May 6, 2024

shunxing12345 commented May 7, 2024

shunxing12345 commented Apr 26, 2024 •

edited

Loading