I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81

2024WY · 2024-06-17T08:20:33Z

I had modified the preprocess_function.And the used data is the sft data of the model.
The pictures are the results of the training and testing in training:

but I use Spec-Bench project to test on Chineses dataset, the Mean accepted tokens is only 2.5994782086414654, not good enough.

Anything else to pay attention, Can you give some advice?Thanks!

Liyuhui-12 · 2024-06-28T07:47:27Z

I noticed that your top-3 accuracy on the training set is only about 0.8, which is relatively low. What is your training accuracy on the English dataset? If it is close to the accuracy on the Chinese dataset, it could be that the structure or size of the draft model is not suitable. If the English accuracy is significantly higher than the Chinese accuracy, it is possible that your base model is not sufficiently trained on Chinese, and its features cannot effectively capture the semantic information of Chinese.

2024WY changed the title ~~I use Chinese dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough~~ I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough Jun 17, 2024

hongyanz closed this as completed Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81

I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81

2024WY commented Jun 17, 2024

Liyuhui-12 commented Jun 28, 2024

I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81

I use Chinese and English mixed dataset and a another custom 7B model to train eagle，but the performance in Chinese is not good enough #81

Comments

2024WY commented Jun 17, 2024

Liyuhui-12 commented Jun 28, 2024