Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix nmt weight conversion #1660

Closed

Conversation

Pzzzzz5142
Copy link
Contributor

For WMT14 model, it shares the vocab across the encoder and decoder. So it wouldn't trigger this error. However, for language pair which has large differences like zh-en, usually we don't share the vocab. So here we need set the vocab size for decoder correctly.

@Pzzzzz5142 Pzzzzz5142 changed the title Fix nmt weight convert Fix nmt weight convention May 24, 2024
@Pzzzzz5142 Pzzzzz5142 changed the title Fix nmt weight convention Fix nmt weight conversion May 24, 2024
@byshiue
Copy link
Collaborator

byshiue commented May 28, 2024

Thank you for the report. We will fix it soon.

@byshiue byshiue self-requested a review May 28, 2024 01:08
@byshiue byshiue self-assigned this May 28, 2024
@byshiue byshiue added the triaged Issue has been triaged by maintainers label May 28, 2024
@kaiyux kaiyux mentioned this pull request May 28, 2024
@Pzzzzz5142
Copy link
Contributor Author

Closing this pr since the changes are included in the main branch.

@Pzzzzz5142 Pzzzzz5142 closed this May 28, 2024
@Pzzzzz5142 Pzzzzz5142 deleted the dev-pzzzzz-fix-nmt-convert branch May 28, 2024 12:11
@byshiue
Copy link
Collaborator

byshiue commented May 30, 2024

Thank you for the contribution. We have add you into the co-author into the release note.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Merged triaged Issue has been triaged by maintainers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants