Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FSDP: Add mistral model type support #384

Merged
merged 1 commit into from
Aug 26, 2024
Merged

Conversation

arm-diaz
Copy link
Collaborator

Adding support for mistral model type when using FSDP for training on Hyperpod. Before these changes, only mixtral was supported. This PR is part of other PRs that Nithin and I will incorporate as part of the mathstral sbatch job already tested on Hyperpod.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@arm-diaz arm-diaz changed the title Add mistral model type support FSDP: Add mistral model type support Jul 23, 2024
@nithiyn nithiyn requested a review from perifaws August 26, 2024 12:33
@perifaws
Copy link
Contributor

Approved, relates to #385

@nithiyn nithiyn merged commit 7791f5d into main Aug 26, 2024
@nithiyn nithiyn deleted the mathstral-fsdp-armdiazg branch August 26, 2024 22:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants