Skip to content

Issues: allenai/OLMo

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Performance degrades after converting checkpoint to HF type/question An issue that's a question
#716 opened Aug 28, 2024 by ahmadshapiro
Expected Data Format type/question An issue that's a question
#715 opened Aug 27, 2024 by aflah02
Which mmlu validation setting is recommend? type/question An issue that's a question
#714 opened Aug 27, 2024 by mathfinder
Criteria for Selecting acc vs. len_norm Metrics type/question An issue that's a question
#713 opened Aug 24, 2024 by mathfinder
OLMoThreadError: generator thread data thread 0 failed type/question An issue that's a question
#706 opened Aug 18, 2024 by ybdesire
[Quick question]: How do I turn off FSDP? type/question An issue that's a question
#703 opened Aug 15, 2024 by candygocandy
slurm script for: configs/official/OLMo-7B.yaml type/question An issue that's a question
#699 opened Aug 13, 2024 by andymvp2018
Number of tokens Olmo-1B was trained: 2T or 3T? type/question An issue that's a question
#697 opened Aug 9, 2024 by jyk13579
why CrossEntropyLoss is zero,i type/question An issue that's a question
#692 opened Aug 6, 2024 by aizhweiwei
Model ladder has no documentation type/documentation An issue or pull request related to documentation
#683 opened Jul 31, 2024 by IanMagnusson
Can long text be splitted into short texts? type/question An issue that's a question
#655 opened Jul 12, 2024 by CoinCheung
Cannot convert internal OLMo checkpoint to HF type/bug An issue about a bug
#654 opened Jul 11, 2024 by viking-sudo-rm
Issue with tokenizer wrapper type/question An issue that's a question
#644 opened Jul 8, 2024 by davidbrandfonbrener
What did OLMo 1B converge to? type/question An issue that's a question
#642 opened Jul 4, 2024 by sidereior
Resuming training on unsharded checkpoint type/bug An issue about a bug
#641 opened Jul 4, 2024 by lecifire
Multi node training type/question An issue that's a question
#640 opened Jul 3, 2024 by shahizat
How the 1B and 7B model are initialized? type/question An issue that's a question
#632 opened Jun 24, 2024 by sanyalsunny111
ProTip! Mix and match filters to narrow down what you’re looking for.