Issues: ggerganov/llama.cpp

Issues list

Huge performance degradation using latest branch on Intel Core Ultra 7 155H bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8328 opened Jul 5, 2024 by aahouzi
Bug: loading model is slow using llama-cli bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8323 opened Jul 5, 2024 by RunningLeon
Bug: make error bug-unconfirmed low severity
#8313 opened Jul 5, 2024 by lorihuang
New CMakelists is pure pain. bug-unconfirmed low severity
#8298 opened Jul 4, 2024 by Ph0rk0z
Feature Request: Support for Meta: Multi Token Prediction Models enhancement New feature or request
#8297 opened Jul 4, 2024 by sorasoras
4 tasks done
Bug: [SYCL] Inference not working correctly on multiple GPUs bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8294 opened Jul 4, 2024 by ch1y0q
Feature Request: support for Gemini Nano? enhancement
#8289 opened Jul 4, 2024 by flatsiedatsie
4 tasks done
Add support for InternLM 2.5 1M context. Should be as good as command r+ enhancement
#8285 opened Jul 4, 2024 by mirek190
4 tasks done
Why is the single input used incorrect, or no output? bug-unconfirmed medium severity
#8276 opened Jul 3, 2024 by QIANXUNZDL123
Bug: Llama 3 8b giving different outputs for same input (temperature 0) bug-unconfirmed medium severity
#8274 opened Jul 3, 2024 by LiquidGunay
Bug: Error when trying to use ./llama-gguf-split --merge to merge split model gguf files back bug-unconfirmed medium severity
#8264 opened Jul 2, 2024 by tybalex
Bug: Gemma2 Context switching forgets original input bug-unconfirmed medium severity
#8251 opened Jul 2, 2024 by Gomez12
Bug: CodeShell inference not working correctly bug-unconfirmed medium severity
#8250 opened Jul 2, 2024 by chiranko
llama3 quantization error bug-unconfirmed low severity
#8247 opened Jul 2, 2024 by tomgm777
Bug: gemma 2 27B GGML_ASSERT n_dims <= ne0 bug-unconfirmed low severity
#8246 opened Jul 1, 2024 by duynt575
Investigate gemma 2 generation quality enhancement
#8240 opened Jul 1, 2024 by ngxson
Feature Request: Support for CodeSage enhancement
#8224 opened Jun 30, 2024 by unclemusclez
4 tasks done
Bug: Docker ROCm crashs, only works on metal compiled. bug-unconfirmed critical severity Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#8213 opened Jun 29, 2024 by rudiservo
Bug: ld: symbol(s) not found for architecture arm64 bug-unconfirmed high severity
#8211 opened Jun 29, 2024 by quarterturn
Bug: Unable to generate the model output correctly bug-unconfirmed high severity
#8202 opened Jun 29, 2024 by Smupk2778