-
Notifications
You must be signed in to change notification settings - Fork 8.7k
Issues: ggerganov/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Huge performance degradation using latest branch on Intel Core Ultra 7 155H
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8328
opened Jul 5, 2024 by
aahouzi
Bug: loading model is slow using llama-cli
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8323
opened Jul 5, 2024 by
RunningLeon
Bug: make error
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8313
opened Jul 5, 2024 by
lorihuang
New CMakelists is pure pain.
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8298
opened Jul 4, 2024 by
Ph0rk0z
Feature Request: Support for Meta: Multi Token Prediction Models
enhancement
New feature or request
#8297
opened Jul 4, 2024 by
sorasoras
4 tasks done
Bug: [SYCL] Inference not working correctly on multiple GPUs
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8294
opened Jul 4, 2024 by
ch1y0q
Feature Request: support for Gemini Nano?
enhancement
New feature or request
#8289
opened Jul 4, 2024 by
flatsiedatsie
4 tasks done
Add support for InternLM 2.5 1M context. Should be as good as command r+
enhancement
New feature or request
#8285
opened Jul 4, 2024 by
mirek190
4 tasks done
Why is the single input used incorrect, or no output?
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8276
opened Jul 3, 2024 by
QIANXUNZDL123
Feature Request: (server) Add option to always skip all queued tasks and to process the last one only (within one slot)
enhancement
New feature or request
#8275
opened Jul 3, 2024 by
stduhpf
4 tasks done
Bug: Llama 3 8b giving different outputs for same input (temperature 0)
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8274
opened Jul 3, 2024 by
LiquidGunay
Bug: Error when trying to use Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
./llama-gguf-split --merge
to merge split model gguf files back
bug-unconfirmed
medium severity
#8264
opened Jul 2, 2024 by
tybalex
Bug: Gemma2 Context switching forgets original input
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8251
opened Jul 2, 2024 by
Gomez12
Bug: CodeShell inference not working correctly
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#8250
opened Jul 2, 2024 by
chiranko
llama3 quantization error
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8247
opened Jul 2, 2024 by
tomgm777
Bug: gemma 2 27B GGML_ASSERT n_dims <= ne0
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#8246
opened Jul 1, 2024 by
duynt575
Investigate gemma 2 generation quality
enhancement
New feature or request
#8240
opened Jul 1, 2024 by
ngxson
Feature Request: Support for CodeSage
enhancement
New feature or request
#8224
opened Jun 30, 2024 by
unclemusclez
4 tasks done
[feature request] Ability to import/export sessions from the UI.
#8220
opened Jun 30, 2024 by
0wwafa
Bug: Docker ROCm crashs, only works on metal compiled.
bug-unconfirmed
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#8213
opened Jun 29, 2024 by
rudiservo
Bug: ld: symbol(s) not found for architecture arm64
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8211
opened Jun 29, 2024 by
quarterturn
Show: FUTO-org Keyboard with llama.cpp-powered auto-correction and on-device finetuning
#8204
opened Jun 29, 2024 by
Green-Sky
Bug: Unable to generate the model output correctly
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#8202
opened Jun 29, 2024 by
Smupk2778
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.