Pull requests: ggml-org/llama.cpp

Pull requests list

ggml-cpu: optimise s390x multiply extend instructions
#20032 opened Mar 2, 2026 by taronaeo | labels: ggml
cann: support flash attention for head dim not multiple of 16
#20031 opened Mar 2, 2026 by noemotiovon | labels: Ascend NPU, ggml
gguf-py: add type validation to GGUFWriter.add_key_value
#20023 opened Mar 1, 2026 by Scottcjn | labels: python
json-schema: handle typeless schema nodes as any-value
#20021 opened Mar 1, 2026 by Scottcjn | labels: testing
gguf: add big-endian magic "FUGG" for explicit endianness detection
#20019 opened Mar 1, 2026 by Scottcjn | labels: ggml, python, testing
vulkan: add UMA zero-copy async transfers and fix event_record deferred memcpy handling
#20018 opened Mar 1, 2026 by neilopet | labels: ggml, testing, Vulkan
vulkan: add sparse OOM fallback for large UMA allocations and chunked staging fallback
#20017 opened Mar 1, 2026 by neilopet | labels: ggml, testing, Vulkan
server: add Qwen3-Reranker instruction support
#20009 opened Mar 1, 2026 by schwebke | labels: examples, python, server
webui: add PWA support
#19995 opened Feb 28, 2026 by matous-volf | labels: examples, server
common : fix common_chat_peg_parse for incomplete utf-8 sequence tail
#19992 opened Feb 28, 2026 by akreal | labels: testing
build: fix various compiler warnings on Windows MinGW
#19990 opened Feb 28, 2026 by jonathanjacksonswe | labels: examples, testing
vulkan: tune MMVQ for Intel Windows
#19988 opened Feb 28, 2026 by 0cc4m | labels: ggml, Vulkan
ggml : add GGML_OP_ADD1 for metal
#19987 opened Feb 28, 2026 by aisk | labels: Apple Metal, ggml
cli : add command and file auto-completion
#19985 opened Feb 28, 2026 by CISC | labels: examples
Re-enable manual LoRA adapter free
#19983 opened Feb 28, 2026 by PopFlamingo
feat: add --threads-all option to llama-bench
#19971 opened Feb 28, 2026 by hobostay | labels: examples
4 tasks done
Fix logic for retrieving schema items in json_schema_to_grammar.py
#19968 opened Feb 28, 2026 by RayXu14 | labels: examples, python
ggml webgpu: fix workgroup dispatch limit for large batch sizes
#19965 opened Feb 28, 2026 by abhijitramesh | labels: ggml
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type
#19959 opened Feb 27, 2026 by wallentri88 | labels: documentation, ggml, Nvidia GPU
scripts : improve get-wikitext-2.sh
#19952 opened Feb 27, 2026 by angt | labels: script
[New quant] Q3_PT
#19941 opened Feb 26, 2026 by pwilkin | Draft | labels: examples, ggml, Nvidia GPU, python, testing