-
Notifications
You must be signed in to change notification settings - Fork 12.1k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml: adds CONV_2D op and direct GEMM Vulkan implementation
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14316
opened Jun 21, 2025 by
etasnadi
Loading…
gguf-py : fix Qwen3-Embedding eos token
python
python script changes
#14314
opened Jun 21, 2025 by
CISC
Loading…
CUDA: add mean operation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14313
opened Jun 21, 2025 by
am17an
Loading…
GitHub workflow: set RPATH to "@loader_path" / "$ORIGIN" to ensure executables and dynamic libraries search for dependencies in their origin directory.
devops
improvements to build systems and github actions
#14309
opened Jun 20, 2025 by
rotemdan
Loading…
ggml-cpu: enable IBM NNPA Vector Intrinsics
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Fix Windows Null Pointer Bug and Enhance Memory Operations in ggml-sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14290
opened Jun 20, 2025 by
MengAiDev
Loading…
CUDA: mul_mat_v support for batch sizes > 1
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14262
opened Jun 18, 2025 by
JohannesGaessler
Loading…
opencl: ref count changes relating to the ggml tensor library for machine learning
ggml_backend_opencl_context
and refactor profiling
ggml
#14254
opened Jun 18, 2025 by
lhez
Loading…
Add SmolLM3
documentation
Improvements or additions to documentation
python
python script changes
#14240
opened Jun 17, 2025 by
Vaibhavs10
•
Draft
MODEL: Falcon-H1 support
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#14238
opened Jun 17, 2025 by
younesbelkada
•
Draft
Mtmd: add a way to select device for vision encoder
examples
#14236
opened Jun 17, 2025 by
stduhpf
Loading…
ggml: introduce GGML_NUMA_MIGRATE to optimize cross NUMA op computation
examples
ggml
changes relating to the ggml tensor library for machine learning
#14232
opened Jun 17, 2025 by
wenlujon
Loading…
logit_bias: apply configurable escalating EOG bias at low n_remain
examples
server
testing
Everything test related
#14229
opened Jun 16, 2025 by
graehl
Loading…
tests : enhance llama-bench with separate timings (pp/gen t/s), added n_threads_batch
examples
#14219
opened Jun 16, 2025 by
thad0ctor
Loading…
webui: save model name with conversation history (#13570)
examples
server
#14192
opened Jun 15, 2025 by
deepanshu2015
Loading…
ci: re-enable rocm linux build, reduce the built targets to the ones currently available in rocblas
devops
improvements to build systems and github actions
#14184
opened Jun 14, 2025 by
IMbackK
Loading…
ggml : implement op fusion, starting with REGLU/GEGLU/SWIGLU
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
help wanted
Extra attention is needed
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
models/templates: add mistralai/Mistral-Small-3.1-24B-Instruct-2503 template with tool calling support
#14148
opened Jun 12, 2025 by
bretello
Loading…
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization
ggml
changes relating to the ggml tensor library for machine learning
#14117
opened Jun 11, 2025 by
Vithulep
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.