Pull requests: ggml-org/llama.cpp

Pull requests list

spacemit : fix wrong transpose function for int16 data
Labels: ggml
#25161 opened Jun 30, 2026 by I3eg1nner
opencl: initial q1_0 support
Labels: ggml, OpenCL
#25160 opened Jun 30, 2026 by lhez (Contributor), Draft
cuda: fix crash when querying memory on device with no free memory.
Labels: CUDA, ggml
#25157 opened Jun 30, 2026 by cphlipot
ggml: imatrix-aware NVFP4 quantization (scale search) + wire NVFP4 ftype
Labels: examples, ggml
#25153 opened Jun 30, 2026 by avifenesh
CUDA: add COL2IM_1D op
Labels: CUDA, documentation, ggml
#25151 opened Jun 29, 2026 by Ssamdeman
CUDA: fix Gemma E4B MTP FlashAttention
Labels: CUDA, ggml
#25148 opened Jun 29, 2026 by JohannesGaessler (Contributor)
ggml-webgpu: add support for NVFP4
Labels: ggml, WebGPU
#25143 opened Jun 29, 2026 by yomaytk (Contributor)
model : register t_layer_inp for qwen3next
Labels: model
#25141 opened Jun 29, 2026 by jschmied
llama : add llama_model_ftype_name()
Labels: server
#25134 opened Jun 29, 2026 by angt (Member)
llama : add position-relocatable KV range save/load
Labels: testing
#25133 opened Jun 29, 2026 by Anyesh
[SYCL] enhance argsort to support all UT cases
Labels: documentation, ggml, SYCL
#25125 opened Jun 29, 2026 by arthw (Contributor)
[SYCL] fix unsupport ACC UT cases for noncontiguous
Labels: documentation, ggml, SYCL
#25124 opened Jun 29, 2026 by arthw (Contributor)
ggml-cpu: fix NEON build compilation on 32-bit ARMv7 architectures without hardware FP16
Labels: ggml
#25119 opened Jun 29, 2026 by Smu1zel
opencl: add ABS op
Labels: documentation, ggml, OpenCL
#25115 opened Jun 29, 2026 by Gezahegne
Add Vision Support for Minimax-M3
Labels: conversion, model, mtmd, testing
#25113 opened Jun 28, 2026 by timkhronos (Contributor), Draft
Granite-Switch Architecture
Labels: conversion, model
#25107 opened Jun 28, 2026 by barvhaim
CUDA: fix get_rows_back for tables with more than 65535 rows (grid-y clamp + stride)
Labels: CUDA, ggml, merge ready, testing
#25103 opened Jun 28, 2026 by mattjallo
sycl: fix check_graph_compatibility() to allow graphs for MoE decode (CONCAT dim!=3, MUL_MAT_ID fused path)
Labels: ggml, SYCL
#25089 opened Jun 28, 2026 by Captain-Tripps
5 tasks done