llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-09 07:16:44 +02:00

Files

T

Masashi Yoshimura 1e1aca09da ggml-webgpu: Improve prefill speeds for k-quants + refactor matmul for Q4/Q5/Q8 and k-quants (#24225 )

* ggml-webgpu: Improve prefill speeds + refactor matmul for quants

* Fixes for editroconfig checker

2026-06-08 15:19:56 -07:00

2026-05-25 10:15:46 +03:00

2026-06-01 12:30:10 +02:00

2026-06-08 15:19:56 -07:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-06-08 14:31:33 +03:00