llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-30 01:27:42 +02:00

Files

T

Pascal 3a3edc9ac6 Ggml/cuda col2im 1d (#24417 )

* cuda: add GGML_OP_COL2IM_1D, follow-up to the CPU op

* cuda: col2im_1d use fast_div_modulo for the index decomposition

* cuda: col2im_1d tighten supports_op, type match and contiguous dst

2026-06-18 22:23:01 +02:00

cmake

ggml : Parallelize quant LUT init (#23595 )

2026-05-25 10:15:46 +03:00

include

Remove padding and multiple D2D copies for MTP (#24086 )

2026-06-10 23:21:16 +05:30

src

Ggml/cuda col2im 1d (#24417 )

2026-06-18 22:23:01 +02:00

.gitignore

vulkan : cmake integration (#8119 )

2024-07-13 18:12:39 +02:00

CMakeLists.txt

[SYCL] rename GGML_SYCL_SUPPORT_LEVEL_ZERO (#24719 )

2026-06-18 11:18:26 +03:00