llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-29 17:17:40 +02:00

Georgi Gerganov 3c81c8deea server : print graphs reused in slot timings (#23279)

Add graphs reused counter to the per-slot timing output, printed via
llama_perf_context().

Assisted-by: llama.cpp:local pi

Co-authored-by: ggerganov <ggerganov@users.noreply.github.com>

2026-05-19 09:46:58 +03:00

batched-bench

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

cli

llama + spec: MTP Support (#22673)

2026-05-16 20:06:23 +08:00

completion

llama + spec: MTP Support (#22673)

2026-05-16 20:06:23 +08:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

export-lora

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

fit-params

fit-params : refactor + add option to output estimated memory per device (#22171)

2026-04-21 09:54:36 +03:00

gguf-split

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

imatrix

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

llama-bench

spec : refactor params (#22397)

2026-04-28 09:07:33 +03:00

mtmd

mtmd: add chunks and fix preproc for qwen3a (#23073)

2026-05-15 19:32:47 +02:00

parser

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

perplexity

fit-params : refactor + add option to output estimated memory per device (#22171)

2026-04-21 09:54:36 +03:00

quantize

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

results

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

rpc

fix: rpc-server cache may not work in Windows environments (#22394)

2026-04-27 17:25:09 +03:00

server

server : print graphs reused in slot timings (#23279)

2026-05-19 09:46:58 +03:00

tokenize

libs : rename libcommon -> libllama-common (#21936)

2026-04-17 11:11:46 +03:00

tts

logs : reduce (#23021)

2026-05-14 13:05:52 +03:00

ui: Update KaTeX package and clean up logs from sass warnings (#23275)

2026-05-18 16:26:01 +02:00

CMakeLists.txt

ui: Restructure repo to use tools/ui folder and ui / UI / llama-ui / LLAMA_UI naming (#23064)

2026-05-16 02:02:40 +02:00