llama.cpp/tools at b9731 - llama.cpp - Gitea: Git with a cup of tea

wylab/llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-30 01:27:42 +02:00

Files

T

History

Adrien Gallouët 4b48a53b6c server : optimize get_token_probabilities (#24796 )

Use std::partial_sort to order only the requested top-n tokens instead
of the full vocabulary

    logprobs sort: vocab=128000 n_top=0 iters=100
    full    sort:   8555.6 us/op
    partial sort:    704.3 us/op

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

2026-06-19 23:26:54 +02:00

..

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

mtmd, arg: fix utf8 handling on windows (#24779 )

2026-06-19 22:28:38 +02:00

completion : remove useless statics (#24226 )

2026-06-06 12:16:16 +02:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

docs: fix export-lora --lora-scaled syntax [no release] (#24703 )

2026-06-18 16:46:17 +02:00

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

Move duplicated imatrix code into single common imatrix-loader.cpp (#22445 )

2026-06-04 17:45:40 +02:00

bench : add --offline (#24511 )

2026-06-16 08:26:05 +02:00

mtmd, arg: fix utf8 handling on windows (#24779 )

2026-06-19 22:28:38 +02:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

perplexity : fix format specifier in LOG_ERR (#23788 )

2026-05-28 10:34:58 +03:00

docs: Update quantization readme (#24133 )

2026-06-05 12:21:26 +02:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

fix: rpc-server cache may not work in Windows environments (#22394 )

2026-04-27 17:25:09 +03:00

server : optimize get_token_probabilities (#24796 )

2026-06-19 23:26:54 +02:00

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

logs : reduce (#23021 )

2026-05-14 13:05:52 +03:00

ui: provide touch accessible model selection UI (#24604 )

2026-06-18 13:14:20 +02:00

CMakeLists.txt

cmake: skip cvector-generator and export-lora when CPU backend is disabled (#24053 )

2026-06-04 13:13:19 +03:00