llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-06-30 01:27:42 +02:00

Files

T

Anuj Attri 10786217e9 server : return HTTP 400 on invalid grammar (#24144 ) (#24154 )

Throw on grammar parse failure so the server returns HTTP 400
instead of silently dropping the constraint.
Add a regression test for the invalid-grammar response.

Fixes #24144

2026-06-18 12:49:14 +02:00

batched-bench

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

cli

cli : fix not copying preserved tokens (#24258 )

2026-06-14 11:52:15 +02:00

completion

completion : remove useless statics (#24226 )

2026-06-06 12:16:16 +02:00

cvector-generator

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

export-lora

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

fit-params

cmake : add install() for impl libraries + fix apple builds (#23511 )

2026-05-22 11:46:26 +03:00

gguf-split

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

imatrix

Move duplicated imatrix code into single common imatrix-loader.cpp (#22445 )

2026-06-04 17:45:40 +02:00

llama-bench

bench : add --offline (#24511 )

2026-06-16 08:26:05 +02:00

mtmd

mtmd: refactor preprocessor, add mtmd_image_preproc_out (#24736 )

2026-06-18 12:04:39 +02:00

parser

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

perplexity

perplexity : fix format specifier in LOG_ERR (#23788 )

2026-05-28 10:34:58 +03:00

quantize

docs: Update quantization readme (#24133 )

2026-06-05 12:21:26 +02:00

results

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

rpc

fix: rpc-server cache may not work in Windows environments (#22394 )

2026-04-27 17:25:09 +03:00

server

server : return HTTP 400 on invalid grammar (#24144 ) (#24154 )

2026-06-18 12:49:14 +02:00

tokenize

libs : rename libcommon -> libllama-common (#21936 )

2026-04-17 11:11:46 +03:00

tts

logs : reduce (#23021 )

2026-05-14 13:05:52 +03:00

ui: Update code formatting command in pre-commit hook (#24685 )

2026-06-18 08:33:50 +02:00

CMakeLists.txt

cmake: skip cvector-generator and export-lora when CPU backend is disabled (#24053 )

2026-06-04 13:13:19 +03:00