mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2026-06-09 07:16:44 +02:00
0821c5fcfd
* server: in SSE mode, send HTTP headers when slot starts
* ref to pr
* stream should be false by default
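
The behavior described in the commit message can be illustrated with a minimal sketch. This is not the llama.cpp server code: the function names `wants_stream` and `sse_response`, the `{"content": ...}` payload shape, and the exact header set are all assumptions made for illustration. It shows only the ordering the commit describes (headers are emitted when the slot starts, before any token exists) and a `stream` field that defaults to false.

```python
import json
from typing import Iterable, Iterator


def wants_stream(request_body: dict) -> bool:
    # Hypothetical helper: "stream" is false unless the client sets it.
    return bool(request_body.get("stream", False))


def sse_response(tokens: Iterable[str]) -> Iterator[bytes]:
    # Emit the HTTP headers first, as soon as the (hypothetical) slot
    # starts, so the client sees the response before the first token.
    yield (
        b"HTTP/1.1 200 OK\r\n"
        b"Content-Type: text/event-stream\r\n"
        b"Cache-Control: no-cache\r\n"
        b"\r\n"
    )
    # Each generated token then becomes one SSE "data:" event,
    # terminated by a blank line. The payload shape is an assumption.
    for tok in tokens:
        payload = json.dumps({"content": tok})
        yield f"data: {payload}\n\n".encode("utf-8")
```

Because `sse_response` is a generator, the header chunk is available to the caller before the `tokens` iterable is consumed, which mirrors the idea of sending headers without waiting for generation output.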