llama-swap-rocm

Files

T

Benson Wong 22e098ac8b Add Peer Model Support (#438 )

This PR allows a single llama-swap to be the central proxy for models served by other inference servers. The peer servers can be another llama-swap or any API that supports the /v1/* inference endpoint.

Updates: #433, #299
Closes: #296

2025-12-27 20:18:06 -08:00

assets

move header images around [skip ci]

2025-12-02 19:40:42 -08:00

examples

Clean up and Documentation (#347 ) [skip ci]

2025-10-19 14:53:13 -07:00

configuration.md

Add Peer Model Support (#438 )

2025-12-27 20:18:06 -08:00

container-security.md

docs: add documentation for non-root container images and security considerations (#416 )

2025-12-02 08:52:26 -08:00