shc-ai (Caddy v2 + llama.cpp Qwen3.6-35B-A3B) — see /v1/* for API