Skip to main content

Chat completion

POST 

/v1/chat/completions

OpenAI-compatible chat completions. Accepts any model the gateway has a provider for — gpt-*, claude-*, gemini-*, ollama/*, mock/*, and so on. Set stream: true to receive SSE deltas.

The gateway runs the full pre-dispatch pipeline (template resolution, budget enforcement, policy evaluation, PII scan, guardrails, priority admission, model downgrade, context window check) before any upstream call, and the post-dispatch pipeline (schema validation, grounding detection, content filters, PII output scan) on the response.

Request

Responses

Chat completion