Headers
Token for authentication
Optional provider key for running requests against closed source models. Required for providers anthropic and cohere. Removes rate limits for the other providers.
Body
The ID of the model to run (e.g., Qwen/Qwen3-1.7B, openai/gpt-4)
Conversation messages (OpenAI chat format)
Maximum number of tokens to generate
Sampling temperature
Whether to stream responses