OpenAI's open-weight model designed for powerful reasoning, agentic tasks, and versatile developer use cases - optimized for lower latency and specialized use-cases at the edge
Upstream Providers
ZDR only
Hugging Face
huggingface-byok
In$0Out$0TTFT—TPS—
No ZDRTools
DeepInfra
deepinfra
In$0.03Out$0.14TTFT200msTPS120.0 tps
No ZDRTools
Cloudflare AI Gateway
cloudflare
In$0.20Out$0.30TTFT180msTPS150.0 tps
ZDRTools
API Details
Chat Completions API
/api/v1/chat/completions
OpenAI-compatible endpoint with a messages array. Standard interface for chat-based interactions.
Responses API
/api/v1/responses
Native OpenAI Responses format with a simplified input parameter for single-turn requests.
Uptime & Health
Uptime %Finish ReasonsLatency
No uptime data yet
These providers haven't been health-probed for this model yet. The router still routes around upstreams that fail live requests — uptime fills in once probe history accrues.