OpenAI o4-mini is a fast, cost-efficient o-series reasoning model with strong performance across coding and visual tasks, succeeded by GPT-5 Mini for new workloads. Supports a 200,000-token input context with 100,000 output tokens, accepts text and image inputs, and exposes streaming, function calling, structured outputs, and reasoning tokens across Chat Completions, Responses, Realtime, Batch, Fine-tuning, and Assistants APIs. Priced at $1.10 / $4.40 per 1M input/output tokens.
Chat Completions API
/api/v1/chat/completions
OpenAI-compatible endpoint with a messages array. Standard interface for chat-based interactions.
Responses API
/api/v1/responses
Native OpenAI Responses format with a simplified input parameter for single-turn requests.
These providers haven't been health-probed for this model yet. The router still routes around upstreams that fail live requests — uptime fills in once probe history accrues.
curl https://anyrouter.dev/api/v1/chat/completions \
-H "Authorization: Bearer $ANYROUTER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "openai/o4-mini",
"messages": [
{"role": "user", "content": "Say hi in 3 words."}
]
}'ANYROUTER_API_KEY — Dashboard → Keys{
"model": "openai/o4-mini",
"messages": [
{
"role": "user",
"content": "Say hi in 3 words."
}
]
}Change sort, ZDR, effort, or pin a provider to add routing params.