GLM-5 is Z-AI's general-purpose foundation model for reasoning, coding, and agentic workloads, with a 128K context window and 32K output, supporting thinking mode, function calling, and structured outputs.
Upstream Providers
ZDR only
Z-AI
z-ai-byok
In$0Out$0
No ZDRTools
OpenCode Zen
opencode-zen-byok
In$0.60Out$2.20
BYOKBYOKNo ZDRTools
OpenCode Zen Go
opencode-go-byok
In$1.40Out$4.40
BYOKBYOKCachedNo ZDRTools
API Details
Chat Completions API
/api/v1/chat/completions
OpenAI-compatible endpoint with a messages array. Standard interface for chat-based interactions.
Uptime & Health
Uptime %Finish ReasonsLatency
No uptime data yet
These providers haven't been health-probed for this model yet. The router still routes around upstreams that fail live requests — uptime fills in once probe history accrues.