Models & Pricing

Panda World provides access to leading Chinese LLMs. All models are accessible through the single OpenAI-compatible endpoint. You only pay for what you use — no monthly fees.

Pricing Table

Prices are in USD per million tokens (prompt + completion combined unless otherwise specified). Updated weekly.

Chat Models

Model ID	Provider	Input ($/1M)	Output ($/1M)	Context
`deepseek-chat`	DeepSeek	$1.00	$1.00	64K
`deepseek-reasoner`	DeepSeek	$7.00	$7.00	64K
`qwen3-235b`	Alibaba Cloud	$4.00	$4.00	128K
`qwen3-32b`	Alibaba Cloud	$1.50	$1.50	128K
`glm-5-flash`	Zhipu AI	$0.50	$0.50	128K
`glm-5`	Zhipu AI	$2.00	$2.00	128K

Cost Comparison

Model	Panda World	OpenAI GPT-4o	Savings
DeepSeek-V3	$1.00	$2.50	~60%
DeepSeek-R1	$7.00	$10.00 (o1)	~30%
GLM-5-Flash	$0.50	$2.50	~80%

Model Selection Guide

DeepSeek Models

deepseek-chat (DeepSeek-V3): Best all-rounder. Strong on code, reasoning, and general knowledge. The default choice for most use cases.

deepseek-reasoner (DeepSeek-R1): Chain-of-thought reasoning model. Excellent for complex math, logic puzzles, and tasks requiring step-by-step reasoning. Slower but more thorough.

Qwen Models

qwen3-235b: Largest and most capable Qwen model. Best for Chinese language tasks and long-context applications (up to 128K tokens).

qwen3-32b: More efficient 32B variant. Good price-performance balance.

GLM Models

glm-5-flash: Fast and extremely cost-effective. Ideal for high-volume, latency-sensitive applications where quality can be slightly lower.

glm-5: Full GLM-5 model. Higher quality than Flash at a moderate price.

Pricing Notes

Billing is based on total tokens processed (input + output)
Token counting follows each model provider's tokenizer
Requests that return errors due to provider failures are not billed
Prices are subject to change; we'll notify you via email
Volume discounts available — contact us for enterprise pricing