Models & Pricing

Panda World provides access to leading Chinese LLMs. All models are accessible through the single OpenAI-compatible endpoint. You only pay for what you use — no monthly fees.

Pricing Table

Prices are in USD per million tokens (prompt + completion combined unless otherwise specified). Updated weekly.

Chat Models

Model IDProviderInput ($/1M)Output ($/1M)Context
deepseek-chatDeepSeek$1.00$1.0064K
deepseek-reasonerDeepSeek$7.00$7.0064K
qwen3-235bAlibaba Cloud$4.00$4.00128K
qwen3-32bAlibaba Cloud$1.50$1.50128K
glm-5-flashZhipu AI$0.50$0.50128K
glm-5Zhipu AI$2.00$2.00128K

Cost Comparison

ModelPanda WorldOpenAI GPT-4oSavings
DeepSeek-V3$1.00$2.50~60%
DeepSeek-R1$7.00$10.00 (o1)~30%
GLM-5-Flash$0.50$2.50~80%

Model Selection Guide

DeepSeek Models

deepseek-chat (DeepSeek-V3): Best all-rounder. Strong on code, reasoning, and general knowledge. The default choice for most use cases.

deepseek-reasoner (DeepSeek-R1): Chain-of-thought reasoning model. Excellent for complex math, logic puzzles, and tasks requiring step-by-step reasoning. Slower but more thorough.

Qwen Models

qwen3-235b: Largest and most capable Qwen model. Best for Chinese language tasks and long-context applications (up to 128K tokens).

qwen3-32b: More efficient 32B variant. Good price-performance balance.

GLM Models

glm-5-flash: Fast and extremely cost-effective. Ideal for high-volume, latency-sensitive applications where quality can be slightly lower.

glm-5: Full GLM-5 model. Higher quality than Flash at a moderate price.

Pricing Notes

  • Billing is based on total tokens processed (input + output)
  • Token counting follows each model provider's tokenizer
  • Requests that return errors due to provider failures are not billed
  • Prices are subject to change; we'll notify you via email
  • Volume discounts available — contact us for enterprise pricing