API Documentation

API Base URL

/api/v1

This API is compatible with the OpenAI SDK: point your client's base_url at this service and supply your api_key, and existing OpenAI client code works unchanged.

Chat Completions

POST /api/v1/chat/completions

Request Parameters

Parameter    Type     Required  Description
model        string   Yes       Model ID
messages     array    Yes       Message list
stream       boolean  No        Stream output
temperature  number   No        Sampling temperature (0-2)
max_tokens   integer  No        Max output tokens

Response Format

{
  "id": "chatcmpl-xxx",
  "object": "chat.completion",
  "model": "deepseek-chat",
  "choices": [{
    "index": 0,
    "message": {
      "role": "assistant",
      "content": "Hello!"
    },
    "finish_reason": "stop"
  }],
  "usage": {
    "prompt_tokens": 10,
    "completion_tokens": 5,
    "total_tokens": 15
  }
}
Python Example

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="/api/v1"  # use the full URL of your deployment, e.g. https://your-host/api/v1
)

# Non-streaming
response = client.chat.completions.create(
    model="deepseek-chat",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"}
    ],
    max_tokens=1000
)
print(response.choices[0].message.content)

# Streaming
stream = client.chat.completions.create(
    model="deepseek-chat",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True
)
for chunk in stream:
    # Some chunks (e.g. the final usage chunk) may carry no choices or an empty delta
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")

Embeddings

POST /api/v1/embeddings

Request Parameters

Parameter  Type            Required  Description
model      string          Yes       Embedding model ID
input      string | array  Yes       Input text
Python Example

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="/api/v1"  # use the full URL of your deployment, e.g. https://your-host/api/v1
)

response = client.embeddings.create(
    model="text-embedding-3-small",
    input="Hello world"
)

print(f"Dimensions: {len(response.data[0].embedding)}")
print(f"Vector: {response.data[0].embedding[:5]}...")

Model List

GET /api/v1/models

Lists all available models in an OpenAI-compatible format.

# Replace your-host with your deployment's domain
curl https://your-host/api/v1/models \
  -H "Authorization: Bearer YOUR_API_KEY"

Error Codes

HTTP Status  Description                                                         Handling
200          Success                                                             -
400          Bad request (missing model/messages or other required fields)       Check request body format
401          Invalid, expired, or disabled API Key                               Check Authorization header
402          Insufficient balance or wallet suspended                            Recharge and retry
403          No access to this model (model allowlist restriction)               Contact admin to enable model access
429          Rate limit exceeded, token quota exhausted, or daily limit reached  Reduce request frequency or contact admin to raise limits
500          Internal server error                                               Retry later
502          Upstream provider unavailable                                       Retry later or switch model
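
The table above maps cleanly onto client-side handling. A minimal sketch (the action strings are illustrative, not part of the API):

```python
# Map each documented status code to the handling the table suggests.
# These action labels are illustrative, not defined by the API.
ACTIONS = {
    400: "check request body",
    401: "check API key",
    402: "recharge wallet",
    403: "request model access",
    429: "back off and retry",
    500: "retry later",
    502: "retry later or switch model",
}

def handling_for(status):
    """Return the suggested client-side handling for an HTTP status code."""
    if 200 <= status < 300:
        return "ok"
    return ACTIONS.get(status, "unexpected status; see docs")
```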

Rate Limiting

API requests are protected by rate limiting. Limit info is returned via response headers:

Response Header        Description
X-RateLimit-Limit      Max requests per minute
X-RateLimit-Remaining  Remaining requests in current window
X-RateLimit-Reset      Limit reset time (ISO 8601)

When the limit is exceeded, the API returns a 429 status code together with a Retry-After: 60 response header.
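
A small sketch of reading these headers from a response. The helpers below are illustrative; the headers argument is any dict-like mapping, such as response.headers from the requests library:

```python
def retry_after_seconds(headers, default=60):
    """Seconds to wait after a 429, from the Retry-After header (falls back to 60)."""
    try:
        return int(headers.get("Retry-After", default))
    except (TypeError, ValueError):
        return default

def remaining_requests(headers):
    """Requests left in the current window, or None if the header is absent."""
    value = headers.get("X-RateLimit-Remaining")
    return int(value) if value is not None else None
```

Throttling proactively when remaining_requests() nears zero avoids hitting the 429 in the first place.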

Frequently Asked Questions

What should I do if I get a 401 error?

A 401 means your API Key is invalid, expired, or disabled. Check: 1) Authorization header format is "Bearer YOUR_API_KEY"; 2) The key is active in your dashboard; 3) The key hasn't expired.
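
Most 401s come down to a malformed Authorization header. A tiny helper to build and sanity-check it (function names are illustrative):

```python
def auth_header(api_key):
    """Build the Authorization header in the form the API expects."""
    return {"Authorization": f"Bearer {api_key}"}

def looks_well_formed(header_value):
    """Catch the most common mistake: a missing 'Bearer ' prefix or an empty key."""
    prefix = "Bearer "
    return header_value.startswith(prefix) and len(header_value) > len(prefix)
```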

How do I handle a 402 (insufficient balance) error?

A 402 means your wallet balance is insufficient or suspended. Log in to the dashboard to top up your wallet, or contact your agent/admin for a recharge. Service resumes immediately after top-up.

How do I handle 429 rate limit errors?

A 429 means you've exceeded the rate limit. Solutions: 1) Add exponential backoff retry logic; 2) Check the X-RateLimit-Remaining response header to throttle requests; 3) Contact admin to increase your limits.
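
The exponential backoff mentioned in point 1 can be sketched like this. The sketch is framework-agnostic; with the OpenAI SDK you would pass a predicate matching openai.RateLimitError:

```python
import random
import time

def backoff_delays(max_retries=5, base=1.0, cap=60.0):
    """Exponential schedule: base * 2**i seconds per attempt, capped at `cap`."""
    return [min(cap, base * 2 ** i) for i in range(max_retries)]

def with_retry(call, is_rate_limited, max_retries=5, base=1.0):
    """Run `call()`; when `is_rate_limited(exc)` is true, back off and retry."""
    for delay in backoff_delays(max_retries, base):
        try:
            return call()
        except Exception as exc:
            if not is_rate_limited(exc):
                raise
            time.sleep(delay + random.uniform(0, delay))  # jitter spreads out retries
    raise RuntimeError("still rate limited after retries")
```

Usage with the OpenAI SDK would look like with_retry(lambda: client.chat.completions.create(...), lambda e: isinstance(e, openai.RateLimitError)).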

What's the difference between streaming and non-streaming? Which should I use?

Non-streaming (default) waits for the complete response before returning — good for batch processing. Streaming (stream: true) returns tokens as they're generated — ideal for chat UIs where users see real-time output. Both cost the same.

Can I use multiple models at the same time?

Yes. The same API Key works with every model you have access to — simply set a different model parameter in each request. There is no need to create a separate key for each model.