Unified AI Model
API Gateway

Better pricing, better stability, OpenAI-compatible API. One interface to call all major AI models.

https://token.bitbeam.cn
/v1/chat/completions
/v1/embeddings
/v1/models
/v1/rerank
/v1/images/generations
/v1/audio/speech
/v1/chat/completions

Supported Providers

OpenAI
OpenAI
Anthropic
Anthropic
Google
Google
Kimi
Kimi
MiniMax
MiniMax
Zhipu AI
Zhipu AI
Qwen
Qwen
DeepSeek
DeepSeek

Core Features

Enterprise-grade AI gateway platform, ready to use

OpenAI Compatible

Fully compatible with OpenAI SDK, switch models with one line of code

Multi-Provider

Unified access to OpenAI, Anthropic, DeepSeek and other major providers

Usage-Based Billing

Token-level metering and billing with prepaid wallet support

Wallet System

Multi-tier wallet system with recharge, redeem codes, and online payment

Token Management

Multiple API key management with group multipliers, limits and allowlists

Circuit Breaker

Auto-detect upstream failures and intelligently switch to backup channels

Analytics

Real-time usage monitoring, trend analysis, and multi-dimensional rankings

Security

IP blacklist, rate limiting, TOTP two-factor authentication

Frequently Asked Questions

What is NextRouter AI Token? How does it compare to OpenRouter and One API?

NextRouter AI Token is a leading AI model API aggregation platform, similar to OpenRouter but optimized for the Chinese market. Compared to OpenRouter, NextRouter supports CNY billing, direct connections to domestic providers (DeepSeek, Qwen, Zhipu AI, Kimi), and lower latency, while maintaining full OpenAI SDK compatibility. Compared to One API, NextRouter offers complete commercial capabilities: multi-tier agent system, wallet billing, redeem codes, and analytics dashboards.

Which AI models are supported? Can I use GPT-4, Claude, and DeepSeek?

Supports 30+ mainstream models including: OpenAI GPT-4.1/GPT-4o/o3/o4-mini, Anthropic Claude Opus 4/Sonnet 4/Haiku, DeepSeek-V3/R1, Google Gemini 2.5 Pro/Flash, Qwen-Max/Plus/Turbo/QwQ, Zhipu GLM-4-Plus, MiniMax M2.7, Kimi K2.5, and more. Covers chat, reasoning, embedding, image generation, and vision understanding.

How do I get started? Do I need to change my code?

Fully compatible with OpenAI SDK (Python / Node.js / Go / Java / Rust). Just replace base_url and api_key. Works with ChatGPT, Cursor, Cherry Studio, LobeChat, Cline, and all OpenAI-compatible tools — a true drop-in replacement.

How is pricing calculated? Is it cheaper than OpenAI or OpenRouter?

Billed per token (priced per million tokens), varies by model — check real-time pricing in the Model Market. Domestic models (DeepSeek, Qwen) are priced via direct connection, much cheaper than OpenRouter's international routing. International models (GPT-4, Claude) are competitively priced through multi-channel aggregation.

Does it support Streaming and Function Calling?

Yes, fully supported. Set stream: true for real-time SSE responses. Function Calling / Tools are compatible with OpenAI format, supported by GPT-4.1, Claude Sonnet 4, Qwen-Max, GLM-4, and more.

How to use DeepSeek API? What's the difference between DeepSeek-V3 and R1?

DeepSeek-V3 is a general chat model for everyday Q&A and coding; DeepSeek-R1 is a deep reasoning model for math, logic, and complex analysis. Both are accessible via OpenAI-compatible API with model parameters deepseek-chat and deepseek-reasoner, priced at just ¥3/million tokens.

How to access Claude API? How much does Claude Opus 4 cost?

Access the full Claude model lineup through NextRouter AI Token with no VPN needed. Claude Opus 4 for deep analysis, Sonnet 4 for best value, Haiku 4.5 for ultra-fast responses. All Claude models support 200K context, vision, and tool use.

What is the Agent program? How do agents earn from markup?

Agents purchase tokens at platform base price and set their own markup rate for downstream customers. For example, base price ¥10/M tokens with 50% markup means customer pays ¥15, agent earns ¥5. The platform provides customer management, dashboards, redeem codes, and automatic profit sharing.

How is data security ensured? Is my conversation data stored?

The platform only forwards requests — no conversation data is stored. API Keys are stored as SHA-256 hashes, upstream keys are AES-256 encrypted. Enterprise-grade security includes IP whitelisting, rate limiting, daily spend caps, and TOTP two-factor authentication.

What happens when an upstream provider goes down?

Built-in circuit breaker and automatic failover. After 5 consecutive failures, the channel is automatically bypassed for 30 seconds and requests route to backup providers. Supports multi-provider configuration per model with priority and weight-based routing for 99.9% availability.

Get Started with NextRouter AI Token

Register to get your API key and connect to all major AI models in minutes