15+ Chinese AI Models · Enterprise Infrastructure · Stable

One KeyAccess All Chinese LLMs

Helin API is OpenAI-compatible, unifying DeepSeek, Qwen, GLM, Kimi, iFlytek Spark, MiniMax and more into a single platform. Manage tokens, billing, logs and pricing in one console.

Base URLhttps://helinapi.com

/v1/chat/completions·/v1/models·/v1/embeddings·/v1/images/generations·/v1/audio/transcriptions·/v1/chat/completions·/v1/models·/v1/embeddings·/v1/images/generations·/v1/audio/transcriptions

<50ms

IEPL Latency

99.7%

30-Day Uptime

15+

Chinese LLMs

Zero

Data Retention

Model Marketplace

DeepSeek Full Lineup

OpenAI-compatible. Pay per token. More vendors coming soon.

Flagship

DeepSeek

V4-Pro

Latest flagship model with 1M context window. Superior reasoning, comprehension and generation. Surpasses GPT-5.5 at 1/10 the price.

1M ContextThinking Mode

¥4.50

/ 1M input tokens · ¥9.00 output

Popular

DeepSeek

V4-Flash

Best value model. 1M context. Integrated R1 reasoning at 33% lower cost than V3.2.

1M ContextThinking Mode

¥1.50

/ 1M input tokens · ¥3.00 output

HOT

DeepSeek

V3.2

Proven general-purpose model, 160K context

160K

¥3.00

/ 1M input · ¥4.50 output

DeepSeek

Deep reasoning specialist, 32K chain-of-thought

128KReasoning

¥6.00

/ 1M input · ¥24.00 output

Value

DeepSeek

R1-Distill-Qwen-32B

Distilled R1, 32B params, beats OpenAI-o1-mini

32B

¥0.03

/ request

DeepSeek

R1-Distill-Qwen-14B

Distilled R1, 14B, balanced performance

14B

¥0.0225

/ request

DeepSeek

R1-Distill-Qwen-7B

Distilled R1, 7B, lightweight reasoning

¥0.015

/ request

DeepSeek

R1-Distill-Qwen-1.5B

Distilled R1, 1.5B, ultra-lightweight

1.5B

¥0.015

/ request

DeepSeek

Coder-33B

Code specialist, 87% code training data, 16K window

16KCode

¥0.03

/ request

DeepSeek

R1-Search

R1 with web search, real-time info retrieval

Web

¥9.00

/ 1M tokens

DeepSeek

V3-Search

V3 with web search

Web

¥4.50

/ 1M tokens

DeepSeek

OCR-2

High-precision image text recognition

OCR

¥0.03

/ request

Qwen · GLM · Kimi · iFlytek Spark · MiniMax coming soon.

Platform

Enterprise AI Gateway

Beyond API relay - a complete AI model access solution with infrastructure, compliance and developer tooling.

⚡

IEPL Accelerated

Dedicated line to upstream providers, <50ms latency. Auto failover and load balancing for high concurrency.

🛡

Full Compliance

ICP licensed, Level-3 certified. Chinese models only. Zero data retention - prompts and completions are never stored.

🔧

Drop-in Replacement

Swap Base URL and API Key. Chat, Stream, Function Calling, Vision all supported. Works with all OpenAI-compatible tools.

Pricing

Pay As You Go

Real-time token billing. Balance never expires. Enterprise invoicing and VAT fapiao available.

Starter

¥50

Instant access

All models
Real-time billing
No expiry

Get Started

Popular

¥500

+ ¥50 bonus

All models
Priority support
Bank transfer
VAT invoice

Get Started

Business

¥1,000

+ ¥150 bonus

All models
Dedicated manager
Custom pricing
Concurrency SLA

Enterprise

¥5,000

+ ¥1,000 bonus

All models
99.9% SLA
Private deployment
24/7 support

FAQ

Questions?

Which models are supported?: DeepSeek V4-Pro/V4-Flash/V3.2/R1, R1-Distill series, Coder-33B, OCR-2 and more. 15+ Chinese LLMs with more being added.
How do I integrate?: Set Base URL to https://helinapi.com and use your API Key from the dashboard. Compatible with OpenAI SDK, Cherry Studio, LobeChat, Cursor, Claude Code and all OpenAI-format tools.
Is my data secure?: Prompts and completions are never stored. Licensed operator with Level-3 security certification. Chinese models only - no cross-border data flows.
Do you support enterprise purchasing?: Yes. Bank transfer, VAT invoices, SLA agreements available. Contact support to set up an enterprise account.

One KeyAccess All Chinese LLMs

DeepSeek Full Lineup

Enterprise AI Gateway

IEPL Accelerated

Full Compliance

Drop-in Replacement

Pay As You Go

Questions?

Ready to Start?