DeepSeek

Copied!
Try AIAdd to Compare
Text GenerationReasoning

Overview

Text GenerationReasoning

A highly efficient, lightweight MoE model with 284 billion parameters in total and 13 billion activated parameters, natively supporting context windows of up to one million tokens. It offers fast inference speed, low latency, and cost-effective invocation, delivering well-balanced overall performance. Designed for high-concurrency, lightweight workloads, it is ideally suited for common, essential use cases such as everyday dialogue, content creation, basic RAG applications, and batch text processing.

Input

Text

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.2Per 1M tokens
  • Output
    $0.4Per 1M tokens
  • Input(Implicit Cache)
    $0.04Per 1M tokens

Context

Context
1M
Max Input
1M
Max Output
393.21K

Rate Limits

  • RPMRequests Per Minute
    10K
  • TPMTokens Per Minute
    1.20M

API Reference

Get API Key
Copied!
12345678910111213141516171819202122232425262728293031