DeepSeek
Copied!
Try AIAdd to Compare
Text GenerationReasoning
Overview
Text GenerationReasoning
A highly efficient, lightweight MoE model with 284 billion parameters in total and 13 billion activated parameters, natively supporting context windows of up to one million tokens. It offers fast inference speed, low latency, and cost-effective invocation, delivering well-balanced overall performance. Designed for high-concurrency, lightweight workloads, it is ideally suited for common, essential use cases such as everyday dialogue, content creation, basic RAG applications, and batch text processing.
Input
Text
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$0.2Per 1M tokens
- Output$0.4Per 1M tokens
- Input(Implicit Cache)$0.04Per 1M tokens
Context
Context
1M
Max Input
1M
Max Output
393.21K
Rate Limits
- RPMRequests Per Minute10K
- TPMTokens Per Minute1.20M
API Reference
Get API KeyCopied!
123456789101112131415161718192021222324252627282930