Qwen-Flash
Copied!
Try AIAdd to Compare
ReasoningText Generation
Overview
ReasoningText Generation
The Qwen3 Flash model (snapshot 2025-07-28) offers a powerful fusion of thinking and non-thinking modes with dynamic in-conversation switching, excelling in complex reasoning while showing significant gains in instruction following and text comprehension. It supports a 1M context length and is billed on a tiered model corresponding to context usage.
Input
Text
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$0.05Per 1M tokens
- Output$0.4Per 1M tokens
- Input$0.05Per 1M tokens
- Output$0.4Per 1M tokens
Context
Context
1M
Max Input
995.90K
Max Output
32.76K
Rate Limits
- RPMRequests Per Minute600
- TPMTokens Per Minute5M
API Reference
Get API KeyCopied!
123456789101112131415161718192021222324252627282930