Qwen3.7-Plus
Copied!
Try AIAdd to Compare
ReasoningText GenerationVisual Understanding
Overview
ReasoningText GenerationVisual Understanding
Among the Qwen3.7 series, the cost-effective Plus model builds on its robust text capabilities while delivering a comprehensive upgrade to its vision‑language abilities, all while preserving its full‑stack agent‑level intelligence for coding, tool use, and productivity workflows. Its key distinguishing feature is multi‑modal interactive hybrid agent capabilities, enabling it to perceive real‑world scenes, read screens and interact with GUIs, generate code based on visual references, and perform end‑to‑end navigation within mobile apps.This version is a snapshot as of May 26, 2026.
Input
ImageTextVideo
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$0.4Per 1M tokens
- Output$1.6Per 1M tokens
- Input(Implicit Cache)$0.08Per 1M tokens
- Explicit Cache Creation$0.5Per 1M tokens
- Explicit Cache Read$0.04Per 1M tokens
- Input$0.4Per 1M tokens
- Output$1.6Per 1M tokens
- Input(Implicit Cache)$0.08Per 1M tokens
- Explicit Cache Creation$0.5Per 1M tokens
- Explicit Cache Read$0.04Per 1M tokens
Context
Context
1M
Max Input
991.80K
Max Output
65.53K
Rate Limits
- RPMRequests Per Minute60
- TPMTokens Per Minute1M
Built-in Tools
code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API
web_extractorResponses API
web_searchResponses API
API Reference
Get API KeyCopied!
123456789101112131415161718