QVQ-Max
Copied!
Try AIAdd to Compare
ReasoningVisual Understanding
Overview
ReasoningVisual Understanding
The Tongyi Qianwen QVQ visual reasoning model supports visual input and chain-of-thought output, demonstrating stronger capabilities in mathematics, programming, visual analysis, creation, and general tasks.
Input
TextImageVideo
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$1.2Per 1M tokens
- Output$4.8Per 1M tokens
Context
Context
131.07K
Max Input
106.49K
Max Output
8.19K
Rate Limits
- RPMRequests Per Minute60
- TPMTokens Per Minute100K
API Reference
Get API KeyCopied!
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657