QVQ-Max

Copied!
Try AIAdd to Compare
ReasoningVisual Understanding

Overview

ReasoningVisual Understanding

The Tongyi Qianwen QVQ visual reasoning model supports visual input and chain-of-thought output, demonstrating stronger capabilities in mathematics, programming, visual analysis, creation, and general tasks.

Input

TextImageVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $1.2Per 1M tokens
  • Output
    $4.8Per 1M tokens

Context

Context
131.07K
Max Input
106.49K
Max Output
8.19K

Rate Limits

  • RPMRequests Per Minute
    60
  • TPMTokens Per Minute
    100K

API Reference

Get API Key
Copied!
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657