Qwen3-Open-Source

Visual Understanding

Overview

The Qwen3 series VL models have been comprehensively upgraded in areas such as visual coding and spatial perception. Their visual perception and recognition capabilities have improved significantly, they support the understanding of ultra-long videos, and their OCR functionality has been substantially enhanced.

Input

Text, Image, Video

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.4 per 1M tokens
  • Output
    $1.6 per 1M tokens
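
As a rough illustration of how the per-token prices listed above turn into a per-request cost, here is a minimal sketch. The helper name `estimate_cost` is hypothetical, and it assumes you already know the billed token counts; how image and video inputs are converted to tokens is not stated on this page.

```python
from decimal import Decimal

# Prices copied from the pricing list above (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.4")
OUTPUT_PRICE_PER_M = Decimal("1.6")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes billing is strictly linear in token counts, with no
    discounts, caching effects, or minimum charges.
    """
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a request with 10,000 input tokens and 2,000 output tokens.
    print(estimate_cost(10_000, 2_000))
```

Decimal is used instead of float so that small per-request amounts add up without binary rounding drift when summed over many calls.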

Context

  • Context
    131.07K tokens
  • Max Input
    129.02K tokens
  • Max Output
    32.76K tokens
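
The three limits above can be checked before sending a request. The sketch below is only illustrative: the function name `fits_limits` is hypothetical, the constants use the rounded figures shown on this page (the exact limits may differ slightly), and it assumes that input and output together must also fit inside the context window, which this page does not state explicitly.

```python
# Rounded limits as displayed above; exact values are an assumption.
CONTEXT_TOKENS = 131_070
MAX_INPUT_TOKENS = 129_020
MAX_OUTPUT_TOKENS = 32_760


def fits_limits(input_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within all three limits (hypothetical helper)."""
    if input_tokens > MAX_INPUT_TOKENS:
        return False
    if max_output_tokens > MAX_OUTPUT_TOKENS:
        return False
    # Assumption: prompt plus requested output must fit in the context window.
    return input_tokens + max_output_tokens <= CONTEXT_TOKENS


if __name__ == "__main__":
    print(fits_limits(100_000, 30_000))
    print(fits_limits(129_020, 32_760))
```

Under this assumption, a prompt at the maximum input size leaves only a small remainder of the context window for output, so the requested output length has to shrink as the prompt grows.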

Rate Limits

  • RPM (Requests Per Minute)
    60
  • TPM (Tokens Per Minute)
    100K
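
To stay under the limits above on the client side, one option is a small sliding-window budget that tracks requests and tokens over the last 60 seconds. This is a minimal sketch, not the provider's algorithm: the class name `MinuteBudget` is hypothetical, and whether the service enforces a sliding or fixed window is not stated on this page.

```python
import time
from collections import deque

# Limits copied from the list above.
RPM_LIMIT = 60
TPM_LIMIT = 100_000
WINDOW_SECONDS = 60.0


class MinuteBudget:
    """Client-side sliding-window tracker for RPM and TPM (hypothetical helper)."""

    def __init__(self, rpm=RPM_LIMIT, tpm=TPM_LIMIT, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock            # injectable so the class can be tested
        self.events = deque()         # (timestamp, tokens) for recent requests
        self.tokens_in_window = 0     # running token total for the window

    def _evict(self, now):
        # Drop requests that are at least one full window old.
        while self.events and now - self.events[0][0] >= WINDOW_SECONDS:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def try_acquire(self, tokens):
        """Record a request and return True if it fits; otherwise return False."""
        now = self.clock()
        self._evict(now)
        if len(self.events) >= self.rpm:
            return False
        if self.tokens_in_window + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        self.tokens_in_window += tokens
        return True
```

A caller would check `try_acquire(estimated_tokens)` before each request and wait briefly and retry when it returns False.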

API Reference
