Qwen3-Open-Source
Visual Understanding
Overview
The Qwen3-VL 8B dense model has a reduced memory footprint and delivers comprehensive improvements in image and video understanding, ultra-long context support (e.g., long videos and documents), spatial perception, and object recognition, enabling it to handle complex real-world tasks.
Input
Text, Image, Video
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input: $0.18 per 1M tokens
- Output: $0.7 per 1M tokens
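As a rough illustration of how the per-token rates above turn into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` and the constants are hypothetical helpers, not part of any official SDK; the calculation assumes simple linear pricing with no caching or batch discounts.

```python
from decimal import Decimal

# Rates taken from the pricing list above (USD per 1M tokens).
INPUT_USD_PER_M = Decimal("0.18")
OUTPUT_USD_PER_M = Decimal("0.7")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming plain linear pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 2K-token reply.
    print(estimate_cost_usd(100_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.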
Context
- Context: 131.07K tokens
- Max Input: 129.02K tokens
- Max Output: 32.76K tokens
Rate Limits
- RPM (Requests Per Minute): 60
- TPM (Tokens Per Minute): 100K
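A client can stay under both limits by tracking its own recent requests. The sketch below is one possible client-side approach, assuming a 60-second sliding window; how the service actually measures its window is not stated here, so treat the window model as an assumption. The class `SlidingWindowLimiter` and its methods are hypothetical names used for illustration.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side guard for RPM and TPM limits (assumed 60 s sliding window)."""

    def __init__(self, rpm=60, tpm=100_000, window_s=60.0, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.window_s = window_s
        self.clock = clock
        self._events = deque()  # (timestamp, tokens) per recorded request

    def _prune(self, now):
        # Drop requests that have aged out of the window.
        while self._events and now - self._events[0][0] >= self.window_s:
            self._events.popleft()

    def wait_time(self, tokens):
        """Return seconds to wait before a request of `tokens` fits (0.0 if now)."""
        if tokens > self.tpm:
            raise ValueError("request exceeds the per-minute token budget")
        now = self.clock()
        self._prune(now)
        requests = len(self._events)
        used = sum(t for _, t in self._events)
        if requests < self.rpm and used + tokens <= self.tpm:
            return 0.0
        # Find the earliest expiry after which the request fits both limits.
        for ts, t in self._events:
            requests -= 1
            used -= t
            if requests < self.rpm and used + tokens <= self.tpm:
                return ts + self.window_s - now
        return 0.0

    def record(self, tokens):
        """Record a request that was just sent."""
        self._events.append((self.clock(), tokens))
```

In use, a caller would check `wait_time(n)` before each request, `time.sleep` for the returned duration if it is positive, then send the request and call `record(n)`. The token count `n` should cover whatever the service counts toward TPM, which is not specified here.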
API Reference