Qwen3.6-Open-Source
Copied!
Try AIAdd to Compare
ReasoningVisual UnderstandingText Generation
Overview
ReasoningVisual UnderstandingText Generation
The Qwen3.6 35B-A3B native vision-language model is built on a hybrid architecture that integrates linear attention mechanisms with a sparse mixture-of-experts framework, achieving higher inference efficiency. Compared with the 3.5-35B-A3B, this model demonstrates significantly improved agentic coding capabilities, mathematical and code reasoning abilities, spatial intelligence, as well as object localization and object detection performance.
Input
ImageTextVideo
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$0.375Per 1M tokens
- Output$2.25Per 1M tokens
Context
Context
262.14K
Max Input
260.09K
Max Output
65.53K
Rate Limits
- RPMRequests Per Minute600
- TPMTokens Per Minute1M
Built-in Tools
web_extractorResponses API
web_searchResponses API
code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API
API Reference
Get API KeyCopied!
12345678910111213141516171819202122232425262728293031