Qwen3.6-Open-Source

Copied!
Try AIAdd to Compare
ReasoningVisual UnderstandingText Generation

Overview

ReasoningVisual UnderstandingText Generation

The Qwen3.6 35B-A3B native vision-language model is built on a hybrid architecture that integrates linear attention mechanisms with a sparse mixture-of-experts framework, achieving higher inference efficiency. Compared with the 3.5-35B-A3B, this model demonstrates significantly improved agentic coding capabilities, mathematical and code reasoning abilities, spatial intelligence, as well as object localization and object detection performance.

Input

ImageTextVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.375Per 1M tokens
  • Output
    $2.25Per 1M tokens

Context

Context
262.14K
Max Input
260.09K
Max Output
65.53K

Rate Limits

  • RPMRequests Per Minute
    600
  • TPMTokens Per Minute
    1M

Built-in Tools

web_extractorResponses API
web_searchResponses API
code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API

API Reference

Get API Key
Copied!
123456789101112131415161718192021222324252627282930
Copied!
123456789101112131415161718192021222324252627282930