Qwen3.6-Flash
Copied!
Try AIAdd to Compare
Text GenerationVisual UnderstandingReasoning
Overview
Text GenerationVisual UnderstandingReasoning
The Qwen3.6 native vision-language Flash model series delivers a significant performance boost over the 3.5-Flash version. This model particularly excels in agentic coding capabilities, substantially outperforming its predecessor on multiple code-agent benchmarks, as well as in mathematical and code reasoning. In terms of vision, it features markedly improved spatial intelligence, with especially notable enhancements in object localization and object detection.
Input
ImageTextVideo
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Input$0.25Per 1M tokens
- Output$1.5Per 1M tokens
- Explicit Cache Creation$0.3125Per 1M tokens
- Explicit Cache Read$0.025Per 1M tokens
- Input$0.25Per 1M tokens
- Output$1.5Per 1M tokens
- Explicit Cache Creation$0.3125Per 1M tokens
- Explicit Cache Read$0.025Per 1M tokens
Context
Context
1M
Max Input
991.80K
Max Output
65.53K
Rate Limits
- RPMRequests Per Minute15K
- TPMTokens Per Minute5M
Built-in Tools
web_extractorResponses API
web_searchResponses API
code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API
API Reference
Get API KeyCopied!
12345678910111213141516171819202122232425262728293031
import os
from dashscope import Generation
import dashscope
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'
messages = [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Who are you?"},
]
response = Generation.call(
# If the environment variable is not set, replace it with your Model Studio API key: api_key = "sk-xxx",
api_key=os.getenv("DASHSCOPE_API_KEY"),
model="qwen3.6-flash",
messages=messages,
result_format="message",
# Enable deep thinking
enable_thinking=True,
)
if response.status_code == 200:
# Print thinking process
print("=" * 20 + "Thinking process" + "=" * 20)
print(response.output.choices[0].message.reasoning_content)
# Print response
print("=" * 20 + "Full response" + "=" * 20)
print(response.output.choices[0].message.content)
else:
print(f"HTTP return code: {response.status_code}")
print(f"Error code: {response.code}")
print(f"Error message: {response.message}")