Qwen3.6-Flash

Copied!

Try AIAdd to Compare

Text GenerationVisual UnderstandingReasoning

Overview

Text GenerationVisual UnderstandingReasoning

The Qwen3.6 native vision-language Flash model series delivers a significant performance boost over the 3.5-Flash version. This model particularly excels in agentic coding capabilities, substantially outperforming its predecessor on multiple code-agent benchmarks, as well as in mathematical and code reasoning. In terms of vision, it features markedly improved spatial intelligence, with especially notable enhancements in object localization and object detection.

Input

ImageTextVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

Input
$0.25Per 1M tokens
Output
$1.5Per 1M tokens
Explicit Cache Creation
$0.3125Per 1M tokens
Explicit Cache Read
$0.025Per 1M tokens

Input
$0.25Per 1M tokens
Output
$1.5Per 1M tokens
Explicit Cache Creation
$0.3125Per 1M tokens
Explicit Cache Read
$0.025Per 1M tokens

Context

Max Input

991.80K

Max Output

65.53K

Rate Limits

RPMRequests Per Minute
15K
TPMTokens Per Minute
5M

Built-in Tools

web_extractorResponses API

web_searchResponses API

code_interpreterResponses API

i2i_searchResponses API

t2i_searchResponses API

API Reference

Get API Key

Copied!

12345678910111213141516171819202122232425262728293031

import os
from dashscope import Generation
import dashscope
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who are you?"},
]
response = Generation.call(
    # If the environment variable is not set, replace it with your Model Studio API key: api_key = "sk-xxx",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model="qwen3.6-flash",
    messages=messages,
    result_format="message",
    # Enable deep thinking
    enable_thinking=True,
)

if response.status_code == 200:
    # Print thinking process
    print("=" * 20 + "Thinking process" + "=" * 20)
    print(response.output.choices[0].message.reasoning_content)
    
    # Print response
    print("=" * 20 + "Full response" + "=" * 20)
    print(response.output.choices[0].message.content)
else:
    print(f"HTTP return code: {response.status_code}")
    print(f"Error code: {response.code}")
    print(f"Error message: {response.message}")