Qwen3.6-Flash

Copied!
Try AIAdd to Compare
Text GenerationVisual UnderstandingReasoning

Overview

Text GenerationVisual UnderstandingReasoning

The Qwen3.6 native vision-language Flash model series delivers a significant performance boost over the 3.5-Flash version. This model particularly excels in agentic coding capabilities, substantially outperforming its predecessor on multiple code-agent benchmarks, as well as in mathematical and code reasoning. In terms of vision, it features markedly improved spatial intelligence, with especially notable enhancements in object localization and object detection.

Input

ImageTextVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.25Per 1M tokens
  • Output
    $1.5Per 1M tokens
  • Explicit Cache Creation
    $0.3125Per 1M tokens
  • Explicit Cache Read
    $0.025Per 1M tokens
  • Input
    $0.25Per 1M tokens
  • Output
    $1.5Per 1M tokens
  • Explicit Cache Creation
    $0.3125Per 1M tokens
  • Explicit Cache Read
    $0.025Per 1M tokens

Context

Context
1M
Max Input
991.80K
Max Output
65.53K

Rate Limits

  • RPMRequests Per Minute
    15K
  • TPMTokens Per Minute
    5M

Built-in Tools

web_extractorResponses API
web_searchResponses API
code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API

API Reference

Get API Key
Copied!
12345678910111213141516171819202122232425262728293031
import os
from dashscope import Generation
import dashscope
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who are you?"},
]
response = Generation.call(
    # If the environment variable is not set, replace it with your Model Studio API key: api_key = "sk-xxx",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model="qwen3.6-flash",
    messages=messages,
    result_format="message",
    # Enable deep thinking
    enable_thinking=True,
)

if response.status_code == 200:
    # Print thinking process
    print("=" * 20 + "Thinking process" + "=" * 20)
    print(response.output.choices[0].message.reasoning_content)
    
    # Print response
    print("=" * 20 + "Full response" + "=" * 20)
    print(response.output.choices[0].message.content)
else:
    print(f"HTTP return code: {response.status_code}")
    print(f"Error code: {response.code}")
    print(f"Error message: {response.message}")