Qwen3.6-Max-Preview - Qwen Cloud

Qwen3.6-Max

Text Generation, Reasoning

Overview

Qwen3.6-Max, the largest and most capable model in the Qwen3.6 series, is now available as a preview. At present, only its text-only capabilities are open for use. Compared with the previously released Qwen3-Max and Qwen3.6-Plus, it offers stronger vibe coding, more efficient coding-agent execution, and significantly improved front-end development. Its retention of long-tail knowledge has also been improved.

Input

Text

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

Input: $1.3 per 1M tokens
Output: $7.8 per 1M tokens
Explicit Cache Creation: $1.625 per 1M tokens
Explicit Cache Read: $0.13 per 1M tokens
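
The per-token prices above translate into a request cost by simple arithmetic. The following is a minimal sketch of that calculation; estimate_cost is a hypothetical helper, not part of any SDK, and it assumes each token category is billed independently at its listed rate (how cached tokens interact with regular input billing is not stated on this page).

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
PRICE_PER_MILLION = {
    "input": 1.3,
    "output": 7.8,
    "cache_creation": 1.625,
    "cache_read": 0.13,
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_creation_tokens=0, cache_read_tokens=0):
    """Hypothetical helper: estimated cost in USD for one request.

    Assumes every category is billed separately at its listed rate.
    """
    usage = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_creation": cache_creation_tokens,
        "cache_read": cache_read_tokens,
    }
    return sum(PRICE_PER_MILLION[kind] * count / 1_000_000
               for kind, count in usage.items())


# 10,000 input tokens and 2,000 output tokens:
# 0.013 + 0.0156 = 0.0286 USD
print(f"${estimate_cost(input_tokens=10_000, output_tokens=2_000):.4f}")
```

Under these prices, output tokens cost six times as much as input tokens, so long responses (including reasoning output, if it is billed as output) tend to dominate the bill.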

Context: 262.14K tokens

Max Input: 245.76K tokens

Max Output: 65.53K tokens
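
These limits can be checked before sending a request. The sketch below is illustrative only: the exact integer values are assumptions inferred from the rounded figures above (262.14K as 262,144, 245.76K as 245,760, 65.53K as 65,536), fits_limits is a hypothetical helper, and the rule that input plus requested output must fit within the context window is an assumption, not something this page states.

```python
# Assumed exact values behind the rounded figures on this page.
CONTEXT_WINDOW = 262_144  # 262.14K
MAX_INPUT = 245_760       # 245.76K
MAX_OUTPUT = 65_536       # 65.53K


def fits_limits(input_tokens, max_output_tokens):
    """Hypothetical pre-flight check against the listed limits.

    Assumes input and requested output share the context window.
    """
    return (
        input_tokens <= MAX_INPUT
        and max_output_tokens <= MAX_OUTPUT
        and input_tokens + max_output_tokens <= CONTEXT_WINDOW
    )


print(fits_limits(200_000, 32_000))  # within all three limits
print(fits_limits(245_760, 65_536))  # each limit met, but the sum exceeds the window
```

Token counts depend on the model's tokenizer, so input_tokens here should come from an actual token count rather than a character or word count.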

Rate Limits

RPM (Requests Per Minute): 600
TPM (Tokens Per Minute): 1M
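
A limit of 600 requests per minute works out to one request every 0.1 seconds on average, and 1M tokens per minute spread over 600 requests is roughly 1,667 tokens per request at full request rate. One way to stay under the request limit on the client side is to pace calls, as in the sketch below. RequestPacer is a hypothetical class, not part of the DashScope SDK, and how the service actually measures and enforces its limits is not described on this page.

```python
import time

RPM_LIMIT = 600                  # from the rate limits above
MIN_INTERVAL = 60.0 / RPM_LIMIT  # 0.1 seconds between requests


class RequestPacer:
    """Hypothetical client-side pacer that spaces requests evenly."""

    def __init__(self, min_interval=MIN_INTERVAL,
                 clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._next_allowed = 0.0

    def wait(self):
        """Sleep until the next request slot, reserve it, return seconds slept."""
        now = self._clock()
        delay = max(0.0, self._next_allowed - now)
        if delay:
            self._sleep(delay)
        # Reserve the following slot one interval after this one.
        self._next_allowed = max(now, self._next_allowed) + self.min_interval
        return delay


pacer = RequestPacer()
for _ in range(3):
    pacer.wait()  # call this immediately before each API request
```

Pacing only addresses the request limit; a token budget per minute would need separate tracking of the tokens each request consumes.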

API Reference

import os

import dashscope
from dashscope import Generation

# Send requests to the international endpoint.
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who are you?"},
]
response = Generation.call(
    # If the environment variable is not set, replace it with your Model Studio API key: api_key = "sk-xxx",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model="qwen3.6-max-preview",
    messages=messages,
    result_format="message",
    # Enable deep thinking
    enable_thinking=True,
)

if response.status_code == 200:
    # Print thinking process
    print("=" * 20 + "Thinking process" + "=" * 20)
    print(response.output.choices[0].message.reasoning_content)
    
    # Print response
    print("=" * 20 + "Full response" + "=" * 20)
    print(response.output.choices[0].message.content)
else:
    print(f"HTTP return code: {response.status_code}")
    print(f"Error code: {response.code}")
    print(f"Error message: {response.message}")