Qwen3.6-Max

Text Generation, Reasoning

Overview

Qwen3.6-Max, the largest and most capable model in the Qwen3.6 series, is now available as a preview. For now, only its plain-text capabilities are open for experimentation. Compared with the previously released Qwen3-Max and Qwen3.6-Plus, it offers stronger vibe coding abilities, more efficient coding-agent execution, and significantly improved front-end development skills. Its retention of long-tail knowledge has also been improved.

Input

Text

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input: $1.3 per 1M tokens
  • Output: $7.8 per 1M tokens
  • Explicit Cache Creation: $1.625 per 1M tokens
  • Explicit Cache Read: $0.13 per 1M tokens
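
To make the per-token rates above concrete, here is a minimal sketch of a cost estimator. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes each token category is billed independently at its listed rate; the page does not say how cached tokens interact with regular input billing, so treat the result as a rough estimate.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing list on this page.
PRICE_PER_MILLION = {
    "input": Decimal("1.3"),
    "output": Decimal("7.8"),
    "cache_creation": Decimal("1.625"),
    "cache_read": Decimal("0.13"),
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_creation_tokens=0, cache_read_tokens=0):
    """Return an estimated cost in USD as a Decimal.

    Assumption: every category is billed separately at its listed rate.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_creation": cache_creation_tokens,
        "cache_read": cache_read_tokens,
    }
    total = sum(
        (PRICE_PER_MILLION[kind] * n for kind, n in counts.items()),
        Decimal(0),
    )
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: a request with 10,000 input tokens and 2,000 output tokens.
    print(estimate_cost(input_tokens=10_000, output_tokens=2_000))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding noise.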

Context

  • Context: 262.14K tokens
  • Max Input: 245.76K tokens
  • Max Output: 65.53K tokens
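
A client can check a request against these limits before sending it. The sketch below is illustrative only: the constant and function names are hypothetical, the limits are read literally from the rounded figures on this page (the exact values may differ slightly), and the rule that input plus requested output must fit in the context window is an assumption the page does not state.

```python
# Limits as displayed on this page, read as thousands of tokens (assumption:
# the page rounds them, so the true limits may be a few tokens higher).
CONTEXT_WINDOW = 262_140
MAX_INPUT = 245_760
MAX_OUTPUT = 65_530


def check_budget(input_tokens, max_output_tokens):
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    if input_tokens > MAX_INPUT:
        problems.append(f"input {input_tokens} exceeds max input {MAX_INPUT}")
    if max_output_tokens > MAX_OUTPUT:
        problems.append(
            f"output {max_output_tokens} exceeds max output {MAX_OUTPUT}"
        )
    # Assumption: input and output share the same context window.
    if input_tokens + max_output_tokens > CONTEXT_WINDOW:
        problems.append(
            f"input + output {input_tokens + max_output_tokens} "
            f"exceeds context window {CONTEXT_WINDOW}"
        )
    return problems


if __name__ == "__main__":
    for issue in check_budget(250_000, 8_000):
        print(issue)
```

Token counts have to come from a tokenizer or from the usage figures the API returns; this helper only compares numbers.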

Rate Limits

  • RPM (Requests Per Minute): 600
  • TPM (Tokens Per Minute): 1M
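
To stay under both limits from the client side, requests can be throttled locally. The following is a minimal sketch, not an official mechanism: the class `MinuteWindowLimiter` is hypothetical, and it assumes a 60-second sliding window, whereas the page does not describe how the service actually measures RPM and TPM.

```python
import time
from collections import deque


class MinuteWindowLimiter:
    """Client-side limiter over a 60-second sliding window (an assumption)."""

    def __init__(self, rpm=600, tpm=1_000_000, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock
        self.events = deque()  # (timestamp, tokens) for accepted requests
        self.tokens_in_window = 0

    def _evict(self, now):
        # Drop accepted requests that are 60 seconds old or older.
        while self.events and now - self.events[0][0] >= 60.0:
            _, tokens = self.events.popleft()
            self.tokens_in_window -= tokens

    def try_acquire(self, tokens):
        """Record the request and return True if it fits both limits."""
        now = self.clock()
        self._evict(now)
        if len(self.events) + 1 > self.rpm:
            return False
        if self.tokens_in_window + tokens > self.tpm:
            return False
        self.events.append((now, tokens))
        self.tokens_in_window += tokens
        return True


if __name__ == "__main__":
    limiter = MinuteWindowLimiter()
    if limiter.try_acquire(tokens=1_500):
        print("OK to send the request")
    else:
        print("Wait before sending")
```

The `clock` parameter exists so the limiter can be tested with a fake clock; callers that get `False` would typically sleep briefly and retry.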

API Reference

import os
from dashscope import Generation
import dashscope
dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1'

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Who are you?"},
]
response = Generation.call(
    # If the environment variable is not set, replace it with your Model Studio API key: api_key = "sk-xxx",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    model="qwen3.6-max-preview",
    messages=messages,
    result_format="message",
    # Enable deep thinking
    enable_thinking=True,
)

if response.status_code == 200:
    # Print thinking process
    print("=" * 20 + "Thinking process" + "=" * 20)
    print(response.output.choices[0].message.reasoning_content)
    
    # Print response
    print("=" * 20 + "Full response" + "=" * 20)
    print(response.output.choices[0].message.content)
else:
    print(f"HTTP return code: {response.status_code}")
    print(f"Error code: {response.code}")
    print(f"Error message: {response.message}")