Qwen3.5-Plus

Copied!
Try AIAdd to Compare
ReasoningVisual UnderstandingText Generation

Overview

ReasoningVisual UnderstandingText Generation

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency. In a variety of task evaluations, the 3.5 series consistently demonstrates performance on par with state-of-the-art leading models. Compared to the 3 series, these models show a leap forward in both pure-text and multimodal capabilities.This version is a snapshot as of February 15, 2026.

Input

TextImageVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.4Per 1M tokens
  • Output
    $2.4Per 1M tokens
  • Input
    $0.4Per 1M tokens
  • Output
    $2.4Per 1M tokens

Context

Standard

Context
1M
Max Input
991.80K
Max Output
65.53K

Thinking

Context
1M
Max Input
983.61K
Max Output
65.53K
Max Reasoning
81.92K

Rate Limits

  • RPMRequests Per Minute
    60
  • TPMTokens Per Minute
    1M

Built-in Tools

web_searchResponses API
web_extractorResponses API
code_interpreterResponses API
t2i_searchResponses API
i2i_searchResponses API

API Reference

Get API Key
Copied!
123456789101112131415161718