Qwen3.7-Plus

Copied!
Try AIAdd to Compare
ReasoningText GenerationVisual Understanding

Overview

ReasoningText GenerationVisual Understanding

Among the Qwen3.7 series, the cost-effective Plus model builds on its robust text capabilities while delivering a comprehensive upgrade to its vision‑language abilities, all while preserving its full‑stack agent‑level intelligence for coding, tool use, and productivity workflows. Its key distinguishing feature is multi‑modal interactive hybrid agent capabilities, enabling it to perceive real‑world scenes, read screens and interact with GUIs, generate code based on visual references, and perform end‑to‑end navigation within mobile apps.This version is a snapshot as of May 26, 2026.

Input

ImageTextVideo

Output

Text

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Input
    $0.4Per 1M tokens
  • Output
    $1.6Per 1M tokens
  • Input(Implicit Cache)
    $0.08Per 1M tokens
  • Explicit Cache Creation
    $0.5Per 1M tokens
  • Explicit Cache Read
    $0.04Per 1M tokens
  • Input
    $0.4Per 1M tokens
  • Output
    $1.6Per 1M tokens
  • Input(Implicit Cache)
    $0.08Per 1M tokens
  • Explicit Cache Creation
    $0.5Per 1M tokens
  • Explicit Cache Read
    $0.04Per 1M tokens

Context

Context
1M
Max Input
991.80K
Max Output
65.53K

Rate Limits

  • RPMRequests Per Minute
    60
  • TPMTokens Per Minute
    1M

Built-in Tools

code_interpreterResponses API
i2i_searchResponses API
t2i_searchResponses API
web_extractorResponses API
web_searchResponses API

API Reference

Get API Key
Copied!
123456789101112131415161718