Qwen-Voice-Design

Copied!
Try AIAdd to Compare
Text-to-Speech

Overview

Text-to-Speech

Qwen-Voice-Design model is a series of voice design models from Qianwen Speech Model. It only requires a simple text description to quickly design a suitable voice. When used in conjunction with the qwen3-tts-vd-realtime model, it can design and output speech in 10 languages. Furthermore, the synthesized audio can adaptively adjust its tone based on the text and has good processing capabilities for complex text synthesis.

Input

Text

Output

Audio

Features

Prefix Completion

Function Calling

Cache

Structured Outputs

Batches

Web Search

Pricing

  • Voice Enrollment And Design
    $0.2Per voice

Rate Limits

  • RPMRequests Per Minute
    180

API Reference

Get API Key
Copied!
1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980818283848586