Voice-Enrollment
Copied!
Text-to-Speech
Overview
Text-to-Speech
A large-model voice replication service used in conjunction with Cosyvoice-v3. Utilizing advanced large-model technology for feature extraction, it can replicate voices without a training process. Only a very short audio clip is required to quickly generate a highly similar and natural-sounding custom voice.
Input
Audio
Output
Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Rate Limits
- RPMRequests Per Minute600
API Reference
Get API KeyCopied!
123456789101112