Qwen-Rerank
Embedding
Overview
A text-ranking model built on the Qwen LLM foundation. Given an input query and a set of candidate documents, it ranks the documents by their relevance to the query. It supports over 100 languages and long-text inputs, and is suitable for applications such as text retrieval and RAG. Its performance is aligned with the open-source Qwen3-Rerank series models.
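To make the input and output of relevance ranking concrete, here is a minimal sketch. It does not call the model: `overlap_score` is a hypothetical stand-in scorer based on simple word overlap, and `rerank`, `score_fn`, and `top_n` are illustrative names rather than part of any Qwen API. In a real pipeline the scores would come from the reranking model instead.

```python
from typing import Callable, List, Optional, Tuple


def overlap_score(query: str, document: str) -> float:
    """Toy stand-in scorer (assumption, not the model):
    fraction of query words that also appear in the document."""
    query_words = set(query.lower().split())
    doc_words = set(document.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & doc_words) / len(query_words)


def rerank(
    query: str,
    documents: List[str],
    score_fn: Callable[[str, str], float] = overlap_score,
    top_n: Optional[int] = None,
) -> List[Tuple[int, float]]:
    """Return (original_index, score) pairs sorted by descending score."""
    scored = [(i, score_fn(query, doc)) for i, doc in enumerate(documents)]
    # Python's sort is stable, so tied documents keep their original order.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_n is None else scored[:top_n]
```

In a RAG setting, a first-stage retriever would typically supply `documents`, and only the top-ranked few would be passed on to the generator.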
Input: Text
Output: Text
Features
Prefix Completion
Function Calling
Cache
Structured Outputs
Batches
Web Search
Pricing
- Text Input: $0.1 per 1M tokens
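As a rough illustration of the pricing arithmetic, the sketch below estimates the cost of a workload from its input token count. It assumes the listed price is in US dollars and applies linearly to all input tokens; `estimate_cost_usd` is a hypothetical helper name, and actual billing rules may differ.

```python
from decimal import Decimal

# Listed price: $0.1 per 1M input tokens (currency assumed to be USD).
PRICE_PER_MILLION_TOKENS_USD = Decimal("0.1")


def estimate_cost_usd(input_tokens: int) -> Decimal:
    """Estimate cost as tokens / 1,000,000 * price per million tokens."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return PRICE_PER_MILLION_TOKENS_USD * Decimal(input_tokens) / Decimal(1_000_000)
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.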
Context
- Context: 32.76K
- Max Input: 32.76K
- Max Output: -
Rate Limits
- RPM (Requests Per Minute): 5.40K
- TPM (Tokens Per Minute): 5B
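The limits above translate into simple client-side planning arithmetic. The following sketch assumes that 5.40K means 5,400 requests and 5B means 5,000,000,000 tokens per minute; the function names are hypothetical, and the service's actual enforcement (windowing, bursts) is not specified here.

```python
# Assumed exact values for the displayed limits (5.40K RPM, 5B TPM).
RPM_LIMIT = 5_400
TPM_LIMIT = 5_000_000_000


def min_request_interval_seconds(rpm: int = RPM_LIMIT) -> float:
    """Smallest average spacing between requests that stays within the RPM limit."""
    return 60.0 / rpm


def fits_in_one_minute(
    num_requests: int,
    total_tokens: int,
    rpm: int = RPM_LIMIT,
    tpm: int = TPM_LIMIT,
) -> bool:
    """Check whether a planned one-minute workload respects both limits."""
    return num_requests <= rpm and total_tokens <= tpm
```

At 5,400 RPM the average spacing works out to 60 / 5400 ≈ 0.011 seconds between requests.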
API Reference