Model selection
Choose a Model
Select the model by workload, context length, modality, region, latency, and cost. The model list changes over time, so final quotes should reference the official current catalog.
Who this is for
Sales and technical leads preparing a model recommendation.
Configuration reference
Values to confirm before setup
Flagship lane
qwen3.7-max for highest-value reasoning, coding, office, and agent work
Balanced lane
qwen3.7-plus or qwen3.6-plus for multimodal production and business automation
Fast lane
qwen3.6-flash or deepseek-v4-flash for lighter high-speed work
Other lanes
qwen-image-2.0, wan2.7-image, kimi-k2.6, glm-5.1, and MiniMax-M2.5 subject to Token Plan support
Setup flow
Practical steps
- 01Write down the customer task in plain language.
- 02Classify the workload: text, vision, image, video, speech, embeddings, or coding tool use.
- 03Confirm context length and output size.
- 04Check the region/deployment mode where that model is available.
- 05Compare estimated input/output tokens and latency requirements.
- 06Pick a fallback model for cost or quota constraints.
Model lanes
For a customer-facing quote, use model lanes instead of a raw catalog dump: flagship for difficult reasoning, balanced for daily production, fast for support/classification/extraction, and specialist lanes for multimodal or retrieval tasks.
Procurement wording
Say 'recommended model lane subject to official availability' until the exact account, region, and billing route are confirmed.
Common mistakes
Check these before escalating
- A model name available in one deployment mode may not be available in another.
- Prices in examples are references and can change.
- Image/video models may use different APIs than text models.
Related guides
Model Catalog by Capability
The catalog spans text generation, multimodal, image generation/editing, video generation/editing, speech, embeddings, reranking, and domain models.
Regions and Deployment Modes
Model Studio endpoints, API keys, data routing, and model availability depend on the deployment mode. A quote must specify the intended region before payment.
Billing and Pricing Structure
A trustworthy quote separates official model usage, Token Plan subscription, shared quota, payment costs, taxes, and ModelSmarter service fees.
Request Quote Checklist
A quote should collect enough information to choose the right plan, endpoint, model, tool route, and service scope before payment.