Older
Qwen-Omni-Turbo
Older omni-modal model for mixed input understanding and streaming text/speech generation.
Model details
Model code
qwen-omni-turbo
Category
Older
Family
Qwen Omni
Capability
Older omni-modal
Modality
Text / image / audio / video -> Text / audio
Release / status
2025-07-18
Snapshot
qwen-omni-turbo-2025-03-26
Source region
Console
Official detail price
Input: Text: $0.07 / 1M tokens · Input: Audio: $4.44 / 1M tokens
Input
Text: $0.07 / 1M tokens
Input
Audio: $4.44 / 1M tokens
Input
Vision: $0.21 / 1M tokens
Input
Vision(Implicit Cache): $0.04 / 1M tokens
Input
Text(Implicit Cache): $0.015 / 1M tokens
Source region: International. This is a copied summary from the official Model Studio detail page checked on June 8, 2026. Final quotes still require official console confirmation for region, account route, quota, promotions, taxes, and current availability.
Buyer review
Questions to confirm before purchase
Source note
Catalog taxonomy and model detail price summaries were checked against Alibaba Cloud Model Studio Console on 2026-06-08. Availability, region, account route, quota, taxes, promotions, and official terms must be confirmed before purchase.
Open official console source