alicloud-ai-multimodal-qwen-omni

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese
Category: provider
类别:服务商

Model Studio Qwen Omni

Model Studio Qwen Omni

Validation

验证

bash
mkdir -p output/alicloud-ai-multimodal-qwen-omni
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qwen-omni/validate.txt
Pass criteria: command exits 0 and
output/alicloud-ai-multimodal-qwen-omni/validate.txt
is generated.
bash
mkdir -p output/alicloud-ai-multimodal-qwen-omni
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qwen-omni/validate.txt
通过标准:命令执行返回0,且生成
output/alicloud-ai-multimodal-qwen-omni/validate.txt
文件。

Critical model names

关键模型名称

Use one of these exact model strings:
  • qwen3-omni-flash
  • qwen3-omni-flash-realtime
  • qwen-omni-turbo
  • qwen-omni-turbo-realtime
请使用以下精确模型字符串之一:
  • qwen3-omni-flash
  • qwen3-omni-flash-realtime
  • qwen-omni-turbo
  • qwen-omni-turbo-realtime

Typical use

典型用途

  • Image + audio + text assistant
  • Realtime multimodal agents
  • Spoken responses grounded in visual input
  • 图像+音频+文本助手
  • 实时多模态Agent
  • 基于视觉输入的语音响应

Normalized interface (omni.chat)

标准化接口(omni.chat)

Request

请求

  • model
    (string, optional): default
    qwen3-omni-flash
  • text
    (string, optional)
  • image
    (string, optional)
  • audio
    (string, optional)
  • response_modalities
    (array<string>, optional): e.g.
    ["text"]
    ,
    ["text","audio"]
  • model
    (字符串,可选):默认值为
    qwen3-omni-flash
  • text
    (字符串,可选)
  • image
    (字符串,可选)
  • audio
    (字符串,可选)
  • response_modalities
    (字符串数组,可选):例如
    ["text"]
    ["text","audio"]

Response

响应

  • text
    (string, optional)
  • audio_url
    or
    audio_chunk
    (optional)
  • usage
    (object, optional)
  • text
    (字符串,可选)
  • audio_url
    audio_chunk
    (可选)
  • usage
    (对象,可选)

Quick start

快速开始

bash
python skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py \
  --output output/alicloud-ai-multimodal-qwen-omni/request.json
bash
python skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/scripts/prepare_omni_request.py \
  --output output/alicloud-ai-multimodal-qwen-omni/request.json

References

参考资料

  • references/sources.md
  • references/sources.md