alicloud-ai-multimodal-qvq
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseCategory: provider
分类:供应商
Model Studio QVQ Visual Reasoning
Model Studio QVQ 视觉推理
Validation
验证
bash
mkdir -p output/alicloud-ai-multimodal-qvq
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txtPass criteria: command exits 0 and is generated.
output/alicloud-ai-multimodal-qwen-vqv/validate.txtbash
mkdir -p output/alicloud-ai-multimodal-qvq
python -m py_compile skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py && echo "py_compile_ok" > output/alicloud-ai-multimodal-qvq/validate.txt通过标准:命令执行返回0,且生成文件。
output/alicloud-ai-multimodal-qwen-vqv/validate.txtCritical model names
关键模型名称
Use one of these exact model strings:
qvq-plusqvq-max
使用以下精确的模型字符串之一:
qvq-plusqvq-max
Typical use
典型用途
- Mathematical reasoning from screenshots
- Diagram and chart reasoning
- Visually grounded multi-step problem solving
- 基于截图的数学推理
- 图表与图形推理
- 基于视觉的多步骤问题解决
Quick start
快速开始
bash
python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py \
--output output/alicloud-ai-multimodal-qvq/request.jsonbash
python skills/ai/multimodal/alicloud-ai-multimodal-qvq/scripts/prepare_qvq_request.py \
--output output/alicloud-ai-multimodal-qvq/request.jsonNotes
注意事项
- Use for standard image understanding.
skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ - Use QVQ when the task explicitly needs stronger reasoning over visual evidence.
- 标准图像理解请使用。
skills/ai/multimodal/alicloud-ai-multimodal-qwen-vl/ - 当任务明确需要对视觉证据进行更强的推理时,请使用QVQ。
References
参考资料
references/sources.md
references/sources.md