Found 48 Skills
Minimal CosyVoice voice design smoke test for Model Studio voice-enrollment.
Use when tasks require all-in-one multimodal understanding or generation with Alibaba Cloud Model Studio Qwen Omni models, including image-plus-audio interaction, voice assistants, and realtime multimodal agents.
Minimal text embedding smoke test for Model Studio embedding models.
Use when text embeddings are needed from Alibaba Cloud Model Studio models for semantic search, retrieval-augmented generation, clustering, or offline vectorization pipelines.
Minimal multimodal omni smoke test for Model Studio Qwen Omni.
Minimal visual reasoning smoke test for Model Studio QVQ.
Minimal CosyVoice voice clone smoke test for Model Studio voice-enrollment.
Use when Alibaba Cloud Model Studio video editing models are needed for style transfer, keyframe-controlled editing, lip sync, retalk, or animation remix workflows.
Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from reference audio and then reusing the returned voice_id in later TTS calls.
Minimal live speech translation smoke test for Model Studio Qwen LiveTranslate.
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from a voice prompt plus preview text before using the returned voice_id in TTS.
Use when OCR-specialized extraction is needed with Alibaba Cloud Model Studio Qwen OCR models (`qwen-vl-ocr`, `qwen-vl-ocr-latest`, and snapshots), including document parsing, table parsing, multilingual OCR, formula recognition, and key information extraction.