fal-models-catalog
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
Chinesefal.ai Models Catalog
fal.ai 模型目录
Endpoint-first navigation for fal.ai production work. Each modality reference lists curated picks organized by use case (premium realism / fast & cheap / 4K / specialized). Before reaching for free-text search, consult the modality reference that matches the task.
Runtime: All endpoint calls run via the genmedia CLI. See theskill for command syntax; rungenmediaonce if not yet installed.genmedia init
以端点为核心的fal.ai生产工作导航。每个模态参考都按用例(超写实风格/快速低成本/4K分辨率/专用场景)列出了精选选项。在使用自由文本搜索之前,请先查阅与任务匹配的模态参考。
Endpoint-first rule
端点优先规则
- Pick the endpoint ID from the right modality reference.
- Verify it: .
genmedia models --endpoint_id <endpoint_id> --json - Inspect it: .
genmedia schema <endpoint_id> --json - Check cost when relevant: .
genmedia pricing <endpoint_id> --json - Use text search only if the routed endpoint is missing, deprecated, rejected, or the role is not covered here:
bash
genmedia models "<task description>" --json
genmedia docs "<topic>" --jsonDo not invent endpoint IDs.
- 从对应的模态参考中选择端点ID。
- 验证端点:。
genmedia models --endpoint_id <endpoint_id> --json - 查看端点详情:。
genmedia schema <endpoint_id> --json - 必要时查看成本:。
genmedia pricing <endpoint_id> --json - 仅当指定的端点缺失、已弃用、被拒绝,或此处未涵盖相关场景时,才使用文本搜索:
bash
genmedia models "<task description>" --json
genmedia docs "<topic>" --json请勿自行编造端点ID。
Modality references
模态参考
Load the reference matching the user's task:
- text-to-image.md, image generation from prompt (text-heavy, premium still, fast draft)
- image-to-image.md, image editing, inpainting, background removal, upscaling
- text-to-video.md, video generation from prompt (highest quality, fast/economical, multi-shot storytelling)
- image-to-video.md, video from a reference frame (including audio-driven and lip-sync variants)
- video-to-video.md, video edit, restyle, upscale, background removal
- text-to-3d.md, 3D model generation from text
- image-to-3d.md, 3D model generation from reference images
- text-to-audio.md. TTS, music, SFX generation
- audio-to-text.md, speech-to-text (Whisper, ElevenLabs Scribe with diarization)
- image-to-text.md. OCR, captioning, VQA, detection, segmentation
加载与用户任务匹配的参考文档:
- text-to-image.md:基于提示词生成图像(文本驱动、超写实静态图、快速草稿图)
- image-to-image.md:图像编辑、图像修复、背景移除、图像超分辨率
- text-to-video.md:基于提示词生成视频(最高画质、快速经济、多镜头叙事)
- image-to-video.md:基于参考帧生成视频(包括音频驱动和唇形同步变体)
- video-to-video.md:视频编辑、风格重塑、超分辨率、背景移除
- text-to-3d.md:基于文本生成3D模型
- image-to-3d.md:基于参考图像生成3D模型
- text-to-audio.md:文本转语音(TTS)、音乐、音效生成
- audio-to-text.md:语音转文本(Whisper、带说话人分离的ElevenLabs Scribe)
- image-to-text.md:光学字符识别(OCR)、图像 captioning、视觉问答(VQA)、目标检测、图像分割
Utility endpoints
实用工具端点
Workflow utility endpoint IDs (resize, composite, mask, audio merge, subtitle, etc.) live in the skill:
fal-workflow/references/utility-endpoints.md.
fal-workflowUtility endpoints are explicit because they are deterministic tools, not creative model choices. Always inspect schema before use.
工作流实用工具端点ID(调整大小、合成、蒙版、音频合并、字幕等)位于技能中:fal-workflow/references/utility-endpoints.md。
fal-workflow实用工具端点是明确的确定性工具,而非创意模型选择。使用前务必查看其schema。