fal-models-catalog

fal.ai Models Catalog

Endpoint-first navigation for fal.ai production work. Each modality reference lists curated picks organized by use case (premium realism / fast & cheap / 4K / specialized). Before reaching for free-text search, consult the modality reference that matches the task.
Runtime: All endpoint calls run via the genmedia CLI. See the genmedia skill for command syntax; run genmedia init once if not yet installed.

Endpoint-first rule

  1. Pick the endpoint ID from the matching modality reference.
  2. Verify it:
    genmedia models --endpoint_id <endpoint_id> --json
  3. Inspect its schema:
    genmedia schema <endpoint_id> --json
  4. Check cost when relevant:
    genmedia pricing <endpoint_id> --json
  5. Use text search only if the routed endpoint is missing, deprecated, rejected, or the role is not covered here:
    genmedia models "<task description>" --json
    genmedia docs "<topic>" --json
Do not invent endpoint IDs.
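
The steps above can be sketched as a small shell helper. This is a minimal sketch, not part of the genmedia CLI: it uses only the subcommands and flags shown in the list, and the names run_cmd, check_endpoint, search_fallback, and the GENMEDIA_DRY_RUN variable are hypothetical, introduced here for illustration. It also assumes that genmedia exits with a non-zero status when a lookup fails, which this catalog does not confirm.

```shell
# Hypothetical helpers; only the genmedia commands themselves come from the list above.

# Print the command instead of executing it when GENMEDIA_DRY_RUN=1.
# The printed form is for display only (argument quoting is not preserved).
run_cmd() {
  if [ "${GENMEDIA_DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Steps 2-4: verify the endpoint, inspect its schema, and optionally check cost.
# Pass "price" as the second argument to include the pricing lookup.
# Assumption: a failed lookup returns non-zero, so later steps are skipped.
check_endpoint() {
  local endpoint_id="$1" want_price="${2:-no}"
  run_cmd genmedia models --endpoint_id "$endpoint_id" --json || return 1
  run_cmd genmedia schema "$endpoint_id" --json || return 1
  if [ "$want_price" = "price" ]; then
    run_cmd genmedia pricing "$endpoint_id" --json || return 1
  fi
}

# Step 5: text search, reserved for when the routed endpoint is unusable.
search_fallback() {
  local task="$1" topic="$2"
  run_cmd genmedia models "$task" --json || return 1
  run_cmd genmedia docs "$topic" --json
}
```

With GENMEDIA_DRY_RUN=1 exported, calling check_endpoint "<endpoint_id>" price prints the three commands in order rather than running them, which is a cheap way to review the sequence before using a real endpoint ID taken from a modality reference.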

Modality references

Load the reference matching the user's task:
  • text-to-image.md: image generation from a prompt (text-heavy, premium still, fast draft)
  • image-to-image.md: image editing, inpainting, background removal, upscaling
  • text-to-video.md: video generation from a prompt (highest quality, fast/economical, multi-shot storytelling)
  • image-to-video.md: video from a reference frame (including audio-driven and lip-sync variants)
  • video-to-video.md: video editing, restyling, upscaling, background removal
  • text-to-3d.md: 3D model generation from text
  • image-to-3d.md: 3D model generation from reference images
  • text-to-audio.md: TTS, music, SFX generation
  • audio-to-text.md: speech-to-text (Whisper, ElevenLabs Scribe with diarization)
  • image-to-text.md: OCR, captioning, VQA, detection, segmentation
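
As a rough illustration of this routing, the list above can be treated as a lookup from modality name to reference file. The helper name reference_for and the MODALITIES variable are hypothetical; the file names are the ones listed above, and the sketch does not check that the files exist on disk.

```shell
# Modality names taken from the reference list above.
MODALITIES="text-to-image image-to-image text-to-video image-to-video video-to-video
text-to-3d image-to-3d text-to-audio audio-to-text image-to-text"

# Print the reference file for a known modality; fail for anything else
# rather than guessing a file name.
reference_for() {
  local wanted="$1" name
  for name in $MODALITIES; do
    if [ "$wanted" = "$name" ]; then
      printf '%s.md\n' "$name"
      return 0
    fi
  done
  echo "no modality reference for: $wanted" >&2
  return 1
}
```

A modality outside the list returns a non-zero status, which mirrors the rule that text search is a fallback only when the role is not covered here.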

Utility endpoints

Workflow utility endpoint IDs (resize, composite, mask, audio merge, subtitle, etc.) live in the fal-workflow skill: fal-workflow/references/utility-endpoints.md.
Utility endpoint IDs are listed explicitly because they are deterministic tools, not creative model choices. Always inspect the schema before use.