fal-models-catalog

fal.ai Models Catalog

Endpoint-first navigation for fal.ai production work. Each modality reference lists curated picks organized by use case (premium realism / fast & cheap / 4K / specialized). Before reaching for free-text search, consult the modality reference that matches the task.

Runtime: All endpoint calls run via the `genmedia` CLI. See the `genmedia` skill for command syntax; run `genmedia init` once if not yet installed.

Endpoint-first rule

Pick the endpoint ID from the right modality reference.

Verify it:
```
genmedia models --endpoint_id <endpoint_id> --json
```
Inspect it:
```
genmedia schema <endpoint_id> --json
```
Check cost when relevant:
```
genmedia pricing <endpoint_id> --json
```
Use text search only if the routed endpoint is missing, deprecated, rejected, or the role is not covered here:
```bash
genmedia models "<task description>" --json
genmedia docs "<topic>" --json
```

Do not invent endpoint IDs.
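
The verify, inspect, and pricing steps above can be chained into one helper. This is a minimal sketch, not part of the genmedia skill: the function name `check_endpoint` and the `GENMEDIA` override variable are hypothetical, and it assumes the CLI exits non-zero when a lookup fails, which the catalog does not state.

```shell
# Hypothetical override (not a genmedia feature): set GENMEDIA=echo to
# dry-run the sequence and print the commands instead of calling the CLI.
GENMEDIA="${GENMEDIA:-genmedia}"

# check_endpoint <endpoint_id> [yes|no]
# Runs verify, then inspect, then (optionally) pricing.
# Assumption: each command returns a non-zero status on failure.
check_endpoint() {
  endpoint_id="$1"
  with_pricing="${2:-no}"
  if [ -z "$endpoint_id" ]; then
    echo "usage: check_endpoint <endpoint_id> [yes|no]" >&2
    return 2
  fi
  # Step 1: verify the endpoint exists.
  "$GENMEDIA" models --endpoint_id "$endpoint_id" --json || return 1
  # Step 2: inspect its input/output schema.
  "$GENMEDIA" schema "$endpoint_id" --json || return 1
  # Step 3: check cost only when it is relevant to the task.
  if [ "$with_pricing" = "yes" ]; then
    "$GENMEDIA" pricing "$endpoint_id" --json || return 1
  fi
  return 0
}
```

Call it as `check_endpoint "<endpoint_id>" yes`, substituting an ID taken from a modality reference. A failed check is a reason to fall back to text search, not to guess an ID.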

Modality references

Load the reference matching the user's task:

text-to-image.md: image generation from prompt (text-heavy, premium still, fast draft)
image-to-image.md: image editing, inpainting, background removal, upscaling
text-to-video.md: video generation from prompt (highest quality, fast/economical, multi-shot storytelling)
image-to-video.md: video from a reference frame (including audio-driven and lip-sync variants)
video-to-video.md: video edit, restyle, upscale, background removal
text-to-3d.md: 3D model generation from text
image-to-3d.md: 3D model generation from reference images
text-to-audio.md: TTS, music, SFX generation
audio-to-text.md: speech-to-text (Whisper, ElevenLabs Scribe with diarization)
image-to-text.md: OCR, captioning, VQA, detection, segmentation
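
Because every reference file follows the `<input>-to-<output>.md` pattern, choosing one can be reduced to a lookup on the task's input and output modality. The helper below is a hypothetical sketch (the name `modality_reference` is not part of any skill); it only recognizes the ten files listed above and fails for any other pair.

```shell
# modality_reference <input> <output>
# Prints the reference file for a task, or fails if the pair is not one
# of the ten references in this catalog.
modality_reference() {
  case "$1-to-$2" in
    text-to-image|image-to-image|text-to-video|image-to-video|video-to-video|\
    text-to-3d|image-to-3d|text-to-audio|audio-to-text|image-to-text)
      echo "$1-to-$2.md" ;;
    *)
      echo "no modality reference for $1 -> $2" >&2
      return 1 ;;
  esac
}
```

For example, `modality_reference image video` prints `image-to-video.md`, while an unlisted pair such as `audio video` returns a non-zero status, which is the case where text search applies.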

Utility endpoints

Workflow utility endpoint IDs (resize, composite, mask, audio merge, subtitle, etc.) live in the `fal-workflow` skill: `fal-workflow/references/utility-endpoints.md`.

Utility endpoints are listed explicitly because they are deterministic tools, not creative model choices. Always inspect the schema before use.
