higgsfield-generate
Generate images and videos via Higgsfield AI through 30+ models including Nano Banana 2, Soul V2, Veo 3.1, Kling 3.0, Seedance 2.0, Flux 2, GPT Image 2, plus Marketing Studio for branded ad video/image with curated avatars and imported products. Use when: "generate an image", "make a picture", "create artwork", "make a video", "animate this photo", "image-to-video", "img2vid", "edit this image with AI", "stylize a photo", "remix this image", "produce a clip", "render a scene", "create an ad", "make a UGC video", "generate marketing video", "make a product demo", "create unboxing", "TV spot", "virtual try-on", "product showcase", "brand video", "presenter video for product", "import product from URL", "create avatar for ad". Supports text-to-image, image-to-image, image-to-video, reference-based generation, and Marketing Studio (avatars + products + ad modes). Auto-detects whether passed IDs are uploads or previous jobs. Chain with higgsfield-soul-id when the user wants their face in the output. NOT for: training Soul Character (use higgsfield-soul-id), professional product photoshoots with mode-specific prompt enhancement (use higgsfield-product-photoshoot), text-only / chat / TTS tasks.
NPX Install
npx skill4agent add higgsfield-ai/skills higgsfield-generateTags
Translated version includes tags in frontmatterSKILL.md Content
View Translation Comparison →Higgsfield Generate
higgsfieldStep 0 — Bootstrap
- If is not on
higgsfield, install it:$PATHbashcurl -fsSL https://raw.githubusercontent.com/higgsfield-ai/cli/main/install.sh | sh - If fails with
higgsfield account status/Session expired, ask the user to runNot authenticated(interactive, opens a browser) and wait for them to confirm before continuing.higgsfield auth login
higgsfield account statusUX Rules
- Be concise. No raw IDs, no JSON dumps in chat. Print result URL when ready.
- No internal jargon. Don't narrate "calling higgsfield cost", "polling job".
- Detect the user's language from the first message and reply in it. Technical args () stay English.
--aspect_ratio 16:9 - Don't batch-ask. Pick a sane default model and ask one thing at a time only if genuinely missing.
- Don't pre-estimate cost. Just submit unless the user asks.
- Pass to
--waitso the command blocks until done and prints the result URL itself. Avoid the two-stepgenerate create→createpattern.wait
Workflow — generic generation
-
Pick a model. Practical defaults from production use:Image:
- Brand product visual (Pinterest pin, lifestyle, hero banner, ad pack, virtual try-on) → use instead. NOT this skill.
higgsfield-product-photoshoot - Branded ad image with avatar + product (Marketing Studio shape) → Marketing Studio Image (see Marketing Studio below)
- Aesthetic UGC / fashion editorial / lifestyle character → Soul 2.0
- Cinematic still frame → Soul Cinema
- Highly characterful creative persona (text-only, distinctive) → Soul Cast
- Locations / environments / no-people scenes → Soul Location (best in class)
- Vector illustrations OR face edit + complex scene swap → Seedream 4.5
- Soul Character (reference id from ) → Soul 2.0 for stills, Soul Cinema for cinematic
higgsfield-soul-id - Fast and cheap iteration → Z Image
- Character or cartoon-style work → Nano Banana 2; step up to Nano Banana Pro on hard cases
- Default for everything else → GPT Image 2. Graphic design, UI, banners, typography, and high-fidelity general generation.
Video:- All advertising / commercial / branded ad video → Marketing Studio (see Marketing Studio below)
- Default all-purpose serious video (multi-shot, consistent identity, motion-heavy) → Seedance 2.0. SOTA.
- Single-plane scene without strong dynamics, cheaper than Seedance 2.0 → Kling 3.0
- Cheap clean shot without cuts → Seedance 1.5 Pro
- Cinema-grade highest fidelity → Cinema Studio Video 3.0
- Cheap with strong physics, no audio needed → Minimax Hailuo
- Fast batch / volume → Veo 3.1 Lite
For the actualID to pass to--model, runhiggsfield generate createto map display names to IDs. Seehiggsfield model list --json | jqfor the full table.references/model-catalog.md - Brand product visual (Pinterest pin, lifestyle, hero banner, ad pack, virtual try-on) → use
-
Pass media inputs straight to flags. Media flags accept a local file path or a UUID. CLI auto-uploads paths and auto-detects job vs upload for UUIDs. No need to pre-upload. Each model declares accepted roles (,
image,start_image,end_image,video) — seeaudio.references/media-inputs.md -
Validate quickly. If unsure of params, runonce and pass only what's needed. Use schema defaults otherwise. The server returns
higgsfield model get <jst> --jsonfor non-fatal coercions (e.g.adjustments→ closest match) and a structured error for invalid declared-param values.aspect_ratio=99:99 -
Submit and wait in one shot.. Blocks until terminal status and prints the result URL on stdout. Tunables:
higgsfield generate create <jst> --prompt "..." [media flags] [param flags] --wait(default 10m),--wait-timeout 20m(default 3s).--wait-interval 5s -
Deliver. Send the URL plus a one-line summary (model, duration if video).
higgsfield generate list --jsonhiggsfield generate get <id> --jsonhiggsfield generate wait <id>--waitMedia flags
| Flag | Use for | Models that accept it |
|---|---|---|
| reference image | most image models, |
| first frame for image-to-video transitions | |
| last frame for transitions | |
| reference video | |
| reference audio (lipsync, soundtrack match) | |
higgsfield upload createMEDIA_ROLESreferences/media-inputs.mdCommon params
higgsfield model get <jst>higgsfield generate create gpt_image_2 --prompt "neon city at dusk" --aspect_ratio 16:9 --resolution 2k --wait
higgsfield generate create nano_banana_2 --prompt "anime character concept, expressive pose" --image ./ref.png --wait
higgsfield generate create seedance_2_0 --prompt "camera dollies in" --start-image ./first.png --duration 8 --wait
higgsfield generate create text2image_soul_v2 --prompt "..." --soul-id <soul_ref_id> --wait--json--wait --json--waitecho "..." | higgsfield generate create z_image --waitMarketing Studio
marketing_studio_videomarketing_studio_imageConcepts
- Avatar — presenter face. Curated (browse
preset) orhiggsfield marketing-studio avatars list(uploaded photos viacustom).higgsfield marketing-studio avatars create - Product — brand item with title + reference images. Imported from URL () or created from uploaded images (
higgsfield marketing-studio products fetch --url ...).higgsfield marketing-studio products create - Webproduct — App Store / web page version. Auto-routes when fetching App Store URLs.
UX rules (additional)
- One question per phase. Don't ask product+avatar+mode upfront.
Workflow — quick ad video
- Get product.
- URL → (polls until import done)
higgsfield marketing-studio products fetch --url <url> --wait - Local images → then
higgsfield upload create <photo>...Capture product id.higgsfield marketing-studio products create --title "..." --image <id>...
- URL →
- Pick avatar.
- Default: and pick a preset matching the brand voice.
higgsfield marketing-studio avatars list - Custom: .
higgsfield marketing-studio avatars create --name "..." --image <upload_id>
- Default:
- Pick mode. Default . Other slugs (canonical from MCP):
ugc,ugc_how_to,ugc_unboxing,product_showcase,product_review,tv_spot,wild_card,ugc_virtual_try_on. Seevirtual_try_on.references/marketing-modes.md - Generate (one-shot).
Resolution isbash
higgsfield generate create marketing_studio_video \ --prompt "..." \ --avatars '[{"id":"<avatar_id>","type":"preset"}]' \ --product_ids '[<product_id>]' \ --mode ugc \ --duration 15 \ --resolution 720p \ --aspect_ratio 9:16 \ --waitor480p. Aspect ratio is one of720p/auto/21:9/16:9/4:3/1:1/3:4.9:16is supported here (unlike--generate-audio true).seedance_2_0blocks until done; bump--waitfor longer ad runs.--wait-timeout 30m - Deliver. URL + one-line summary (mode, duration).
Click-to-Ad shortcut (URL-driven)
# 1. Trigger fetch (returns the product id and starts background scrape)
higgsfield marketing-studio products fetch --url https://shop.example.com/sneakers --wait
# 2. Generate the marketing video against the same URL — backend reuses the entity
higgsfield generate create marketing_studio_video \
--url https://shop.example.com/sneakers \
--mode ugc \
--duration 15 \
--aspect_ratio 9:16 \
--waitWorkflow — marketing image
marketing_studio_imagehiggsfield generate create marketing_studio_image \
--prompt "..." \
--aspect_ratio 1:1 \
--resolution 2k \
--waitErrors
- → user gave no prompt; ask for it.
Missing required params: prompt - → bad enum; pick from allowed.
Invalid values: aspect_ratio=99:99 (allowed: ...) - → schema doesn't accept that flag; check
Unknown params: foo.higgsfield model get <jst> - →
Session expired.higgsfield auth login
references/troubleshooting.mdReference docs
- — picking the right model for the task
references/model-catalog.md - — writing prompts that work
references/prompt-engineering.md - — image/video reference flows
references/media-inputs.md - — common errors and fixes
references/troubleshooting.md - — preset vs custom avatars
references/marketing-avatars.md - — URL fetch vs manual product create
references/marketing-products.md - — every Marketing Studio mode
references/marketing-modes.md