Xiaohongshu Image Workflow
⚠️ Core Principle: Propose first, generate later
Never skip the design proposal to generate images directly. Correct process:
Understand content → Design proposal (2-3 directions) → User selection → Generate → Preview confirmation → Upload
Uncle Hua Design Aesthetic Profile
Preferences
- Texture and warmth — Organic elements such as paper wrinkles, handwritten strokes, seals, tapes, etc.
- Handwritten/calligraphy fonts — Especially suitable for note-style, personal sharing scenarios
- Text fills the screen — Xiaohongshu is for mobile vertical screens, text should be large enough to be seen at a glance
- Core element enhancement — Key numbers/keywords as visual heroes (enlarge 3 times, change color, add decoration)
- Clear structure — Main title > Subtitle > List, distinct hierarchy
- Warm tones — Cream color, warm orange, warm gold perform well
Dislikes
- HTML screenshots — Too flat, like PPT templates, no soul (only fallback for precise data tables)
- Cyber neon/dark blue background — Dark blue backgrounds such as #0D1117 are aesthetic forbidden areas
- Signature/watermark — Cover images should not have "花生", "花叔", "@花生"
- Excessive white space — Better to be full than empty
Proven Good Styles
| Style | Performance | Applicable Scenarios |
|---|
| Handwritten notes (warm paper + calligraphy + hand-drawn icons) | ⭐⭐⭐⭐⭐ | Tutorials, dry content, personal sharing |
| Dark gold poster (dark background + large gold text) | ⭐⭐⭐⭐ | Product releases, shocking titles (needs matching good content) |
| Minimalist infographic (light background + large numbers + simple hierarchy) | ⭐⭐⭐⭐ | Data-driven content, comparison |
Core Parameters
| Parameter | Value |
|---|
| Standard size | 1080 x 1440 px (3:4) |
| AI generation resolution | |
| AI Prompt aspect ratio statement | 3:4 portrait aspect ratio, 1080x1440 pixels
|
| HTML viewport (for fallback) | --viewport-size=1080,1440
|
Step 1: Understand the Content
Read the content provided by the user, quickly extract:
- Theme: What is this content about?
- Core keywords: Which words/numbers need to be made into visual heroes?
- Mood/tone: Suspense? Dry content? Warm? Shocking?
- Number and type of images: Single cover / Carousel set / Infographic?
No need to show the analysis results to the user, go directly to Step 2.
Step 2: Design Proposal ✅ Must wait for user selection
This is the most critical step of the entire process. Skipping is prohibited.
Proposal Format
Show 2-3 design directions to the user, each direction includes:
### Direction A: [Style Name]
- Visual style: [One sentence describing the visual sense, e.g. "warm notebook paper + brush calligraphy large text + hand-drawn small icons"]
- Color: [Background color + main color + accent color]
- Copy layout: [Which text to enlarge as hero, which as subtitle, overall arrangement]
- Mood: [First impression of users after seeing it]
Proposal Principles
- Each direction has clear differences (at least one of style, mood, color is completely different)
- Mark recommendations (explain why a direction is recommended based on content characteristics)
- Copy should be specific (not "enlarge the title", but "the number '28' is a 200px hero element, emphasized with orange")
- No more than 3 directions (too many choices make selection harder)
Proposal Example
Direction A: Handwritten note style (Recommended)
- Visual style: Cream checkered paper background + brush calligraphy main title + hand-drawn technology small icons
- Color: Background #FDF6EC + main color #D97706 (warm orange) + accent circle lines
- Copy layout: "Alibaba C4 Building" and "A group of Cantonese people came" fill the upper half as hero, "AI Open Source Champion" is highlighted in orange as visual anchor, "Qianwen APP" seal at the bottom right corner
- Mood: Friendly, real, like a friend sharing inside information
Direction B: Dark gold reveal style
- Visual style: Dark frosted background + large gold text + badge decoration
- Color: Background #1A1A1A + main color #E2B714 (gold) + white auxiliary
- Copy layout: "Global AI Open Source Champion" giant gold text fills the screen, "Big Factory Inside Information" gold badge at the top, subtitle in white at the bottom
- Mood: Shocking, inside information, sense of weight
Only enter Step 3 after the user makes a selection. Users may:
- Select one directly → Enter generation
- Request mixing/adjustment → Modify the direction and confirm again
- Not satisfied with any → Ask for preferences and re-propose
Step 3: Generate Images
Build Prompt
Based on the direction selected by the user, build a complete prompt.
Prompt Template:
Create a [style] cover for a Xiaohongshu post. 3:4 portrait aspect ratio, 1080x1440 pixels, high quality rendering.
VISUAL STYLE: [Expand from the visual style description in the proposal]
COLOR PALETTE: [Specific color description]
TYPOGRAPHY: text fills most of the canvas, oversized bold typography, clear visual hierarchy.
TEXT TO RENDER:
- [Main title — hero element, visually dominant]
- [Subtitle]
- [Other text elements]
The word/number "[core keyword]" is visually dominant, 3x larger than other text, with decorative emphasis.
IMPORTANT: Do NOT include any personal signature, watermark, or author name like "花生" or "花叔".
[1-2 sentences describing the mood of the image]
Uncle Hua preferred prompt keywords (add as needed):
- Large text →
text fills most of the canvas, oversized bold typography
- Core enhancement →
the word/number "XX" is visually dominant, 3x larger than other text, with decorative emphasis
- Handwritten style →
handwritten style Chinese text / brush calligraphy lettering
- Paper texture →
warm cream paper texture with subtle grid lines, notebook page feel
- Clear structure →
clear visual hierarchy with distinct heading, subheading, and list levels
- No signature →
Do NOT include any personal signature, watermark, or author name
Two generation paths (output both every time)
| Path | Tool | Advantages | Disadvantages | Cost |
|---|
| AI Generation | Gemini nano-banana-pro | Good texture, warm, rich visual | Chinese may be rendered incorrectly | API fee required |
| HTML Screenshot | Playwright | 100% accurate text, zero cost, batch available | Relatively flat, lack of texture | Free |
Output both paths every time for users to compare and choose. HTML is zero cost, you can generate more variants (color/layout) for each direction to give users more choices. Only 1 image per direction for AI path is enough.
File organization specification (must comply)
When generating multiple versions, all related files (png + html source files) are placed in the same subfolder:
Article directory/
├── article.md
└── [article abbreviation]-xiaohongshu-images/ ← Subfolder
├── A-note-style-AI.png
├── A1-note-style-HTML-warm.png
├── A1-note-style-HTML-warm.html
├── B-newspaper-style-AI.png
└── ...
Naming convention:
[direction letter][variant number]-[Chinese style name]-[path AI/HTML]-[variant description].png
- Direction letter: A/B/C (corresponding to the design proposal direction)
- No serial number for AI path:
- HTML variants with serial number:
A1-note-style-HTML-warm.png
, A2-note-style-HTML-green.png
- Folder name uses article keywords:
[keyword]-xiaohongshu-images/
AI generation command
bash
export $(grep GEMINI_API_KEY ~/.claude/.env) && \
uv run /Users/alchain/Documents/写作/.claude/skills/xhs-image/scripts/generate_image.py \
--prompt "[full prompt]" \
--filename "[direction]-[style]-AI.png" \
--resolution 2K
Move to the image subfolder after generation. Carousel images can be generated in parallel (
).
Step 4: Preview Confirmation
Browser preview (must do)
After generation, open all images with the
command for users to compare side by side:
bash
open "[image path 1]" "[image path 2]" "[image path 3]"
Inline preview
At the same time, use the Read tool to display the generated results in the terminal.
Basic check items:
- Is Chinese text rendered correctly?
- Is the ratio 3:4 vertical?
- Does the style match the selected direction?
- Are there no signatures/watermarks?
Design review (must do)
Score each image from two dimensions (10-point scale), and give optimization directions:
| Dimension | Evaluation Criteria |
|---|
| Design Score | Visual hierarchy, typography, color matching, texture, creativity |
| Xiaohongshu Attractiveness | Whether it is eye-catching in the information flow, whether the text is large enough, information density, emotional transmission, whether it arouses curiosity |
Review output format:
- Each image: Comprehensive score + 1 core comment + 1 optimization direction
- Finally give a summary ranking table, mark recommendations
- Users can decide whether to adopt optimization suggestions by themselves
User feedback handling:
- Satisfied → Step 5 Upload
- Text error → Use HTML fallback to render this image
- Wrong style → Adjust prompt and regenerate
- Major direction change → Go back to Step 2 to re-propose
Step 5: Upload to Image Hosting
bash
python3 /Users/alchain/Documents/写作/tools/upload_image.py "[image path]"
Return ImgBB permanent link.
HTML Screenshot Path (used when AI text rendering fails, for precise data tables, or when users request comparison)
bash
npx playwright screenshot "file:///path/to/card.html" output.png \
--viewport-size=1080,1440 --wait-for-timeout=1000
HTML template requirements:
- Canvas:
width: 1080px; height: 1440px
- Font:
font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", sans-serif
- Safe area: 80px top and bottom, 60px left and right
Quick Reference
Chinese text rendering limits (AI path)
- Main title ≤ 7 characters
- Subtitle ≤ 15 characters
- Body text per line ≤ 20 characters
- Must be verified one by one
Uncle Hua Technology Account Color Matching
| Scheme Name | Background Color | Main Color | Accent Color | Applicable Scenarios |
|---|
| Warm Gray Professional | #F5F0EB | #D97706 | #4A90D9 | AI tools, sharing |
| Minimalist Professional | #F5F5F5 | #4A90D9 | #FF6B35 | Tutorials, comparison |
| Dark Night Gold | #1A1A2E | #E2B714 | #FFFFFF | Product release |
| Terminal Green | #1A1A1A | #00FF41 | #888888 | Programming related |
Golden Rules
- Title is large, bold, eye-catching (occupies 30-50% of the screen)
- Core numbers/keywords are visually enhanced (enlarge, change color, add decoration)
- Cover has large amount of information → arouse curiosity
- Carousel images have unified style
- Vertical 3:4, make full use of screen space
- No signature/watermark
Related Skills
| Skill | Function |
|---|
| Official Account supporting images (sister skill) |
| PPT supporting images (style library source) |
Reference Files
references/style-gallery.md
— Complete style library and prompt template
references/design-guidelines.md
— Xiaohongshu platform design specifications
Produced by Uncle Hua | AI Native Coder · Independent Developer
Official Account「花叔」| 300k+ followers | AI tools and efficiency improvement
Representative works: 小猫补光灯 (Top 1 in AppStore paid list) · 《一本书玩转DeepSeek》