paper-deck


1 script checked / no sensitive code detected

Turn papers, technical articles, or knowledge content into highly realistic AIGC slides. First create a narrative structure and page-by-page visual direction, then call an image generation model to produce a 16:9 slide image for each page, and finally synthesize into PPTX/PDF. Suitable for paper presentations, group meetings, open courses, technical sharing, and commercial research presentations; use this skill when users mention "paper PPT", "AI-generated PPT", "PPT that doesn't look like AI", "high-quality slides", or "page-by-page AI-generated PPT".

82 installs

Source: zsyggg/paper-craft-skills

Added on: 2026-05-25

NPX Install

npx skill4agent add zsyggg/paper-craft-skills paper-deck

SKILL.md Content (Chinese)


Paper Deck — Visual Slide Director

Turn papers/knowledge content into slides that look professionally designed.

The core approach is not to rigidly arrange layouts using PPT objects, but rather:

First understand the content, create a deck brief and page-by-page narrative.
Clearly define for each page: "What do you want the audience to see, feel, and remember from this page?"
Use an image generation model to produce 16:9 slide images.
Synthesize into PPTX/PDF, and retain prompts as source files for revision.

When to Use

Suitable for:

Paper group meetings, defenses, reading groups, technical sharing
Visual presentations that "don't look like template PPT at first glance"
Users who prefer high-quality image-based pages and prioritize overall visual impact and shareability
Decks that need page-by-page revision: redo page N, change style, increase realism, reduce AI traces

Not suitable for:

Decks where multiple people need to edit each text box in PowerPoint with precision
Decks dominated by tables, financial statements, or compliance materials
Decks that must accurately replicate existing corporate PPT masters

If users require fully editable PPT, explain that V1 of this skill is raster-first; they can switch to regular PPTX tools, or use a hybrid solution of "image background + editable text layers".

Workflow

Step 1: Input Analysis

Accepts:

arXiv / DOI / web links
PDF paths
Markdown / text / articles
Existing outlines
Reference images or reference PPT screenshots

For papers, prioritize reusing the reading method of `paper-analyzer`: read the abstract, methods, experiments, charts, and conclusions; search code repositories if necessary. The goal is not to write long articles, but to extract core narratives suitable for slides.

Output and save `analysis.md`:

Topic, audience, presentation scenario
1-sentence thesis of the paper/content
3-5 core points that must be clearly explained
Recommended number of pages, recommended style, language
Types of images to generate: cover, mechanism diagram, flowchart, data page, conclusion page, etc.
Directly usable real materials: paper Figure/Table, PDF screenshots, user-provided screenshots, code screenshots, experimental curves

Step 2: Pre-Generation Confirmation

Confirmation is required by default; do not generate images directly unless the user explicitly says "generate directly/no need to confirm/follow defaults".

Limit questions to 3 or fewer when asking:

Number of pages and purpose: group meeting / defense / public sharing / commercial presentation, how many pages are needed?
Style: see `references/style-system.md`.
Insert real materials: allow screenshots from PDF/paper charts, or user-provided screenshots/images? If allowed, specify which real materials are expected to be used on which pages.

Recommended script:

text

I suggest making 12 pages with the journal-minimal style: like Nature/IEEE paper figures + formal academic presentations, clear, restrained, and not flashy.
You can also switch to business-research for commercial research sharing, warm-notes for notebook style, or liquid-glass for Apple-style glass texture.
For this paper, I recommend inserting a partial screenshot of the original paper's method diagram on page 4, and a screenshot of experimental curves/tables on page 8, then designing the layout around these real materials.
After confirmation, I will first generate outline.md and prompts for each page, then produce images page by page and synthesize into PPTX/PDF.

Step 3: Deck Brief

Save as `deck-brief.md`. Must include:

`style_preset`
`audience`
`slide_count`
`language`
`visual_rules`
`do_not_use`
`reference_images` (if any)
`source_visual_plan`: which pages use real charts/screenshots, with their sources and processing methods
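
A minimal sketch of a completeness check for the brief, assuming its fields have already been loaded into a Python dict. The skill does not specify how `deck-brief.md` is parsed, so the dict representation and the helper name `missing_brief_fields` are hypothetical; only the field names come from the list above.

```python
# Field names are taken from the list above; everything else is illustrative.
REQUIRED_FIELDS = (
    "style_preset", "audience", "slide_count", "language",
    "visual_rules", "do_not_use", "source_visual_plan",
)
OPTIONAL_FIELDS = ("reference_images",)  # listed as "(if any)"


def missing_brief_fields(brief: dict) -> list[str]:
    """Return the required field names that are absent or blank in the brief."""
    return [name for name in REQUIRED_FIELDS if brief.get(name) in (None, "")]


# Hypothetical brief with one required field left out.
brief = {
    "style_preset": "journal-minimal",
    "audience": "group meeting",
    "slide_count": 12,
    "language": "English",
    "visual_rules": ["one main point per page"],
    "do_not_use": ["page numbers", "logos", "watermarks"],
}
print(missing_brief_fields(brief))  # ['source_visual_plan']
```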

Read `references/style-system.md` for style details as needed. Read `references/source-visuals.md` for real material strategies as needed.

Step 4: Outline

Save as `outline.md`. Each page uses a fixed structure:

markdown

## 01. Slide Title
- Role: cover / context / method / mechanism / evidence / result / takeaway
- Message: The only key point to be clearly explained on this page
- Visual: Main visual and composition of the page
- Text: Short text allowed on the page
- Evidence: Cited paper charts/formulas/experimental data/code locations
- Source visual: Whether to use real screenshots/paper charts; source, cropping range and placement
- Repair handle: Position description that can be referenced for subsequent revisions

Rules:

Each page only carries one main point.
Minimize text on the page; place complex explanations in speaker scripts or notes.
For mechanism pages, prioritize drawing "input → processing → output" instead of abstract inspiration.
For data pages, only include the 1-3 most convincing numbers.
Real paper figures/screenshots are usually more credible than AI-generated ones; prioritize planning the placement of real materials when available.
Do not overuse white space. The main visual, chart, or evidence area should usually occupy 60%-80% of the page, except for covers or section pages.
Decks with more than 8 pages must vary their rhythm: alternate between cover, problem, method, mechanism, evidence, and conclusion pages.
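
The fixed per-page structure lends itself to a quick automated check before any prompts are written. The following is a sketch only, assuming `outline.md` follows the `## NN. Title` heading and `- Key: value` bullet layout shown above; the function names `parse_outline` and `outline_problems` are hypothetical and not part of the skill.

```python
import re

# Keys taken from the fixed page structure shown above.
REQUIRED_KEYS = (
    "Role", "Message", "Visual", "Text",
    "Evidence", "Source visual", "Repair handle",
)

HEADING = re.compile(r"^## (\d{2})\. (.+)$")
FIELD = re.compile(r"^- ([^:]+): ?(.*)$")


def parse_outline(text: str) -> list[dict]:
    """Split outline text into slides, each with a number, title, and fields."""
    slides = []
    for line in text.splitlines():
        heading = HEADING.match(line)
        if heading:
            slides.append(
                {"number": heading.group(1), "title": heading.group(2), "fields": {}}
            )
            continue
        field = FIELD.match(line)
        if field and slides:
            slides[-1]["fields"][field.group(1).strip()] = field.group(2).strip()
    return slides


def outline_problems(slides: list[dict]) -> list[str]:
    """List every required key that is missing or empty, slide by slide."""
    problems = []
    for slide in slides:
        for key in REQUIRED_KEYS:
            if not slide["fields"].get(key):
                problems.append(f"slide {slide['number']}: missing {key}")
    return problems
```

Running `outline_problems(parse_outline(text))` on the outline text returns an empty list when every page carries all seven keys. The softer rules, such as one main point per page, still need a human read.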

Step 5: Prompt Files

Write a prompt file for each page before calling any image generation tool.

Path:

text

paper-deck/{topic-slug}/
├── analysis.md
├── deck-brief.md
├── outline.md
├── prompts/
│   ├── 01-slide-cover.md
│   ├── 02-slide-context.md
│   └── ...
├── images/
│   ├── 01-slide-cover.png
│   ├── 02-slide-context.png
│   └── ...
├── {topic-slug}.pptx
└── {topic-slug}.pdf
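
The layout above could be scaffolded with a few lines of `pathlib`. This is an illustrative sketch, not a script shipped with the skill: `scaffold_deck` and `page_paths` are hypothetical names, and the `NN-slide-{role}` stem is inferred from the two example file names in the tree.

```python
from pathlib import Path


def scaffold_deck(root: Path, topic_slug: str) -> Path:
    """Create paper-deck/{topic-slug}/ with its prompts/ and images/ folders."""
    deck_dir = root / "paper-deck" / topic_slug
    for sub in ("prompts", "images"):
        (deck_dir / sub).mkdir(parents=True, exist_ok=True)
    # Create empty source files if they do not exist yet; existing content is kept.
    for name in ("analysis.md", "deck-brief.md", "outline.md"):
        (deck_dir / name).touch(exist_ok=True)
    return deck_dir


def page_paths(deck_dir: Path, number: int, role: str) -> tuple[Path, Path]:
    """Return the (prompt file, image file) pair for one page."""
    stem = f"{number:02d}-slide-{role}"
    return deck_dir / "prompts" / f"{stem}.md", deck_dir / "images" / f"{stem}.png"
```

Keeping the prompt and image stems identical makes it easy to tell which prompt produced which page during revision.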

Read `references/prompt-template.md` for prompt writing guidelines.

Hard rules:

The prompt must explicitly specify 16:9.
The prompt must clearly state style, composition, text language, and text quantity limits.
Do not let the model generate page numbers, logos, watermarks, or PPT frames.
If precise text is needed, minimize text within the image; a hybrid text layer can be added later.
If real materials are used on this page, the prompt must explain how the materials are integrated into the page: embedded, cropped, carried by glass panels, annotated, magnified, rather than letting the model redraw facts from scratch.
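
Some of the hard rules can be checked mechanically before an image backend is called. The sketch below assumes each prompt file marks its sections with lines such as `Style:` and `Composition:`; those labels are an assumption for illustration, since the actual layout is defined in `references/prompt-template.md`, and `lint_prompt` is a hypothetical helper.

```python
import re

ASPECT = re.compile(r"\b16\s*:\s*9\b")

# Assumed section labels; adjust them to whatever the prompt template really uses.
ASSUMED_LABELS = ("Style", "Composition", "Text language", "Text limit")


def lint_prompt(text: str) -> list[str]:
    """Return a list of hard-rule violations found in one prompt file."""
    problems = []
    if not ASPECT.search(text):
        problems.append("aspect ratio 16:9 is not stated")
    lowered = text.lower()
    for label in ASSUMED_LABELS:
        if f"{label.lower()}:" not in lowered:
            problems.append(f"no '{label}:' line found")
    return problems
```

A check like this only catches omissions. Whether the prompt keeps page numbers, logos, and watermarks out of the image, or integrates real materials correctly, still has to be read by a person.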

Step 6: Generate Images

Image backend options:

Use the built-in `imagegen` first in a Codex environment.
If the user specifies a backend such as `baoyu-imagine`, Gemini, OpenAI, or Seedream, follow the user's choice.
If no available image generation backend is present, stop and inform the user that a raster image backend is required.

Generation strategy:

First generate page 1 as the style anchor.
For subsequent pages, if the backend supports reference images, use page 1 as the style reference to reduce style drift.
Check thumbnails every 3-4 pages; if style drift is found, revise the prompt before continuing.
Save failed pages; do not overwrite successful ones.
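
The generation strategy can be sketched as a loop over the prompt files. The backend here is a hypothetical callable that takes the prompt text and an optional reference image and returns image bytes; no real image API is implied, and `generate_deck` is not a function provided by the skill.

```python
from pathlib import Path
from typing import Callable, Optional

# Hypothetical backend signature: (prompt text, reference image or None) -> image bytes.
Backend = Callable[[str, Optional[Path]], bytes]


def generate_deck(deck_dir: Path, backend: Backend, check_every: int = 4) -> list[Path]:
    """Generate missing page images in order, using page 1 as the style anchor."""
    prompts = sorted((deck_dir / "prompts").glob("[0-9][0-9]-*.md"))
    images_dir = deck_dir / "images"
    images_dir.mkdir(exist_ok=True)
    anchor: Optional[Path] = None
    written = []
    for index, prompt_path in enumerate(prompts, start=1):
        target = images_dir / f"{prompt_path.stem}.png"
        if not target.exists():  # never overwrite a page that already succeeded
            prompt_text = prompt_path.read_text(encoding="utf-8")
            target.write_bytes(backend(prompt_text, anchor))
            written.append(target)
        if anchor is None:
            anchor = target  # page 1 becomes the style reference for later pages
        if index % check_every == 0:
            print(f"Review thumbnails up to page {index} before continuing")
    return written
```

Because existing images are skipped, the function can be rerun after a failure and will only fill in the pages that are still missing.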

Step 7: Synthesize PPTX/PDF

After generating images, run:

bash

python3 <SKILL_ROOT>/scripts/merge_deck.py paper-deck/{topic-slug}

The script reads `images/NN-*.png|jpg|webp` and outputs a corresponding `.pptx` and `.pdf`. Each image fills a full 16:9 page.
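
To preview which files would become slides, and in what order, the image-selection step can be approximated with the standard library. This is an assumption about the behavior described above, not the actual contents of `merge_deck.py`; `collect_slide_images` is a hypothetical helper.

```python
import re
from pathlib import Path

# Two-digit page number, a hyphen, any name, then one of the listed extensions.
SLIDE_IMAGE = re.compile(r"^(\d{2})-.+\.(png|jpg|webp)$", re.IGNORECASE)


def collect_slide_images(deck_dir: Path) -> list[Path]:
    """Return images named NN-*.png|jpg|webp, sorted by page number then name."""
    matches = []
    for path in (deck_dir / "images").iterdir():
        found = SLIDE_IMAGE.match(path.name)
        if path.is_file() and found:
            matches.append((int(found.group(1)), path.name, path))
    return [path for _, _, path in sorted(matches)]
```

Comparing the length of this list with the planned `slide_count` is a cheap way to catch a missing or duplicated page before synthesis.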

Step 8: Quality Check

Check against `references/quality-gate.md` before delivery:

Does it look like a real design work at first glance, rather than a template stack?
Does each page only carry one main point?
Is there excessive meaningless white space; does key content occupy enough of the page?
Are real material pages clearly recorded with source, page number/figure number and placement?
Is the style consistent?
Is the text in the image clear, free of typos, and not gibberish?
Are common AI issues present: fake UI, fake logos, garbled labels, excessive cyberpunk style, plastic 3D, meaningless decorations?
Can the PPTX/PDF be opened, and is the number of pages correct?

Step 9: Revision

Always modify source files first when revising:

| User says | Action |
| --- | --- |
| "Make page 5 more academic" | Revise `prompts/05-*.md`, keep the old image, generate a new one |
| "Unify the texture to match page 1" | Add the style anchor of page 1 to relevant prompts |
| "Too much text on page 7" | Modify the Text section in the outline, then revise the prompt |
| "Only redo the background, don't change the content" | Keep Message/Text in the prompt, rewrite the Visual section |
| "Add a new page for mechanism details" | Update the outline, add a new prompt, generate the image, rerun the synthesis script |

Do not use programs to alter text on generated images. If text is incorrect, revise the prompt and regenerate, or switch to the hybrid text layer solution.
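
One way to keep the old image when a page is regenerated is to move it aside under a versioned name first. The sketch below assumes an `images/archive/` subfolder, which is a hypothetical convention chosen so that archived files no longer match the `images/NN-*` pattern; the skill itself does not say where old images should live.

```python
import shutil
from pathlib import Path


def archive_page_image(deck_dir: Path, page: int) -> list[Path]:
    """Move the current image(s) for one page into images/archive/ with a version suffix."""
    images_dir = deck_dir / "images"
    archive_dir = images_dir / "archive"  # hypothetical location, not defined by the skill
    archive_dir.mkdir(parents=True, exist_ok=True)
    moved = []
    for image in sorted(images_dir.glob(f"{page:02d}-*")):
        if not image.is_file():
            continue
        # Pick the first unused version number so earlier archives are never replaced.
        version = 1
        while (archive_dir / f"{image.stem}.v{version}{image.suffix}").exists():
            version += 1
        destination = archive_dir / f"{image.stem}.v{version}{image.suffix}"
        shutil.move(str(image), str(destination))
        moved.append(destination)
    return moved
```

After archiving, the revised prompt can be sent to the image backend and the synthesis script rerun, with every earlier version still available for comparison.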

Reference Files

`references/style-system.md`: Style presets and selection rules
`references/layouts.md`: Common page roles and compositions
`references/source-visuals.md`: Usage strategies for PDF screenshots, paper charts, and user-provided images
`references/prompt-template.md`: Page-by-page image generation prompt templates
`references/quality-gate.md`: Pre-delivery inspection and revision standards