Converts PDF pages to images and uses vision analysis to extract content including diagrams, charts, and visual elements. Use for PDFs with rich visual content. Requires pdf2image and poppler-utils.
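The conversion step can be sketched roughly as follows. This is not the skill's actual `scripts/pdf_to_images.py`, only a minimal illustration built on `pdf2image.convert_from_path`. The helper names (`default_output_dir`, `page_filename`, `pdf_to_images`), the default DPI of 200, and the choice to place the `[pdf_name]_pages` directory next to the PDF are assumptions. The `page_001.png` naming follows the pattern shown elsewhere in this document.

```python
from pathlib import Path
from typing import List, Optional


def default_output_dir(pdf_path: str) -> Path:
    """Return '<pdf_name>_pages' beside the PDF (location is an assumption)."""
    pdf = Path(pdf_path)
    return pdf.with_name(f"{pdf.stem}_pages")


def page_filename(page_number: int) -> str:
    """Zero-padded, 1-based page file name such as 'page_001.png'."""
    return f"page_{page_number:03d}.png"


def pdf_to_images(pdf_path: str, output_dir: Optional[str] = None,
                  dpi: int = 200) -> List[Path]:
    """Render every page of a PDF to PNG and return the written paths."""
    # Imported lazily so the helpers above work without pdf2image installed.
    # pdf2image itself needs the poppler utilities available on the system.
    from pdf2image import convert_from_path

    out_dir = Path(output_dir) if output_dir else default_output_dir(pdf_path)
    out_dir.mkdir(parents=True, exist_ok=True)

    written = []
    for number, image in enumerate(convert_from_path(pdf_path, dpi=dpi), start=1):
        target = out_dir / page_filename(number)
        image.save(target, "PNG")
        written.append(target)
    return written
```

Calling `pdf_to_images("document.pdf", "./images", 300)` would mirror the command-line example `python scripts/pdf_to_images.py document.pdf ./images 300`.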
npx skill4agent add childbamboo/claude-code-marketplace-sample pdf-vision-reader

# 1. Convert the PDF to images
wsl python3 scripts/pdf_to_images.py "/mnt/c/path/to/file.pdf"
# 2. Load each image with the Read tool and analyze it
# 3. Compile the results into Markdown

# Python packages
wsl pip3 install pdf2image Pillow
# System package (poppler)
wsl sudo apt-get update
wsl sudo apt-get install -y poppler-utils

wsl python3 scripts/pdf_to_images.py "/mnt/c/path/to/document.pdf"

document_pages/page_001.png
document_pages/page_002.png
document_pages/page_003.png

Please provide a detailed description of this image's content including:
- Titles and headings
- Body text
- Diagram and chart descriptions
- Graph and chart data
- Key points

User: "Analyze presentation.pdf using vision and convert it to Markdown"
Assistant:
1. Convert the PDF to images using scripts/pdf_to_images.py
2. Load each image with the Read tool
3. Analyze each page's content (titles, diagrams, text)
4. Integrate analysis results from all pages
5. Save as a Markdown file using the Write tool

User: "Analyze only pages 5-10 of document.pdf"
Assistant:
1. Convert the PDF to images (all pages)
2. Load only page_005.png to page_010.png using Read
3. Convert the relevant pages' content to Markdown

# [PDF Title]
**Analysis Date:** YYYY-MM-DD
**Total Pages:** N
---
## Page 1: [Page Title]
### Overview
[Page overview description]
### Key Content
- [Point 1]
- [Point 2]
### Diagrams and Charts
**Figure 1: [Diagram Title]**
[Diagram description]
### Text Content
[Page text content]
---
## Page 2: [Page Title]
...

python scripts/pdf_to_images.py <pdf_path> [output_dir] [dpi]
# Example
python scripts/pdf_to_images.py document.pdf ./images 300

[pdf_name]_pages/page_001.png
[pdf_name]_pages/page_002.png

| PDF Type | Recommended Skill |
|---|---|
| Text-focused documents | pdf-reader |
| Presentation materials | pdf-vision-reader |
| Materials with many diagrams/graphs | pdf-vision-reader |
| Technical drawings/blueprints | pdf-vision-reader |
| Research papers (with diagrams) | pdf-vision-reader |
| Simple text PDFs | pdf-reader |
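
The Markdown report template shown earlier can also be assembled programmatically once each page has been analyzed. The sketch below is one possible way to do it. The `PageAnalysis` container and `render_report` function are hypothetical names introduced for illustration, and restarting figure numbers on every page is an assumption, since the template only shows "Figure 1".

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Tuple


@dataclass
class PageAnalysis:
    """Hypothetical container for one page's vision-analysis result."""
    title: str
    overview: str
    key_points: List[str] = field(default_factory=list)
    figures: List[Tuple[str, str]] = field(default_factory=list)  # (title, description)
    text: str = ""


def render_report(pdf_title: str, pages: List[PageAnalysis],
                  analysis_date: date) -> str:
    """Fill the report template with one section per analyzed page."""
    lines = [
        f"# {pdf_title}",
        f"**Analysis Date:** {analysis_date.isoformat()}",
        f"**Total Pages:** {len(pages)}",
    ]
    for number, page in enumerate(pages, start=1):
        lines += ["---", f"## Page {number}: {page.title}",
                  "### Overview", page.overview, "### Key Content"]
        lines += [f"- {point}" for point in page.key_points]
        if page.figures:
            lines.append("### Diagrams and Charts")
            # Figure numbering restarts on each page (an assumption).
            for fig_no, (fig_title, description) in enumerate(page.figures, start=1):
                lines += [f"**Figure {fig_no}: {fig_title}**", description]
        lines += ["### Text Content", page.text]
    return "\n".join(lines) + "\n"
```

The returned string can then be saved as a Markdown file, which corresponds to the final step of the workflow described above.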

wsl pip3 install pdf2image

wsl sudo apt-get update
wsl sudo apt-get install -y poppler-utils

python scripts/pdf_to_images.py document.pdf ./images 300

| Number of Pages | Image Conversion | Analysis (Claude Vision) | Total |
|---|---|---|---|
| 10 Pages | 5 seconds | 30-60 seconds | ~1 minute |
| 30 Pages | 15 seconds | 90-180 seconds | ~3 minutes |
| 100 Pages | 50 seconds | 300-600 seconds | ~10 minutes |
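
The figures in the table work out to roughly 0.5 seconds per page for image conversion and 3 to 6 seconds per page for analysis. A rough estimate for other page counts can be sketched as below, assuming the cost scales linearly with page count. The function name `estimate_seconds` is hypothetical, and the default rates are simply read off the table rather than measured.

```python
from typing import Tuple


def estimate_seconds(pages: int,
                     convert_per_page: float = 0.5,
                     analyze_per_page: Tuple[float, float] = (3.0, 6.0)) -> Tuple[float, float]:
    """Return (low, high) total seconds using per-page rates read off the table."""
    low = pages * (convert_per_page + analyze_per_page[0])
    high = pages * (convert_per_page + analyze_per_page[1])
    return low, high


low, high = estimate_seconds(30)
print(f"30 pages: about {low / 60:.1f} to {high / 60:.1f} minutes")
```

Actual times will vary with page complexity and the chosen DPI, so the result is best treated as an order-of-magnitude guide.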

C:\Users\... → /mnt/c/Users/...
D:\Projects\... → /mnt/d/Projects/...
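
The mapping above follows a simple rule: the drive letter is lowercased and placed under `/mnt/`, and backslashes become forward slashes. A minimal sketch of that rule is shown below. It assumes the default WSL drive mount point, and the function name `windows_to_wsl` is hypothetical.

```python
import re


def windows_to_wsl(path: str) -> str:
    """Map a Windows drive path to its default WSL mount path."""
    match = re.match(r"^([A-Za-z]):[\\/]*(.*)$", path)
    if not match:
        raise ValueError(f"not an absolute Windows drive path: {path!r}")
    drive, rest = match.groups()
    rest = rest.replace("\\", "/")  # Windows separators to POSIX separators
    mount = f"/mnt/{drive.lower()}"
    return f"{mount}/{rest}" if rest else mount


print(windows_to_wsl(r"C:\Users\me\file.pdf"))
```

Inside a WSL shell, the built-in `wslpath` utility performs the same conversion and also respects non-default mount settings.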