Search Results: text-extraction

Found 54 Skills

Document Processingk-dense-ai/claude-scienti...

docx

Document toolkit (.docx). Create/edit documents, tracked changes, comments, formatting preservation, text extraction, for professional document processing.

🇺🇸|EnglishTranslated

102

11 scripts/Attention

Data Processingdkyazzentwatwa/chatgpt-sk...

ocr-document-processor

Extract text from images and scanned PDFs using OCR. Supports 100+ languages, table detection, structured output (markdown/JSON), and batch processing.

🇺🇸|EnglishTranslated

1 scripts/Checked

Document Processingopenai/skills

pdf

Use when tasks involve reading, creating, or reviewing PDF files where rendering and layout matter; prefer visual checks by rendering pages (Poppler) and use Python tools such as `reportlab`, `pdfplumber`, and `pypdf` for generation and extraction.

🇺🇸|EnglishTranslated

Document Processingwshuyi/translate-pdf-skil...

translate-pdf

Translate PDF documents to any language while preserving original structure, layout, and styling (colors, backgrounds, positions). Use when user wants to: (1) translate a PDF to another language, (2) convert PDF from one language to another, (3) create translated version of PDF document. Triggers: "translate PDF", "PDF翻译", "把PDF翻译成", "translate this PDF to Chinese/English/Japanese", "翻译成中文/英文"

🇺🇸|EnglishTranslated

2 scripts/Checked

Document Processingmiantiao-me/bm.md

bm-md

Use bm.md service for Markdown typesetting, rendering and format conversion, supporting multiple platforms such as WeChat Official Account, Zhihu, Juejin, etc.

🇨🇳|ChineseTranslated

Document Processingdkyazzentwatwa/chatgpt-sk...

document-converter-suite

Convert between 8 formats (PDF, DOCX, PPTX, XLSX, TXT, CSV, MD, HTML). Best-effort text extraction, batch processing, and document format transformation.

🇺🇸|EnglishTranslated

23 scripts/Checked

Document Processingclaude-office-skills/skil...

pdf-extraction

Extract text, tables, and metadata from PDFs using pdfplumber

🇺🇸|EnglishTranslated

AI & Machine Learningclaude-office-skills/skil...

smart-ocr

🇺🇸|EnglishTranslated

AI & Machine Learningaktsmm/agent-skills

ocr-super-surya

GPU-optimized OCR using Surya. Use when: (1) Extracting text from images/screenshots, (2) Processing PDFs with embedded images, (3) Multi-language document OCR, (4) Layout analysis and table detection. Supports 90+ languages with 2x accuracy over Tesseract.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningjohnlindquist/claude

gemini-image

Analyze images using Gemini's vision capabilities. Use for image analysis, text extraction from screenshots, and visual content understanding.

🇺🇸|EnglishTranslated

Document Processingdevskale/skale-skills

markdown-converter

Convert documents to Markdown using markitdown. Use when you need to extract text and convert PDF, Word, PowerPoint, Excel, HTML, CSV, JSON, XML, images (with EXIF/OCR), audio, ZIP archives, YouTube URLs, or EPUBs to Markdown format for LLM processing or text analysis.

🇺🇸|EnglishTranslated

2 scripts/Checked

Document Processingchildbamboo/claude-code-m...

docx-reader

Reads Microsoft Word (.docx) files and extracts text content. Use when needing to read .docx documents. Requires python-docx package.

🇺🇸|EnglishTranslated

1 scripts/Checked