Search Results: text-extraction

Found 54 Skills

financial-document-processor

Guidance for processing financial documents (invoices, receipts, statements) with OCR and text extraction. This skill should be used when tasks involve extracting data from financial PDFs or images, generating summaries (CSV/JSON), or moving/organizing processed documents. Emphasizes data safety practices to prevent catastrophic data loss.

🇺🇸|EnglishTranslated

Tools & Utilitiesxyuanbuilds/my_skills

ocr

Extract text from images using OCR. Use when the user needs to read text from screenshots, photos, or image files.

🇺🇸|EnglishTranslated

5 scripts/Attention

Document Processingshareai-lab/learn-claude-...

pdf

Process PDF files - extract text, create PDFs, merge documents. Use when user asks to read PDF, create PDF, or work with PDF files.

🇺🇸|EnglishTranslated

AI & Machine Learningvamseeachanta/workspace-h...

document-rag-pipeline

Build complete document knowledge bases with PDF text extraction, OCR for scanned documents, vector embeddings, and semantic search. Use this for creating searchable document libraries from folders of PDFs, technical standards, or any document collection.

🇺🇸|EnglishTranslated

Document Processingguglxni/hyperbots-agent-s...

hyperbots-api

Integrate with HyperAPI for financial document processing - OCR text extraction, document classification, PDF splitting, and structured data extraction from invoices, receipts, and financial documents. Use when the user needs to parse PDFs, extract text from documents, classify document types, split multi-document PDFs, or extract structured entities like invoice numbers, vendor names, line items. Keywords: hyperapi, hyperbots, document parsing, OCR, PDF processing, invoice extraction, receipt processing, document classification, VLM, vision language model.

🇺🇸|EnglishTranslated

1 scripts/Checked

Data Processingaffaan-m/everything-claud...

regex-vs-llm-structured-text

Decision framework for choosing between regex and LLM when parsing structured text — start with regex, add LLM only for low-confidence edge cases.

🇺🇸|EnglishTranslated

Tools & Utilitiespascalorg/skills

image-to-text

Extract text from images using OCR. Use when the user shares a screenshot and you need to read the text content, copy UI labels, or extract copy from a design mockup.

🇺🇸|EnglishTranslated

2 scripts/Checked

AI & Machine Learningletta-ai/skills

extracting-pdf-text

Extract text from PDFs for LLM consumption. Use when processing PDFs for RAG, document analysis, or text extraction. Supports API services (Mistral OCR) and local tools (PyMuPDF, pdfplumber). Handles text-based PDFs, tables, and scanned documents with OCR.

🇺🇸|EnglishTranslated

4 scripts/Checked

Document Processingleastbit/claude_skills_zh...

pdf

全面的 PDF 操作工具包，用于提取文本和表格、创建新 PDF、合并/拆分文档以及处理表单。当 Claude 需要填写 PDF 表单或以编程方式大规模处理、生成或分析 PDF 文档时使用。

🇺🇸|EnglishTranslated

8 scripts/Checked

AI & Machine Learningparlamento-ai/parlamento-...

mistral-ocr

Extract text from images and PDFs using Mistral OCR API. Convert scanned documents to Markdown, JSON, or plain text. No external dependencies required. Use when you need OCR, extract text from images, convert PDFs to markdown, or digitize documents.

🇺🇸|EnglishTranslated

Tools & Utilitieszhayujie/chatgpt-on-wecha...

web-fetch

Fetch and extract readable content from web pages. Use for lightweight page access without browser automation.

🇺🇸|EnglishTranslated

1 scripts/Attention

Document Processingaig787/agpm

pdf-processor

Process PDF files for text extraction, form filling, and document analysis. Use when you need to extract content from PDFs, fill forms, or analyze document structure.

🇺🇸|EnglishTranslated

1 scripts/Checked