Search Results: ocr

Found 83 Skills

Document Processingaidenwu0209/paddleocr-ski...

paddleocr-doc-parsing

Advanced document parsing with PaddleOCR. Returns complete document structure including text, tables, formulas, charts, and layout information. Claude extracts relevant content based on user needs.

🇺🇸|EnglishTranslated

5 scripts/Checked

AI & Machine Learningcharleswiltgen/axiom

axiom-vision-ref

Vision framework API, VNDetectHumanHandPoseRequest, VNDetectHumanBodyPoseRequest, person segmentation, face detection, VNImageRequestHandler, recognized points, joint landmarks, VNRecognizeTextRequest, VNDetectBarcodesRequest, DataScannerViewController, VNDocumentCameraViewController, RecognizeDocumentsRequest

🇺🇸|EnglishTranslated

Document Processingaffaan-m/everything-claud...

visa-doc-translate

Translate visa application documents (images) to English and create a bilingual PDF with original and translation

🇺🇸|EnglishTranslated

Document Processingclaude-office-skills/skil...

layout-analyzer

🇺🇸|EnglishTranslated

AI & Machine Learningclaude-office-skills/skil...

smart-ocr

🇺🇸|EnglishTranslated

Document Processingexistential-birds/beagle

docling

Docling document parser for PDF, DOCX, PPTX, HTML, images, and 15+ formats. Use when parsing documents, extracting text, converting to Markdown/HTML/JSON, chunking for RAG pipelines, or batch processing files. Triggers on DocumentConverter, convert, convert_all, export_to_markdown, HierarchicalChunker, HybridChunker, ConversionResult.

🇺🇸|EnglishTranslated

Data Processingmindmorass/reflex

pdf-harvester

Extract text and data from PDF documents

🇺🇸|EnglishTranslated

Document Processingkrishagel/geoffrey

pdf-to-markdown

Convert PDF to clean Markdown with image content described as text. Use when user wants to convert a PDF to markdown, extract content from PDF, or prepare PDF content for AI tools.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningminimax-ai/skills

vision-analysis

Analyze, describe, and extract information from images using the MiniMax vision MCP tool. Use when: user shares an image file path or URL (any message containing .jpg, .jpeg, .png, .gif, .webp, .bmp, or .svg file extension) or uses any of these words/phrases near an image: "analyze", "analyse", "describe", "explain", "understand", "look at", "review", "extract text", "OCR", "what is in", "what's in", "read this image", "see this image", "tell me about", "explain this", "interpret this", in connection with an image, screenshot, diagram, chart, mockup, wireframe, or photo. Also triggers for: UI mockup review, wireframe analysis, design critique, data extraction from charts, object detection, person/animal/activity identification. Triggers: any message with an image file extension (jpg, jpeg, png, gif, webp, bmp, svg), or any request to analyze/describ/understand/review/extract text from an image, screenshot, diagram, chart, photo, mockup, or wireframe.

🇺🇸|EnglishTranslated

Document Processingsickn33/antigravity-aweso...

pdf-official

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmaticall...

🇺🇸|EnglishTranslated

8 scripts/Checked

AI & Machine Learningmembranedev/application-s...

azure-ai-vision

Azure AI Vision integration. Manage data, records, and automate workflows. Use when the user wants to interact with Azure AI Vision data.

🇺🇸|EnglishTranslated

Tools & Utilitiesfamaoai-creator/gemini-sk...

doc-to-text

Extract text content from various file formats. Supports PDF, Excel, Word, Images (OCR), Email, and ZIP Archives. Use for summarizing or analyzing binary files.

🇺🇸|EnglishTranslated

1 scripts/Checked