Search Results: ocr

Found 84 Skills

Document Processingmicrock/ordinary-claude-s...

markitdown

Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.

🇺🇸|EnglishTranslated

AI & Machine Learningiankiku/forwward-teams

medic

Use when analyzing patient records, clinical notes, medical PDFs, FHIR data, or advising on how to present medical data in health-tech products — OCR interpretation, clinical summarization, differential diagnosis support, drug interaction flags

🇺🇸|EnglishTranslated

AI & Machine Learningvamseeachanta/workspace-h...

document-rag-pipeline

Build complete document knowledge bases with PDF text extraction, OCR for scanned documents, vector embeddings, and semantic search. Use this for creating searchable document libraries from folders of PDFs, technical standards, or any document collection.

🇺🇸|EnglishTranslated

Tools & Utilities0xdarkmatter/claude-mods

markitdown

Convert local documents to Markdown using Microsoft's markitdown CLI. Best for: PDF, Word, Excel, PowerPoint, images (OCR), audio. Can fetch URLs but Jina is faster for web. Triggers on: convert to markdown, read PDF, parse document, extract text from, docx, xlsx, pptx, OCR image, local file.

🇺🇸|EnglishTranslated

AI & Machine Learningsreeram5678/india-market-...

corporate_spy

Handles fetching, reading, and summarizing official Indian corporate filings (BSE/NSE). Specialized in OCR for scanned PDFs.

🇺🇸|EnglishTranslated

Document Processingkjanat/paperless-mcp

paperless-ngx

Manages documents in Paperless-ngx via MCP tools. Searches, uploads, tags, organizes, and bulk-edits documents, correspondents, and document types. Use when working with Paperless-ngx, document management, OCR, or any mcp_paperless_* tool task.

🇺🇸|EnglishTranslated

2 scripts/Attention

Tools & Utilitiesxyuanbuilds/my_skills

ocr

Extract text from images using OCR. Use when the user needs to read text from screenshots, photos, or image files.

🇺🇸|EnglishTranslated

5 scripts/Attention

Document Processingguglxni/hyperbots-agent-s...

hyperbots-api

Integrate with HyperAPI for financial document processing - OCR text extraction, document classification, PDF splitting, and structured data extraction from invoices, receipts, and financial documents. Use when the user needs to parse PDFs, extract text from documents, classify document types, split multi-document PDFs, or extract structured entities like invoice numbers, vendor names, line items. Keywords: hyperapi, hyperbots, document parsing, OCR, PDF processing, invoice extraction, receipt processing, document classification, VLM, vision language model.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningcinience/alicloud-skills

aliyun-qwen-ocr

Use when OCR-specialized extraction is needed with Alibaba Cloud Model Studio Qwen OCR models (`qwen-vl-ocr`, `qwen-vl-ocr-latest`, and snapshots), including document parsing, table parsing, multilingual OCR, formula recognition, and key information extraction.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningmugnimaestra/video-frames...

video-frames

Extract frames from video files using ffmpeg for AI/LLM analysis. Use when (1) the user asks to analyze, describe, or summarize a video file, (2) the user wants to extract frames or screenshots from a video, (3) the user provides a video file (.mp4, .mov, .avi, .mkv, .webm, etc.) and asks questions about its visual content, (4) the user wants to identify scenes, objects, or events in a video, (5) the user wants timestamps overlaid on extracted frames for temporal reference. Converts video into JPEG frames that can be attached to LLM prompts as images. Requires ffmpeg on PATH. Supports scene-change detection, model-aware optimization (Claude/OpenAI/Gemini), quality presets (efficient/balanced/detailed/ocr), grayscale and high-contrast OCR mode, and automatic FPS calculation via --max-frames.

🇺🇸|EnglishTranslated

1 scripts/Attention

Document Processingfindinfinitelabs/chuuk

document-ocr-processing

Process scanned documents and images containing Chuukese text using OCR with specialized post-processing for accent characters and traditional formatting. Use when working with scanned books, documents, or images that contain Chuukese text that needs to be digitized.

🇺🇸|EnglishTranslated

AI & Machine Learningmicrosoft/github-copilot-...

azure-ai

Use for Azure AI: Search, Speech, OpenAI, Document Intelligence. Helps with search, vector/hybrid search, speech-to-text, text-to-speech, transcription, OCR. USE FOR: AI Search, query search, vector search, hybrid search, semantic search, speech-to-text, text-to-speech, transcribe, OCR, convert text to speech. DO NOT USE FOR: Function apps/Functions (use azure-functions), databases (azure-postgres/azure-kusto), general Azure resources.

🇺🇸|EnglishTranslated

154.8k