Found 27 Skills
Extract text from images using OCR. Use when the user shares a screenshot and you need to read the text content, copy UI labels, or extract copy from a design mockup.
Convert DOC, DOCX, and PDF files to TXT format. Invoke when user wants to extract text from these document types.
OCR Web Service integration. Manage documents. Use when the user wants to interact with OCR Web Service data.
Extract text and metadata from PDF files using pdf-parse. Use when: user uploads a PDF or asks to read/analyze PDF content. NOT for: creating PDFs, editing PDFs, or OCR on scanned documents.
High-precision Optical Character Recognition (OCR) service. Detects and extracts text from images in multiple languages and formats, and returns text-region coordinates and confidence scores. Suitable for document digitization and image content analysis.
Fetch and extract readable content from web pages. Use for lightweight page access without browser automation.
Expert in extracting text from images using Tesseract, EasyOCR, PaddleOCR, Google Vision, AWS Textract, and Claude Vision. Trigger: when extracting text from images, screenshots, scanned documents, or PDFs.
PDF text extraction, form filling, and merging using pypdf and pdfplumber.
Create, edit, and analyze Word documents, with support for tracked changes, comments, formatting preservation, and text extraction. Use this when you need to create .docx files, modify content, handle tracked changes or comments, or perform other document tasks.
Manipulate PDF documents programmatically. Merge, split, rotate, and watermark PDFs. Extract text and metadata. Handle form filling and encryption/decryption.
Process PDF files for text extraction, form filling, and document analysis. Use when you need to extract content from PDFs, fill forms, or analyze document structure.
Extract text from images using OCR. Use when the user needs to read text from screenshots, photos, or image files.