Total 50,313 skills, Document Processing has 738 skills
Showing 12 of 738 skills
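Use when converting document files such as PDF/PPTX/DOCX/XLSX/HTML/CSV/JSON/XML/ZIP/EPUB/images to md. Use for requests such as "convert document to md", "pdf to markdown", "pptx to markdown", "docx to markdown", "document extraction", or "backup solution md conversion". Also handles checking the markitdown installation, setting up the HWPX extraction environment, converting a single file or a folder, and generating an error log.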
Applies Generic Service (GS) branding to Markdown documents and generates branded PDFs. This is the PRIMARY skill for generating PDFs from Markdown, creating proposals, or applying project-specific templates. Do not use the generic 'pdf' skill for generating documents from Markdown or React templates.
Create, modify, and manage Word documents.
Extract text, create PDFs, and handle forms. Useful for press releases, branded one-pagers, and printable design deliverables.
Generate professional Hebrew documents including PDF, DOCX, and PPTX with full RTL support and proper Hebrew typography. Use when user asks to create Hebrew PDF, generate Israeli business documents, "lehafik heshbonit", "litstor hozeh", build Hebrew Word document, create Hebrew PowerPoint, or produce Israeli templates such as Heshbonit Mas (tax invoice), Hozeh (contract), Hatza'at Mechir (proposal), or Protokol (meeting minutes). Covers reportlab, WeasyPrint, python-docx, and pptxgenjs with bidi paragraph support. Do NOT use for OCR or reading existing documents (use hebrew-ocr-forms instead).
Extract Feishu (Lark) Docs, Wiki pages, Wiki collections/hubs, spreadsheets, and Minutes (妙记) transcripts into clean, high-fidelity local Markdown. The primary path is the lark-cli API: programmatic extraction with no LLM rewriting of the body, which recursively follows a collection's reference graph (mention-doc / sheet / cross-tenant links) and uses error codes to resolve permission boundaries precisely. A browser-DOM path is the fallback, used only when lark-cli cannot reach the content. Use this whenever the source is a Feishu/Lark URL and fidelity matters, including requests to export Feishu docs, collections, or Minutes transcripts, to convert a Feishu wiki or knowledge base to Markdown, to scrape or archive a Feishu collection, or to save a Feishu page locally, even if the user only says clipping, archiving, converting, or "save this". Also covers the permission-denied path (owner-exported .docx → faithful Markdown with heading/highlight restoration).
Verify citations and references in scientific documents to detect hallucinated or invalid sources. Extracts DOIs, URLs, arXiv IDs, PubMed IDs, and ISBNs from Markdown, LaTeX, org-mode, and plain text, then validates them using API lookups and web fetches. Use this skill when reviewing AI-generated content for citation accuracy; validating references in papers, reports, or documentation; checking whether DOIs/URLs resolve to actual papers; or auditing a document for broken or fake citations.
Generate SKILL.md and TOC.md for a documentation package. Use standalone to regenerate artifacts without re-downloading.
Convert each page of a PDF file into an image file. Supports custom image formats (PNG/JPG) and resolution. Suitable for document processing and image-based archiving.
Converts Jira issues and backlog data to structured Markdown format using Atlassian MCP. Use when working with Jira data visualization, documentation, or reporting.
Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.