Total 50,320 skills, Document Processing has 738 skills
Showing 12 of 738 skills
Extract structured data from multiple documents into comparison matrix with citations. Use for bulk document review.
Manipulate PDF files — extract text, count pages, render thumbnails, merge or split documents. Use for PDF-specific operations that don't fit `markdown-converter` (general read) or `pandic-office` (write from markdown).
Rewrite specified documents to remove AI-generated traces. Automatically select the most suitable humanization strategy (humanizer-zh / humanize-chinese / technical-writing), and iterate rewriting until the result meets the standard or reaches a maximum of 42 iterations. Suitable for de-AI processing of Chinese texts, including general articles, technical documents, academic papers, etc. Use when user says: "humanize this", "去AI味", "降AIGC", "人性化改写", "改成人话", "去除AI痕迹", "humanize document", "make text human-like", "去机器味", "降低AI率", "过AIGC检测"
Use when converting PDFs, DOCX, HTML, scanned papers, reports, proposals, tables, or figures into Markdown, text, extracted assets, or quality reports for an academic research repository.
Use when a Head of Ops, Knowledge Manager, or TPM-Internal needs to author, validate, or clean up company SOPs and internal runbooks (procurement intake, vendor offboarding, incident-comms cascade, employee onboarding, expense reimbursement, system-access provisioning, customer-escalation playbook) — including 5W2H completeness checks (Who-What-When-Where-Why-How-HowMuch), cross-link and orphan-page validation across a sprawling Notion/Confluence/Obsidian wiki, KB ingestion + hygiene reporting, ops onboarding doc generation, and runbook step verification (named owner, expected duration, observable success signal, rollback path, escalation contact). Pairs Kaoru Ishikawa's 5W2H method, Atul Gawande's *The Checklist Manifesto*, ISO 9001, ITIL v4 Service Operation, FDA 21 CFR Part 211, and Google SRE Workbook runbook discipline with deterministic stdlib-only Python tools that score completeness, detect anti-patterns, and emit prioritized cleanup lists. Distinct from `engineering/llm-wiki` (Karpathy-style personal PKM second brain), `engineering-team/runbook-generator` (system-ops production debugging runbook), `project-management/*` (Jira/Confluence delivery + ticket tracking), and sibling `business-operations/process-mapper` (BPMN process *design*, while knowledge-ops is process *documentation*).
Translates a contract review into a summary the business stakeholder will actually read. Not a legal memo — a two-minute answer to "can I sign this and what do I need to know." Use when user says "summarize for the business", "write this up for [stakeholder]", "explain this to procurement", "non-legal summary", or when a review is done and needs to go to someone outside legal.
Converts Notion databases and pages to structured Markdown format using Notion MCP. Use when working with Notion data visualization, documentation, or exporting Notion content.
Extract text from PDFs/scans (pymupdf, marker-pdf).
7 ocr & translation skills. Trigger: scanning documents, recognizing formulas, translating academic papers. Design: specialized OCR (LaTeX, handwriting) and translation for scholarly content.
Polish and rewrite the prose of academic LaTeX (.tex) files to sound professional and domain-appropriate, while leaving commands and structure intact.
Review contracts and legal agreements (PDF, Word, images) for risks, unfair clauses, missing provisions, and key obligations using SoMark for accurate document parsing. Provides structured risk analysis with severity ratings. Requires SoMark API Key (SOMARK_API_KEY).
Process and transform markdown documents. Convert to HTML, extract headings, generate TOC, check links. Use when working with documentation or markdown content.