pdf-processing

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

PDF Processing Skill

PDF处理技能

Overview

概述

This skill enables agents to process PDF documents, extract text content, and analyze the information within them.
此技能使Agent能够处理PDF文档、提取文本内容并分析其中的信息。

When to Use

适用场景

Activate this skill when:
  • User asks to read or analyze a PDF file
  • User needs to extract text from a PDF
  • User wants to summarize a PDF document
  • User needs to search for information within a PDF
当出现以下情况时激活此技能:
  • 用户要求读取或分析PDF文件
  • 用户需要从PDF中提取文本
  • 用户想要总结PDF文档内容
  • 用户需要在PDF中搜索信息

Instructions

操作步骤

  1. First, verify the PDF file exists at the specified path
  2. Use appropriate tools to read the PDF content
  3. Extract text while preserving structure where possible
  4. For large PDFs, process in chunks to manage context
  5. Summarize or analyze based on user's specific request
  1. 首先,验证指定路径下的PDF文件是否存在
  2. 使用合适的工具读取PDF内容
  3. 在尽可能保留结构的前提下提取文本
  4. 对于大型PDF,分块处理以管理上下文
  5. 根据用户的具体请求进行总结或分析

Best Practices

最佳实践

  • Always confirm the file path before processing
  • Handle encrypted PDFs gracefully with appropriate error messages
  • For scanned PDFs, note that OCR may be required
  • Preserve important formatting like tables and lists
  • 处理前务必确认文件路径
  • 妥善处理加密PDF并返回合适的错误提示
  • 对于扫描版PDF,需注意可能需要使用OCR技术
  • 保留表格和列表等重要格式