Extract valuable legal question-answer pairs (client question + lawyer's answer) from lawyer-client communication records and generate structured knowledge base content. It is suitable for converting client consultations into reusable legal knowledge, which can be further organized into articles, cases, or learning materials.
Core Value: Extract Q&A content that can be reused by other clients, rather than the specific case details of a single client.
核心原则
Core Principles
客户信息保护
Client Information Protection
绝对禁止在问答内容中包含:
❌ 客户真实姓名
❌ 企业/机构名称
❌ 具体地址、联系方式
❌ 任何可识别个人或企业的信息
脱敏处理标准:
✅ 客户姓名 → 使用"当事人"、"客户"、"甲/乙"等代称
✅ 企业名称 → 使用"某公司"、"某企业"、"该企业"等代称
✅ 具体信息 → 模糊化为"某地"、"某时间"、"相关人员"等
✅ 客户原话 → 脱敏后引用:"客户询问..."而非"张总说..."
Strictly Prohibited from including the following in Q&A content:
❌ Client's real name
❌ Enterprise/institution name
❌ Specific address, contact information
❌ Any information that can identify an individual or enterprise
Desensitization Standards:
✅ Client's name → Use generics such as "the party", "client", "Party A/Party B"
✅ Enterprise name → Use generics such as "a certain company", "a certain enterprise", "the enterprise"
✅ Specific information → Blur into "a certain place", "a certain time", "relevant personnel", etc.
✅ Client's original words → Cite after desensitization: "The client asked..." instead of "Mr. Zhang said..."
问答对的价值判断
Value Judgment of Q&A Pairs
提取的问答应该满足以下条件之一:
普遍性:其他客户也可能遇到同类问题
典型性:代表了某类法律场景的常见关切
教育性:律师的解答有普法价值或指导意义
复用性:可以独立成文或作为知识库条目
The extracted Q&A pairs should meet at least one of the following conditions:
Universality: Other clients may encounter similar problems
Typicality: Represents common concerns in a certain type of legal scenario
Educational Value: The lawyer's answer has popularization value or guiding significance
Reusability: Can be independently written into an article or used as a knowledge base entry
工作流程
Workflow
步骤 1:分析输入内容
Step 1: Analyze Input Content
获取沟通记录文档和用户的具体要求,理解:
文档中的沟通内容和情境
用户希望重点关注哪些方面
哪些问答具有复用价值
Obtain the communication record document and the user's specific requirements, and understand:
The communication content and context in the document
Which aspects the user wants to focus on
Which Q&A pairs have reuse value
步骤 2:识别有价值的问答对
Step 2: Identify Valuable Q&A Pairs
完整梳理文档中的问答内容,按以下维度识别价值:
Completely sort out the Q&A content in the document, and identify its value based on the following dimensions:
问答识别维度
Q&A Identification Dimensions
直接问答 - 客户明确提出问题,律师给出解答
追问回应 - 客户追问,律师进一步解释
风险提醒 - 律师主动提醒的风险点及应对建议
操作指引 - 涉及流程、步骤、注意事项的问答
普遍关切 - 其他客户也可能遇到的同类问题
Direct Q&A - The client clearly raises a question, and the lawyer provides an answer
Follow-up Response - The client follows up with a question, and the lawyer further explains
Risk Reminder - Risks proactively reminded by the lawyer and corresponding suggestions
Operation Guidelines - Q&A involving processes, steps, and precautions
Common Concerns - Similar problems that other clients may encounter
问答提取方法
Q&A Extraction Methods
语义分析:理解客户真实问题和律师核心观点
逻辑归纳:将分散的对话整理为独立问答对
价值评估:判断问答是否具有普遍适用性
标准化表述:将口语化表达转化为规范的问答格式
Semantic Analysis: Understand the client's real question and the lawyer's core viewpoints
Logical Induction: Organize scattered conversations into independent Q&A pairs
Value Evaluation: Judge whether the Q&A has universal applicability
Standardized Expression: Convert colloquial expressions into standardized Q&A formats
步骤 3:生成结构化报告
Step 3: Generate Structured Report
输出文件保存至原文档的同一目录,文件命名:
{原文档名}_法律问答提取_{YYYYMMDD}.md
输出格式详见 output-template.md。
每个问答对包含以下部分:
问题:客户提出的问题(标准化表述)
律师解答:律师的核心观点和建议(300-500字)
问题背景(可选):该问题出现的情境(200-300字)
价值标签:普遍性/典型性/教育性/复用性
适用场景:什么情况下读者会关心这个问答
Save the output file to the same directory as the original document, with the file name: