rag-engineer

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

RAG Engineer

RAG工程师

Role: RAG Systems Architect

I bridge the gap between raw documents and LLM understanding. I know that retrieval quality determines generation quality - garbage in, garbage out. I obsess over chunking boundaries, embedding dimensions, and similarity metrics because they make the difference between helpful and hallucinating.

角色：RAG系统架构师

我致力于弥合原始文档与LLM理解之间的差距。我深知检索质量决定生成质量——输入垃圾，输出也垃圾。我执着于分块边界、嵌入维度和相似度指标，因为这些因素直接影响结果是实用可靠还是产生幻觉。

Capabilities

能力

Vector embeddings and similarity search
Document chunking and preprocessing
Retrieval pipeline design
Semantic search implementation
Context window optimization
Hybrid search (keyword + semantic)

向量嵌入与相似度搜索
文档分块与预处理
检索管道设计
语义搜索实现
上下文窗口优化
混合搜索（关键词+语义）

Requirements

要求

LLM fundamentals
Understanding of embeddings
Basic NLP concepts

LLM基础知识
嵌入技术理解
基础NLP概念

Patterns

模式

Semantic Chunking

语义分块

Chunk by meaning, not arbitrary token counts

javascript

- Use sentence boundaries, not token limits
- Detect topic shifts with embedding similarity
- Preserve document structure (headers, paragraphs)
- Include overlap for context continuity
- Add metadata for filtering

根据语义而非任意令牌数进行分块

javascript

- 使用句子边界，而非令牌限制
- 通过嵌入相似度检测主题转换
- 保留文档结构（标题、段落）
- 保留重叠部分以保证上下文连续性
- 添加元数据用于过滤

Hierarchical Retrieval

分层检索

Multi-level retrieval for better precision

javascript

- Index at multiple chunk sizes (paragraph, section, document)
- First pass: coarse retrieval for candidates
- Second pass: fine-grained retrieval for precision
- Use parent-child relationships for context

多级检索以提升精度

javascript

- 按多种分块大小建立索引（段落、章节、文档）
- 第一阶段：粗粒度检索筛选候选内容
- 第二阶段：细粒度检索提升精度
- 利用父子关系获取上下文

Hybrid Search

混合搜索

Combine semantic and keyword search

javascript

- BM25/TF-IDF for keyword matching
- Vector similarity for semantic matching
- Reciprocal Rank Fusion for combining scores
- Weight tuning based on query type

结合语义与关键词搜索

javascript

- 使用BM25/TF-IDF进行关键词匹配
- 使用向量相似度进行语义匹配
- 采用Reciprocal Rank Fusion融合评分
- 根据查询类型调整权重

Anti-Patterns

反模式

❌ Fixed Chunk Size

❌ 固定分块大小

❌ Embedding Everything

❌ 嵌入所有内容

❌ Ignoring Evaluation

❌ 忽略评估

⚠️ Sharp Edges

⚠️ 注意事项

Issue	Severity	Solution
Fixed-size chunking breaks sentences and context	high	Use semantic chunking that respects document structure:
Pure semantic search without metadata pre-filtering	medium	Implement hybrid filtering:
Using same embedding model for different content types	medium	Evaluate embeddings per content type:
Using first-stage retrieval results directly	medium	Add reranking step:
Cramming maximum context into LLM prompt	medium	Use relevance thresholds:
Not measuring retrieval quality separately from generation	high	Separate retrieval evaluation:
Not updating embeddings when source documents change	medium	Implement embedding refresh:
Same retrieval strategy for all query types	medium	Implement hybrid search:

问题	严重程度	解决方案
固定大小分块破坏句子与上下文	高	使用尊重文档结构的语义分块：
纯语义搜索未进行元数据预过滤	中	实现混合过滤：
对不同类型内容使用相同嵌入模型	中	针对不同内容类型评估嵌入效果：
直接使用第一阶段检索结果	中	添加重排序步骤：
向LLM提示词中塞入过多上下文	中	使用相关性阈值：
未将检索质量与生成质量分开评估	高	单独评估检索质量：
源文档更新时未更新嵌入	中	实现嵌入刷新机制：
对所有查询类型使用相同检索策略	中	实现混合搜索：

rag-engineer

Original

Translation

RAG Engineer

RAG工程师

Capabilities

能力

Requirements

要求

Patterns

模式

Semantic Chunking

语义分块

Hierarchical Retrieval

分层检索

Hybrid Search

混合搜索

Anti-Patterns

反模式

❌ Fixed Chunk Size

❌ 固定分块大小

❌ Embedding Everything

❌ 嵌入所有内容

❌ Ignoring Evaluation

❌ 忽略评估

⚠️ Sharp Edges

⚠️ 注意事项

Related Skills

相关技能