Loading...
Loading...
Use when building features that answer questions from private data, documents, policies, or time-sensitive information — RAG architecture, chunking strategies, hybrid search, re-ranking, vector databases, evaluation, agentic RAG, multimodal RAG...
npx skill4agent add peterbamuhigire/skills-web-dev ai-rag-patternsai-rag-patternsreferencesSKILL.md| Category | Artifact | Format | Example |
|---|---|---|---|
| Correctness | RAG retrieval evaluation report | Markdown doc covering recall / precision / answer-quality on a fixed eval set | |
| Data safety | Index ingestion + tenancy isolation note | Markdown doc covering chunking, source filtering, and per-tenant index segregation | |
references/| Condition | Action |
|---|---|
| Knowledge base < 200K tokens (~500 pages) | Include everything in context — no RAG needed |
| Knowledge base > 200K tokens | Use RAG |
| Data changes frequently (menus, prices, stock) | RAG (update documents, not model) |
| Data is private/confidential | RAG (keeps data out of training pipelines) |
| Need source citations | RAG (chunks are traceable to source) |
| Model needs brand voice / domain jargon | Fine-tune instead |
| Factor | RAG | Fine-Tuning |
|---|---|---|
| Up-to-date content | ✅ Yes (add docs anytime) | ❌ Stale until retrained |
| Hallucinations | ✅ Lower (document-grounded) | ❌ Higher |
| Source citations | ✅ Yes | ❌ No |
| Brand voice control | ❌ Weak | ✅ Strong |
| Domain jargon | ❌ Weak | ✅ Strong |
| Up-front cost | ✅ Lower | ❌ High |
Pipeline ArchitectureChunking StrategiesEmbedding Model SelectionVector Database SelectionRetrieval AlgorithmsRe-RankingFull RAG Query AlgorithmQuery Rewriting (Multi-Turn)RAG Schema (Multi-Tenant)Evaluation FrameworkProduction PatternsAgentic RAGMultimodal RAGEdge CasesCost OptimisationSourcesRAG Maturity ModelQuery TransformationContextual CompressionSelf-RAGRAGAS EvaluationEmbedding PipelineCost Management Decision TreeFailure Mode PlaybookGates Before Shipping