RAG (Retrieval-Augmented Generation) implementation workflow covering embedding selection, vector database setup, chunking strategies, and retrieval optimization.
✓Works with OpenClaudeOverview
Specialized workflow for implementing RAG (Retrieval-Augmented Generation) systems including embedding model selection, vector database setup, chunking strategies, retrieval optimization, and evaluation.
When to Use This Workflow
Use this workflow when:
- Building RAG-powered applications
- Implementing semantic search
- Creating knowledge-grounded AI
- Setting up document Q&A systems
- Optimizing retrieval quality
Workflow Phases
Phase 1: Requirements Analysis
Skills to Invoke
ai-product- AI product designrag-engineer- RAG engineering
Actions
- Define use case
- Identify data sources
- Set accuracy requirements
- Determine latency targets
- Plan evaluation metrics
Copy-Paste Prompts
Use @ai-product to define RAG application requirements
Phase 2: Embedding Selection
Skills to Invoke
embedding-strategies- Embedding selectionrag-engineer- RAG patterns
Actions
- Evaluate embedding models
- Test domain relevance
- Measure embedding quality
- Consider cost/latency
- Select model
Copy-Paste Prompts
Use @embedding-strategies to select optimal embedding model
Phase 3: Vector Database Setup
Skills to Invoke
vector-database-engineer- Vector DBsimilarity-search-patterns- Similarity search
Actions
- Choose vector database
- Design schema
- Configure indexes
- Set up connection
- Test queries
Copy-Paste Prompts
Use @vector-database-engineer to set up vector database
Phase 4: Chunking Strategy
Skills to Invoke
rag-engineer- Chunking strategiesrag-implementation- RAG implementation
Actions
- Choose chunk size
- Implement chunking
- Add overlap handling
- Create metadata
- Test retrieval quality
Copy-Paste Prompts
Use @rag-engineer to implement chunking strategy
Phase 5: Retrieval Implementation
Skills to Invoke
similarity-search-patterns- Similarity searchhybrid-search-implementation- Hybrid search
Actions
- Implement vector search
- Add keyword search
- Configure hybrid search
- Set up reranking
- Optimize latency
Copy-Paste Prompts
Use @similarity-search-patterns to implement retrieval
Use @hybrid-search-implementation to add hybrid search
Phase 6: LLM Integration
Skills to Invoke
llm-application-dev-ai-assistant- LLM integrationllm-application-dev-prompt-optimize- Prompt optimization
Actions
- Select LLM provider
- Design prompt template
- Implement context injection
- Add citation handling
- Test generation quality
Copy-Paste Prompts
Use @llm-application-dev-ai-assistant to integrate LLM
Phase 7: Caching
Skills to Invoke
prompt-caching- Prompt cachingrag-engineer- RAG optimization
Actions
- Implement response caching
- Set up embedding cache
- Configure TTL
- Add cache invalidation
- Monitor hit rates
Copy-Paste Prompts
Use @prompt-caching to implement RAG caching
Phase 8: Evaluation
Skills to Invoke
llm-evaluation- LLM evaluationevaluation- AI evaluation
Actions
- Define evaluation metrics
- Create test dataset
- Measure retrieval accuracy
- Evaluate generation quality
- Iterate on improvements
Copy-Paste Prompts
Use @llm-evaluation to evaluate RAG system
RAG Architecture
User Query -> Embedding -> Vector Search -> Retrieved Docs -> LLM -> Response
| | | |
Model Vector DB Chunk Store Prompt + Context
Quality Gates
- Embedding model selected
- Vector DB configured
- Chunking implemented
- Retrieval working
- LLM integrated
- Evaluation passing
Related Workflow Bundles
ai-ml- AI/ML developmentai-agent-development- AI agentsdatabase- Vector databases
Related granular-workflow-bundle Skills
Other Claude Code skills in the same category — free to download.
ai-agent-development
AI agent development workflow for building autonomous agents, multi-agent systems, and agent orchestration with CrewAI, LangGraph, and custom agents.
api-security-testing
API security testing workflow for REST and GraphQL APIs covering authentication, authorization, rate limiting, input validation, and security best practices.
bash-scripting
Bash scripting workflow for creating production-ready shell scripts with defensive patterns, error handling, and testing.
e2e-testing
End-to-end testing workflow with Playwright for browser automation, visual regression, cross-browser testing, and CI/CD integration.
kubernetes-deployment
Kubernetes deployment workflow for container orchestration, Helm charts, service mesh, and production-ready K8s configurations.
linux-troubleshooting
Linux system troubleshooting workflow for diagnosing and resolving system issues, performance problems, and service failures.
postgresql-optimization
PostgreSQL database optimization workflow for query tuning, indexing strategies, performance analysis, and production database management.
python-fastapi-development
Python FastAPI backend development with async patterns, SQLAlchemy, Pydantic, authentication, and production API patterns.
Want a granular-workflow-bundle skill personalized to YOUR project?
This is a generic skill that works for everyone. Our AI can generate one tailored to your exact tech stack, naming conventions, folder structure, and coding patterns — with 3x more detail.