feat: add RAG (Retrieval-Augmented Generation) module #2
- Add complete RAG module with vector and keyword search
- Implement document indexing with configurable chunking strategies (paragraph, sentence, fixed, semantic)
- Add embedding providers: local (transformers.js), OpenAI, Voyage, Cohere, custom
- Implement hybrid search combining vector (70%) and keyword (30%)
- Add SQLite storage with FTS5 for full-text search
- Add 8 MCP tools: rag_index_document, rag_index_project, rag_search, rag_query_context, rag_list_documents, rag_delete_document, rag_get_stats, rag_configure
- Add HTTP endpoints for all RAG operations
- Add comprehensive test suite (chunking, embeddings, RAG)
- Add RAG benchmarks for performance testing
- Update README and API documentation
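For reviewers who want a concrete picture of the 70/30 hybrid search mentioned above, here is a minimal sketch of weighted score fusion. It is not the code from this PR: the `ScoredChunk` type and the `normalize` and `hybridRank` functions are hypothetical names, and min-max normalization is an assumption about how the two score scales might be made comparable. Only the 0.7 and 0.3 weights come from the description.

```typescript
// Hypothetical result shape; the PR's actual interfaces are not shown here.
interface ScoredChunk {
  chunkId: string;
  score: number; // assumed: higher means more relevant
}

// Weights from the PR description: vector 70%, keyword 30%.
const VECTOR_WEIGHT = 0.7;
const KEYWORD_WEIGHT = 0.3;

// Min-max normalize scores to [0, 1] so both retrievers share a scale.
// Note: SQLite FTS5's bm25() ranks better matches with smaller values,
// so keyword scores are assumed to be negated before reaching this point.
function normalize(results: ScoredChunk[]): Map<string, number> {
  const out = new Map<string, number>();
  if (results.length === 0) return out;
  let min = Infinity;
  let max = -Infinity;
  results.forEach((r) => {
    min = Math.min(min, r.score);
    max = Math.max(max, r.score);
  });
  const range = max - min;
  results.forEach((r) => {
    // If every score is identical, treat each hit as a full match.
    out.set(r.chunkId, range === 0 ? 1 : (r.score - min) / range);
  });
  return out;
}

// Combine both result lists; a chunk missing from one list scores 0 there.
function hybridRank(
  vectorHits: ScoredChunk[],
  keywordHits: ScoredChunk[],
  limit: number = 10
): ScoredChunk[] {
  const v = normalize(vectorHits);
  const k = normalize(keywordHits);
  const ids = Array.from(
    new Set(Array.from(v.keys()).concat(Array.from(k.keys())))
  );
  const combined = ids.map((chunkId) => ({
    chunkId,
    score:
      VECTOR_WEIGHT * (v.get(chunkId) ?? 0) +
      KEYWORD_WEIGHT * (k.get(chunkId) ?? 0),
  }));
  return combined.sort((a, b) => b.score - a.score).slice(0, limit);
}
```

A weighted sum keeps the ranking easy to reason about and to tune. Rank-based fusion is a common alternative when raw scores from the two retrievers are hard to compare.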
Welcome to CortexFlow! Thanks for your first pull request!
A maintainer will review your changes shortly. Here's what to expect:
Review Process:
- Automated checks will run (tests, linting, security)
- A maintainer will review your code
- You may receive feedback or requests for changes
- Once approved, your PR will be merged
While you wait:
- Ensure all CI checks pass
- Respond to any review comments
- Keep your branch up to date with master
Thank you for contributing to CortexFlow!
Automatically formatted by GitHub Actions: - Prettier formatting applied - ESLint auto-fixes applied
✨ Auto-formatted! I've automatically fixed formatting and lint issues in this PR. Please pull the latest changes before pushing new commits.
🚀 Benchmark Results
CortexFlow Benchmark Results
Last Run: 2025-12-28T17:01:18.330Z
Summary
Performance Results: Storage, Intelligent, Token Efficiency, Context Handoff, RAG Chunking, RAG Token Estimation, RAG Chunk Processing, Token Efficiency, Memory Usage (result tables not preserved)
🚀 Benchmark Results
Last Run: 2025-12-28T17:01:42.863Z (same sections as the first run; result tables not preserved)
🚀 Benchmark Results
Last Run: 2025-12-28T17:06:01.885Z (same sections as the first run; result tables not preserved)
This pull request has been automatically marked as stale because it has not had recent activity. If you're still working on this:
Thank you for your contributions!
This PR was closed due to inactivity. Feel free to reopen when you're ready to continue!
Description
Type of Change
Related Issues
Fixes #
Changes Made
Testing
Test Configuration
Tests Performed
- (npm test)
- (npm run build)
- (npm run lint)
Test Cases
Screenshots / Logs
Checklist
Breaking Changes
Additional Notes