"""Evaluation script for retrieval quality."""

import json
import sys
from pathlib import Path

from knowcode.chunk_repository import InMemoryChunkRepository
from knowcode.vector_store import VectorStore
from knowcode.hybrid_index import HybridIndex
from knowcode.embedding import OpenAIEmbeddingProvider
from knowcode.models import EmbeddingConfig, CodeChunk


def evaluate(ground_truth_path: Path, index_path: Path) -> dict:
    """Evaluate retrieval quality against ground truth."""
    if not ground_truth_path.exists():
        return {"error": "Ground truth file not found"}

    with open(ground_truth_path) as f:
        ground_truth = json.load(f)

    # Load the index components. index_path is assumed to be the directory read
    # by Indexer.load, i.e. it contains chunks.json plus a "vectors" subdirectory.
    repo = InMemoryChunkRepository()

    chunks_file = index_path / "chunks.json"
    if chunks_file.exists():
        with open(chunks_file) as f:
            data = json.load(f)
        for c_data in data["chunks"]:
            repo.add(CodeChunk(**c_data))

    vs = VectorStore(dimension=1536, index_path=index_path / "vectors")

    # Queries must be embedded with the same provider (and dimension) that was
    # used at indexing time; here we assume OpenAI and skip evaluation when no
    # API key is available.
    try:
        provider = OpenAIEmbeddingProvider(EmbeddingConfig())
    except Exception:
        print("Skipping evaluation: no OpenAI API key found")
        return {"error": "No embedding provider available"}

    hybrid = HybridIndex(repo, vs)

    # Metrics
    hits_at_5 = 0
    hits_at_10 = 0
    mrr_sum = 0.0
    evaluated = 0  # ground-truth entries actually scored
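
    # Metric definitions (standard IR measures):
    #   hit rate @ k: fraction of queries with at least one expected chunk in the top k
    #   MRR: mean over queries of 1 / rank of the first expected chunk (0 if none is found)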

    for item in ground_truth:
        query = item.get("query")
        expected_ids = set(item.get("expected_ids", []))

        if not query or not expected_ids:
            continue
        evaluated += 1

        q_vec = provider.embed_single(query)
        # Search the hybrid index directly (bypassing the SearchEngine wrapper)
        # so that raw retrieval quality is measured.
        results = hybrid.search(query, q_vec, limit=10)
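        # HybridIndex.search is assumed to return (chunk, score) pairs in
        # decreasing order of relevance; only the chunk ids are needed here.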

        found_ids = [c.id for c, _ in results]

        # Hit rate @ k: at least one expected chunk appears in the top k results.
        if any(fid in expected_ids for fid in found_ids[:5]):
            hits_at_5 += 1
        if any(fid in expected_ids for fid in found_ids[:10]):
            hits_at_10 += 1

        # MRR
        rank = 0
        for i, fid in enumerate(found_ids):
            if fid in expected_ids:
                rank = i + 1
                break
        if rank > 0:
            mrr_sum += 1.0 / rank

    return {
        "hit_rate_at_5": hits_at_5 / evaluated if evaluated else 0,
        "hit_rate_at_10": hits_at_10 / evaluated if evaluated else 0,
        "mrr": mrr_sum / evaluated if evaluated else 0,
    }


if __name__ == "__main__":
    if len(sys.argv) < 3:
        print("Usage: python evaluate.py <ground_truth.json> <index_dir>")
        sys.exit(1)

    gt_path = Path(sys.argv[1])
    idx_path = Path(sys.argv[2])
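    # The printed dict holds hit_rate_at_5, hit_rate_at_10 and mrr, each averaged
    # over the ground-truth queries that were actually evaluated.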
    print(evaluate(gt_path, idx_path))