Reflex

Local-first code search engine with full-text search, symbol extraction, and dependency analysis for AI coding workflows

Reflex is a code search engine designed for developers and AI coding assistants. It combines trigram indexing for full-text search with Tree-sitter parsing for symbol extraction and static analysis for dependency tracking. Unlike symbol-only tools, Reflex finds every occurrence of patterns, function calls, variable usage, comments, and more with deterministic, repeatable results.

✨ Features

🔍 Complete Coverage: Find every occurrence, not just symbol definitions
⚡ Fast Queries: Trigram indexing with memory-mapped I/O for efficient search
🎯 Symbol-Aware: Runtime tree-sitter parsing for precise symbol filtering
🖥️ Interactive Mode: Live TUI for exploring code with instant search and filters
🔄 Incremental: Only reindexes changed files (blake3 hashing)
🌍 Multi-Language: Rust, TypeScript/JavaScript, Vue, Svelte, PHP, Python, Go, Java, C, C++, C#, Ruby, Kotlin, Zig
🤖 AI Query Assistant: Natural language search with rfx ask (OpenAI, Anthropic, Groq)
📡 MCP Support: Model Context Protocol server for AI assistants
📦 Local-First: Fully offline, all data stays on your machine
🎨 Regex Support: Trigram-optimized regex search
🌳 AST Queries: Structure-aware search with Tree-sitter
🔒 Deterministic: Same query → same results (no probabilistic ranking)

🚀 Quick Start

Installation

# Via NPM
npm install -g reflex-search

# Or via cargo
cargo install reflex-search

Important Setup Notes:

Run rfx commands from the root of your project directory
Add .reflex/ to your .gitignore file to exclude the search index from version control

Basic Usage

# Index your codebase
rfx index

# Full-text search (finds all occurrences)
rfx query "extract_symbols"

# Symbol-only search (definitions only)
rfx query "extract_symbols" --symbols

# Filter by language and symbol kind
rfx query "parse" --lang rust --kind function --symbols

# Include dependency information (imports)
rfx query "MyStruct" --dependencies

# Regex search
rfx query "fn.*test" --regex

# Paths-only mode (for piping to other tools)
vim $(rfx query "TODO" --paths)

# Export as JSON for AI agents
rfx query "unwrap" --json --limit 10

🤖 AI Query Assistant

Don't want to remember search syntax? Use rfx ask to translate natural language questions into rfx query commands.

Setup

First-time setup requires configuring an AI provider (OpenAI, Anthropic, or Groq):

# Interactive configuration wizard (recommended)
rfx ask --configure

This will guide you through:

Selecting an AI provider
Entering your API key
Choosing a model (optional)

Configuration is saved to ~/.reflex/config.toml:

[semantic]
provider = "openai"  # or anthropic, groq

[credentials]
openai_api_key = "sk-..."
openai_model = "gpt-4o-mini"  # optional

Usage

There are two ways to use rfx ask:

Interactive mode

Interactive chat mode with conversation history. This mode uses --agentic and --answer under the hood.

rfx ask

CLI-only mode

One-shot, non-conversational commands that return results directly via CLI.

# Ask a question (generates and executes rfx query commands, only returns query results)
rfx ask "Find all TODOs in Rust files"

# Use a specific provider
rfx ask "Show me error handling code" --provider groq

# Agentic mode (multi-step reasoning with automatic context gathering)
rfx ask "How does authentication work?" --agentic

# Get a conversational answer based on search results
rfx ask "What does the indexer module do?" --answer

How it works:

Your natural language question is sent to an LLM
The LLM generates one or more rfx query commands
You review and confirm (or use --execute to auto-run)
Results are displayed as normal search output

Agentic mode (--agentic) enables multi-step reasoning where the LLM can:

Gather context by running multiple searches
Refine queries based on initial results
Iteratively explore the codebase
Generate comprehensive answers with --answer

📋 Command Reference

`rfx index`

Build or update the search index.

rfx index [OPTIONS]

Options:
  --force              Force full reindex (ignore incremental)
  --languages <LANGS>  Limit to specific languages (comma-separated)

Subcommands:
  status               Show background symbol indexing status
  compact              Compact cache (remove deleted files, reclaim space)

`rfx query`

Search the codebase with CLI or interactive TUI mode.

Interactive Mode (TUI):

# Launch interactive mode (no pattern required)
rfx query

# Features:
# - Live search with instant results
# - Toggle filters: symbols-only, regex, language
# - Navigate results with keyboard (j/k, arrows)
# - Open files in $EDITOR (press 'o')
# - Query history with Ctrl+P/Ctrl+N
# - Press '?' for help, 'q' to quit

CLI Mode:

Run rfx query --help for full options.

Key Options:

--symbols, -s - Symbol-only search (definitions, not usage)
--regex, -r - Treat pattern as regex
--lang <LANG> - Filter by language
--kind <KIND> - Filter by symbol kind (function, class, struct, etc.)
--dependencies - Include dependency information (supports: Rust, TypeScript, JavaScript, Python, Go, Java, C, C++, C#, PHP, Ruby, Kotlin)
--paths, -p - Return only file paths (no content)
--json - Output as JSON
--limit <N> - Limit number of results
--timeout <SECS> - Query timeout (default: 30s)

Examples:

# Find function definitions named "parse"
rfx query "parse" --symbols --kind function

# Find test functions using regex
rfx query "fn test_\w+" --regex

# Search Rust files only
rfx query "unwrap" --lang rust

# Get paths of files with TODOs
rfx query "TODO" --paths

# Include import information
rfx query "Config" --symbols --dependencies

`rfx mcp`

Start as an MCP (Model Context Protocol) server for AI coding assistants.

{
  "mcpServers": {
    "reflex": {
      "command": "rfx",
      "args": ["mcp"],
      "env": {},
      "disabled": false
    }
  }
}

Error Handling:

If any MCP tool returns an error about a missing or stale index (e.g., "Index not found. Run 'rfx index' to build the cache first."), the AI agent should:

Call index_project to rebuild the index
Wait for indexing to complete
Retry the previously failed operation

This pattern ensures that queries always run against an up-to-date index.

Available MCP Tools:

list_locations - Fast location discovery (file + line only, minimal tokens)
count_occurrences - Quick statistics (total count + file count)
search_code - Full-text or symbol search with detailed results
search_regex - Regex pattern matching
search_ast - AST pattern matching (structure-aware, slow)
index_project - Trigger reindexing
get_dependencies - Get all dependencies of a specific file
get_dependents - Get all files that depend on a file (reverse lookup)
get_transitive_deps - Get transitive dependencies up to a specified depth
find_hotspots - Find most-imported files (with pagination)
find_circular - Detect circular dependencies (with pagination)
find_unused - Find files with no incoming dependencies (with pagination)
find_islands - Find disconnected components (with pagination)
analyze_summary - Get dependency analysis summary (counts only)

`rfx analyze`

Analyze codebase structure and dependencies. By default shows a summary; use specific flags for detailed results.

Subcommands:

--circular - Detect circular dependencies (A → B → C → A)
--hotspots - Find most-imported files
--unused - Find files with no incoming dependencies
--islands - Find disconnected components

Pagination (default: 200 results per page):

Use --limit N to specify results per page
Use --offset N to skip first N results
Use --all to return unlimited results

Examples:

# Show summary of all analyses
rfx analyze

# Find circular dependencies
rfx analyze --circular

# Find hotspots (most-imported files)
rfx analyze --hotspots --min-dependents 5

# Find unused files
rfx analyze --unused

# Find disconnected components (islands)
rfx analyze --islands --min-island-size 3

# Get JSON summary of all analyses
rfx analyze --json

# Get pretty-printed JSON summary
rfx analyze --json --pretty

# Paginate results
rfx analyze --hotspots --limit 50 --offset 0  # First 50
rfx analyze --hotspots --limit 50 --offset 50 # Next 50

# Export as JSON with pagination metadata
rfx analyze --circular --json

JSON Output Format (specific analyses with pagination):

{
  "pagination": {
    "total": 347,
    "count": 200,
    "offset": 0,
    "limit": 200,
    "has_more": true
  },
  "results": [...]
}

Summary JSON Output Format (bare rfx analyze --json):

{
  "circular_dependencies": 17,
  "hotspots": 10,
  "unused_files": 82,
  "islands": 81,
  "min_dependents": 2
}

`rfx deps`

Analyze dependencies for a specific file. Shows what a file imports (dependencies) or what imports it (dependents).

Key Options:

--reverse - Show files that depend on this file (reverse lookup)
--depth N - Traverse N levels deep for transitive dependencies (default: 1)
--format - Output format: tree, table, json (default: tree)
--json - Output as JSON
--pretty - Pretty-print JSON output

Examples:

# Show direct dependencies
rfx deps src/main.rs

# Show files that import this file (reverse lookup)
rfx deps src/config.rs --reverse

# Show transitive dependencies (depth 3)
rfx deps src/api.rs --depth 3

# JSON output
rfx deps src/main.rs --json

# Pretty-printed JSON
rfx deps src/main.rs --json --pretty

# Table format
rfx deps src/main.rs --format table

Supported Languages: Rust, TypeScript, JavaScript, Python, Go, Java, C, C++, C#, PHP, Ruby, Kotlin

Note: Only static imports (string literals) are tracked. Dynamic imports are filtered by design.

`rfx context`

Generate codebase context for AI prompts. Useful with rfx ask --additional-context.

Key Options:

--structure - Show directory structure
--file-types - Show file type distribution
--project-type - Detect project type (CLI/library/webapp/monorepo)
--framework - Detect frameworks and conventions
--entry-points - Show entry point files
--test-layout - Show test organization pattern
--config-files - List important configuration files
--path <PATH> - Focus on specific directory
--depth <N> - Tree depth for structure (default: 1)

By default (no flags), all context types are shown. Use individual flags to show specific types only.

Examples:

# Full context (all types - default behavior)
rfx context

# Full context for monorepo subdirectory
rfx context --path services/backend

# Specific context types only
rfx context --framework --entry-points

# Use with semantic queries
rfx ask "find auth code" --additional-context "$(rfx context --framework)"

Other Commands

rfx stats - Display index statistics
rfx clear - Clear the search index
rfx list-files - List all indexed files
rfx watch - Watch for file changes and auto-reindex

Run rfx <command> --help for detailed options.

🌳 AST Pattern Matching

Reflex supports structure-aware code search using Tree-sitter AST queries.

⚠️ WARNING: AST queries are SLOW and scan the entire codebase. Use --symbols instead for 95% of cases (much faster).

When to use AST queries:

You need to match code structure, not just text
--symbols search is insufficient for your use case
You have a very specific structural pattern

Basic usage:

rfx query <PATTERN> --ast <AST_PATTERN> --lang <LANGUAGE>

# Example: Find all Rust functions
rfx query "fn" --ast "(function_item) @fn" --lang rust

# Example: Find all TypeScript classes
rfx query "class" --ast "(class_declaration) @class" --lang typescript

Supported languages: Rust, TypeScript, JavaScript, Python, Go, Java, C, C++, C#, PHP, Ruby, Kotlin, Zig

For detailed AST query syntax and examples, see the Tree-sitter documentation.

🌐 Supported Languages/Dialects

Language	Extensions	Symbol Extraction
Rust	`.rs`	Functions, structs, enums, traits, impls, modules, methods
TypeScript	`.ts`, `.tsx`, `.mts`, `.cts`	Functions, classes, interfaces, types, enums, React components
JavaScript	`.js`, `.jsx`, `.mjs`, `.cjs`	Functions, classes, constants, methods, React components
Vue	`.vue`	Functions, constants, methods from `<script>` blocks
Svelte	`.svelte`	Functions, variables, reactive declarations
PHP	`.php`	Functions, classes, interfaces, traits, methods, namespaces, enums
Python	`.py`	Functions, classes, methods, decorators, lambdas
Go	`.go`	Functions, types, interfaces, methods, constants
Java	`.java`	Classes, interfaces, enums, methods, fields, constructors
C	`.c`, `.h`	Functions, structs, enums, unions, typedefs
C++	`.cpp`, `.hpp`, `.cxx`	Functions, classes, namespaces, templates, methods
C#	`.cs`	Classes, interfaces, structs, enums, methods, properties
Ruby	`.rb`, `.rake`, `.gemspec`	Classes, modules, methods, constants, variables
Kotlin	`.kt`, `.kts`	Classes, functions, interfaces, objects, properties
Zig	`.zig`	Functions, structs, enums, constants, variables

Note: Full-text search works on all file types regardless of parser support. Symbol filtering requires a language parser.

🏗️ Architecture

Reflex uses a trigram-based inverted index combined with runtime symbol detection:

Indexing Phase

Extract trigrams (3-character substrings) from all files
Build inverted index: trigram → [file_id, line_no]
Store full file contents in memory-mapped content.bin
Start background symbol indexing (caches symbols for faster queries)

Query Phase

Full-text queries: Intersect trigram posting lists → verify matches
Symbol queries: Trigrams narrow to ~10-100 candidates → parse with tree-sitter → filter symbols
Memory-mapped I/O for instant cache access

Cache Structure (`.reflex/`)

.reflex/
  meta.db          # SQLite: file metadata, stats, config, hashes
  trigrams.bin     # Inverted index (memory-mapped)
  content.bin      # Full file contents (memory-mapped)
  config.toml      # Index settings
  indexing.status  # Background symbol indexer status

⚡ Performance

Reflex is designed for speed at every level:

Query Performance:

Full-text & Regex: Efficient queries via trigram indexing
Symbol queries: Slower due to runtime tree-sitter parsing, but still efficient
Cached queries: Repeated searches benefit from memory-mapped cache
Scales well from small projects to large codebases (10k+ files)

Indexing Performance:

Initial indexing: Parallel processing using 80% of CPU cores
Incremental updates: Only reindexes changed files via blake3 hashing
Memory-mapped I/O: Zero-copy access for cache reads

🔧 Configuration

Reflex respects .gitignore files automatically. Additional configuration via .reflex/config.toml:

[index]
languages = []  # Empty = all supported languages
max_file_size = 10485760  # 10 MB
follow_symlinks = false

[search]
default_limit = 100

[performance]
parallel_threads = 0  # 0 = auto (80% of available cores)

🤖 AI Integration

Reflex provides clean JSON output for AI coding assistants and automation:

rfx query "parse_tree" --json --symbols

Output includes file paths, line numbers, symbol types, and code previews with pagination metadata.

🔍 Use Cases

Code Navigation: Find all usages of functions, classes, and variables
Refactoring: Identify all call sites before making changes
AI Assistants: Retrieve relevant code snippets and context for LLMs
Debugging: Locate where variables and functions are used
Documentation: Find examples of API usage across the codebase
Security: Search for potential vulnerabilities or anti-patterns

🧪 Testing

Reflex has comprehensive test coverage including core modules, real-world code samples across all supported languages, and end-to-end workflows.

cargo test                    # Run all tests
cargo test -- --nocapture     # Run with output
cargo test indexer::tests     # Run specific module

🤝 Contributing

Contributions welcome! Reflex is built to be:

Fast: Efficient search using trigram indexing and memory-mapped I/O
Accurate: Complete coverage with deterministic results
Extensible: Easy to add new language parsers

📄 License

MIT License - see LICENSE for details.

🙏 Acknowledgments

Built with:

tree-sitter - Incremental parsing
rkyv - Zero-copy deserialization
memmap2 - Memory-mapped I/O
rusqlite - SQLite bindings
blake3 - Fast hashing
ignore - gitignore support

Inspired by:

Zoekt - Trigram-based code search
Sourcegraph - Code search for teams
ripgrep - Fast text search

Made with ❤️ for developers and AI coding assistants

Name		Name	Last commit message	Last commit date
Latest commit History 276 Commits
.bruno		.bruno
.context		.context
.github/workflows		.github/workflows
aur		aur
distribution-templates/homebrew-reflex		distribution-templates/homebrew-reflex
docs		docs
examples		examples
src		src
tests		tests
.gitignore		.gitignore
BATCH_OPTIMIZATION.md		BATCH_OPTIMIZATION.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.toml		Cargo.toml
README.md		README.md
RELEASE.md		RELEASE.md
SCHEMA_MIGRATION_STATUS.md		SCHEMA_MIGRATION_STATUS.md
build.rs		build.rs
cog.toml		cog.toml
dist-workspace.toml		dist-workspace.toml

reflex-search/reflex

Folders and files

Latest commit

History

Repository files navigation

Reflex

✨ Features

🚀 Quick Start

Installation

Basic Usage

🤖 AI Query Assistant

Setup

Usage

📋 Command Reference

rfx index

rfx query

rfx mcp

rfx analyze

rfx deps

rfx context

Other Commands

🌳 AST Pattern Matching

🌐 Supported Languages/Dialects

🏗️ Architecture

Indexing Phase

Query Phase

Cache Structure (.reflex/)

⚡ Performance

🔧 Configuration

🤖 AI Integration

🔍 Use Cases

🧪 Testing

🤝 Contributing

📄 License

🙏 Acknowledgments

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 34

Contributors 3

Uh oh!

Languages

`rfx index`

`rfx query`

`rfx mcp`

`rfx analyze`

`rfx deps`

`rfx context`

Cache Structure (`.reflex/`)