feat(25): E2E Core Pipeline Tests — 10 tests across 4 files#14
Merged
RichardHightower merged 10 commits intomainfrom Feb 11, 2026
Merged
feat(25): E2E Core Pipeline Tests — 10 tests across 4 files#14RichardHightower merged 10 commits intomainfrom
RichardHightower merged 10 commits intomainfrom
Conversation
- Add e2e-tests to workspace members - Create Cargo.toml with all workspace dependencies - Implement TestHarness with temp dir, storage, and index paths - Add helpers: ingest_events, create_test_events, build_toc_segment - All helpers pub for reuse by later E2E test plans Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- test_full_pipeline_ingest_toc_grip_route_query: verifies ingest -> TOC segment build -> grip extraction -> BM25 index -> route_query - test_grip_provenance_expand_with_context: verifies grip expansion returns excerpt events with surrounding context - Both tests use pretty_assertions and structural + content assertions - Add tonic dev-dependency for Request type in tests Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- SUMMARY.md with task commits, decisions, and deviation documentation - STATE.md updated to Phase 25, Plan 1 of 3 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- test_bm25_ingest_index_search_ranked: 3 topic segments, verifies ranking - test_bm25_search_filters_by_doc_type: TocNode/Grip filter isolation - test_bm25_search_with_agent_attribution: agent field propagation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Vector search returns semantically similar results ordered by score - Group A (Rust) ranks first for Rust queries, Group B (cooking) for pasta - Agent attribution propagates through vector search results - OnceLock shared embedder prevents concurrent model loading race condition - Tests marked #[ignore] due to ~80MB model download requirement Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- get_top_topics returns topics ordered by importance score - Topic keyword search finds matching topics by label and keywords - Topic graph status correctly reports availability and count - All tests use pretty_assertions and direct handler testing Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
- Phase 25 fully complete (3/3 plans done) - SUMMARY.md with self-check passed - STATE.md updated with position, decisions, metrics Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
5/5 success criteria verified: full pipeline (E2E-01), BM25 search (E2E-02), vector search (E2E-03), topic graph (E2E-04), grip provenance (E2E-07). 10 E2E tests across 4 test files. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
crates/e2e-tests/crate with shared TestHarness (temp RocksDB, mock summarizer, event builders)Test plan
cargo test -p e2e-tests --test pipeline_test— 2 passed (5.82s)cargo test -p e2e-tests --test bm25_teleport_test— 3 passed (8.73s)cargo test -p e2e-tests --test topic_graph_test— 3 passed (0.05s)cargo test -p e2e-tests --test vector_search_test— 2 exist (marked #[ignore] due to ~80MB model download)cargo clippy -p e2e-tests --all-targets -- -D warnings— clean🤖 Generated with Claude Code