Bound `_query_chunk_search_fallback` result set with LIMIT by Copilot · Pull Request #39 · duck-lint/local-agent

Copilot · 2026-03-23T19:52:10Z

The projection-substring fallback path issued an unbounded SELECT over chunk_search, meaning short or common queries could materialize the entire table into Python before ranking — a latency and memory hazard whenever FTS5 is unavailable.

Changes

agent/corpus_db.py
- Added limit: int keyword parameter to _query_chunk_search_fallback()
- Appended LIMIT ? (guarded with max(1, int(limit))) to the fallback SQL query
- Both call sites now pass fetch_limit (max(50, safe_limit * 10)), matching the cap already applied on the FTS5 path

# Before
candidates = _query_chunk_search_fallback(conn, query_text=raw_query)

# After
candidates = _query_chunk_search_fallback(conn, query_text=raw_query, limit=fetch_limit)

tests/test_retrieval_contract.py
- Added FallbackLimitRegressionTests class with three focused tests exercising _query_chunk_search_fallback directly against an in-memory SQLite DB:
  - Asserts result count is capped at the given limit when many more rows match (prevents reintroduction of the unbounded query)
  - Asserts all rows are returned when total matches fall below the limit
  - Asserts every returned row includes a backend_score float

🔒 GitHub Advanced Security automatically protects Copilot coding agent pull requests. You can protect all pull requests by enabling Advanced Security for your repositories. Learn more about Advanced Security.

Co-authored-by: duck-lint <250741170+duck-lint@users.noreply.github.com> Agent-Logs-Url: https://github.com/duck-lint/local-agent/sessions/bed97c53-aa6e-4946-87d6-4562d2cf0d13

Copilot

Pull request overview

This PR bounds the projection-substring (“fallback”) lexical retrieval path in agent/corpus_db.py so that when FTS5 is unavailable, the system no longer risks materializing an unbounded chunk_search result set into Python before ranking.

Changes:

Pass a computed fetch_limit into _query_chunk_search_fallback() from both fallback call sites.
Add a required limit kwarg to _query_chunk_search_fallback() and apply LIMIT ? (sanitized via max(1, int(limit))) to the SQL query.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

agent/corpus_db.py

Copilot · 2026-03-23T19:59:27Z

agent/corpus_db.py

+            candidates = _query_chunk_search_fallback(conn, query_text=raw_query, limit=fetch_limit)
    else:
        actual_backend = "projection_substring"
-        candidates = _query_chunk_search_fallback(conn, query_text=raw_query)
+        candidates = _query_chunk_search_fallback(conn, query_text=raw_query, limit=fetch_limit)


The fallback path is now bounded via limit=fetch_limit, but there’s no regression test asserting that the projection-substring backend doesn’t return an unbounded candidate set when FTS5 is unavailable. Adding a focused test (e.g., patching _ensure_chunk_search_fts to false and inserting many chunk_search rows, then asserting the candidate fetch is capped) would prevent accidental reintroduction of the unbounded query.

@copilot apply changes based on this feedback

Done in commit 0c8fc41. Added FallbackLimitRegressionTests to tests/test_retrieval_contract.py with three focused tests that exercise _query_chunk_search_fallback directly against an in-memory SQLite DB: (1) result count is capped at the given limit when 200 matching rows exist, (2) all rows are returned when total matches fall below the limit, and (3) every returned row includes a backend_score float.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: duck-lint <250741170+duck-lint@users.noreply.github.com>

…orcement Co-authored-by: duck-lint <250741170+duck-lint@users.noreply.github.com> Agent-Logs-Url: https://github.com/duck-lint/local-agent/sessions/ad4ff7de-5ca2-4fd0-9a25-44ff1603d5f3

Initial plan

27b23a5

Copilot AI assigned Copilot and duck-lint Mar 23, 2026

Copilot AI mentioned this pull request Mar 23, 2026

20260323 implementation 1 complete #38

Merged

Copilot started work on behalf of duck-lint March 23, 2026 19:52 View session

fix: add LIMIT to _query_chunk_search_fallback to bound result sets

4c661ce

Co-authored-by: duck-lint <250741170+duck-lint@users.noreply.github.com> Agent-Logs-Url: https://github.com/duck-lint/local-agent/sessions/bed97c53-aa6e-4946-87d6-4562d2cf0d13

Copilot AI changed the title ~~[WIP] [WIP] Address feedback from PR #38 for implementation 1 adjustments~~ Bound _query_chunk_search_fallback result set with LIMIT Mar 23, 2026

Copilot AI requested a review from duck-lint March 23, 2026 19:55

Copilot finished work on behalf of duck-lint March 23, 2026 19:55

duck-lint marked this pull request as ready for review March 23, 2026 19:57

Copilot AI review requested due to automatic review settings March 23, 2026 19:57

Copilot started reviewing on behalf of duck-lint March 23, 2026 19:57 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

Update agent/corpus_db.py

a2a3bf6

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: duck-lint <250741170+duck-lint@users.noreply.github.com>

Copilot started work on behalf of duck-lint March 23, 2026 20:01 View session

test: add regression tests for _query_chunk_search_fallback LIMIT enf…

0c8fc41

…orcement Co-authored-by: duck-lint <250741170+duck-lint@users.noreply.github.com> Agent-Logs-Url: https://github.com/duck-lint/local-agent/sessions/ad4ff7de-5ca2-4fd0-9a25-44ff1603d5f3

Copilot finished work on behalf of duck-lint March 23, 2026 20:05

duck-lint merged commit ce09819 into 20260323_implementation-1_lockstep Mar 23, 2026
4 checks passed

duck-lint deleted the copilot/sub-pr-38 branch March 23, 2026 20:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bound `_query_chunk_search_fallback` result set with LIMIT#39

Bound `_query_chunk_search_fallback` result set with LIMIT#39
duck-lint merged 4 commits into20260323_implementation-1_lockstepfrom
copilot/sub-pr-38

Copilot AI commented Mar 23, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Mar 23, 2026

Uh oh!

duck-lint Mar 23, 2026

Uh oh!

Copilot AI Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Copilot AI commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

duck-lint Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Mar 23, 2026 •

edited

Loading