feat: query-relevance ordering for memory selection

get_memories_for_context now accepts an optional query string. When
provided, candidate memories are reranked by lexical overlap with the
query (stemmed token intersection, ties broken by confidence) before
the budget walk. Without a query the order is unchanged (effectively
"by confidence desc" as before), so non-builder callers see no
behaviour change.

The fetch limit is raised from 10 to 30 so there's a real pool to
rerank. Token overlap reuses _normalize/_tokenize from reinforcement.py
so ranking and reinforcement matching share the same notion of
distinctive terms.

build_context passes the user_prompt through to both the
identity/preference and project-memory calls.

The retrieval harness regression the fix is targeting:

- p05-vendor-signal FAIL @ 1161645: "Zygo" missing from the pack even
  though an active vendor memory contained it. Root cause:
  higher-confidence p05 memories filled the 25% budget slice before the
  vendor memory ever got a chance. Query-aware ordering puts the vendor
  memory first when the query is about vendors.

New regression test test_project_memories_query_relevance_ordering
locks the behaviour in with two p05 memories and a tight budget.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
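The ordering described above can be illustrated with a minimal sketch. It is not the code in this commit: the Memory class, tokenize, rank_memories and select_within_budget are hypothetical names, the tokenizer is a plain lowercase word split standing in for _normalize/_tokenize (whose stemming is not shown here), and the budget walk is assumed to count characters and skip entries that do not fit.

```python
import re
from dataclasses import dataclass


@dataclass
class Memory:
    # Hypothetical record; the real memory model is not shown in this diff.
    content: str
    confidence: float


def tokenize(text):
    # Stand-in for _normalize/_tokenize from reinforcement.py (assumption):
    # lowercase alphanumeric words, no stemming or stop-word removal.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_memories(memories, query=None):
    # Without a query, keep the previous order: confidence descending.
    if not query:
        return sorted(memories, key=lambda m: m.confidence, reverse=True)
    query_tokens = tokenize(query)
    # With a query, sort by overlap size first, then by confidence.
    return sorted(
        memories,
        key=lambda m: (-len(query_tokens & tokenize(m.content)), -m.confidence),
    )


def select_within_budget(ranked, budget):
    # Assumed budget walk: take memories in rank order while they fit,
    # measuring cost in characters.
    picked, used = [], 0
    for memory in ranked:
        cost = len(memory.content)
        if used + cost > budget:
            continue
        picked.append(memory)
        used += cost
    return picked


memories = [
    Memory("p05 schedule review is due Friday", 0.9),
    Memory("p05 vendor shortlist includes Zygo", 0.6),
]
# Budget fits only one memory, mirroring the "tight budget" regression case.
no_query = select_within_budget(rank_memories(memories), 40)
with_query = select_within_budget(
    rank_memories(memories, "which vendor for p05?"), 40
)
```

In this sketch the higher-confidence memory wins the single slot when no query is given, while the vendor memory wins it once the query mentions vendors, which is the failure mode the p05-vendor-signal case describes.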
2026-04-11 12:47:05 -04:00
parent 4da81c9e4e
commit 5aeeb1cad1
3 changed files with 89 additions and 3 deletions
--- a/src/atocore/context/builder.py
+++ b/src/atocore/context/builder.py
@@ -115,6 +115,7 @@ def build_context(
    memory_text, memory_chars = get_memories_for_context(
        memory_types=["identity", "preference"],
        budget=memory_budget,
+        query=user_prompt,
    )

    # 2b. Get project-scoped memories (third precedence). Only
@@ -135,6 +136,7 @@ def build_context(
            budget=project_memory_budget,
            header="--- Project Memories ---",
            footer="--- End Project Memories ---",
+            query=user_prompt,
        )

    # 3. Calculate remaining budget for retrieval