feat(retrieval): persist explicit chunk project ids

2026-04-24 11:02:30 -04:00
parent f44a211497
commit c03022d864
12 changed files with 332 additions and 24 deletions
--- a/docs/master-plan-status.md
+++ b/docs/master-plan-status.md
@@ -135,7 +135,7 @@ deferred from the shared client until their workflows are exercised.

 - canonical AtoCore runtime on Dalidou (`2b86543`, deploy.sh verified)
 - 33,253 vectors across 6 registered projects
-- 950 captured interactions as of the 2026-04-24 live dashboard; refresh
+- 951 captured interactions as of the 2026-04-24 live dashboard; refresh
  exact live counts with
  `python scripts/live_status.py`
 - 6 registered projects:
@@ -150,10 +150,9 @@ deferred from the shared client until their workflows are exercised.
  dashboard
 - context pack assembly with 4 tiers: Trusted Project State > identity/preference > project memories > retrieved chunks
 - query-relevance memory ranking with overlap-density scoring
-- retrieval eval harness: 20 fixtures; current live has 18 pass, 1 known
-  content gap, and 1 blocking cross-project bleed guard targeted by the
-  current retrieval-scoping branch
-- 553 tests passing on the audit-improvements branch
+- retrieval eval harness: 20 fixtures; current live has 19 pass, 1 known
+  content gap, and 0 blocking failures after the audit-improvements deploy
+- 556 tests passing on the active `codex/project-id-metadata-retrieval` branch
 - nightly pipeline: backup → cleanup → rsync → OpenClaw import → vault refresh → extract → triage → **auto-promote/expire** → weekly synth/lint → **retrieval harness** → **pipeline summary to project state**
 - Phase 10 operational: reinforcement-based auto-promotion (ref_count ≥ 3, confidence ≥ 0.7) + stale candidate expiry (14 days unreinforced)
 - pipeline health visible in dashboard: interaction totals by client, pipeline last_run, harness results, triage stats