ATOCore

Files

Anto01 1a2ee5e07f feat: Day 3 — auto-triage via LLM second pass

scripts/auto_triage.py: fetches candidate memories, asks a triage
model (claude -p, default sonnet) to classify each as promote /
reject / needs_human, and executes the verdict via the API.

Trust model:
- Auto-promote: model says promote AND confidence >= 0.8 AND
  dedup-checked against existing active memories for the project
- Auto-reject: model says reject
- needs_human: everything else (including promote verdicts below 0.8
  confidence or failing the dedup check) stays in queue for manual
  review
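
The trust model above can be sketched as a small decision function. This is
an illustrative sketch, not the code in scripts/auto_triage.py: the names
Verdict, decide, and is_duplicate are hypothetical, and it assumes that
"dedup-checked" means the candidate was found NOT to duplicate an existing
active memory. Only the 0.8 threshold and the three outcomes come from the
commit message.

```python
from dataclasses import dataclass

# Threshold taken from the commit message; everything else is illustrative.
CONFIDENCE_THRESHOLD = 0.8


@dataclass
class Verdict:
    """Hypothetical shape of the triage model's answer for one candidate."""
    label: str         # "promote", "reject", or "needs_human"
    confidence: float  # model-reported confidence in [0, 1]


def decide(verdict: Verdict, is_duplicate: bool) -> str:
    """Map a model verdict to the action taken on the candidate.

    Assumption: is_duplicate is the result of a separate dedup check
    against the project's active memories.
    """
    if verdict.label == "reject":
        return "reject"  # auto-reject needs no confidence threshold
    if (
        verdict.label == "promote"
        and verdict.confidence >= CONFIDENCE_THRESHOLD
        and not is_duplicate
    ):
        return "promote"
    # Low-confidence promotes, duplicates, and explicit needs_human
    # verdicts all stay in the queue for manual review.
    return "needs_human"
```

Note the asymmetry in this reading: a promote must clear two extra checks,
while a reject is executed on the model's word alone.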

The triage model receives both the candidate content AND a summary
of existing active memories for the same project, so it can detect
duplicates and near-duplicates. The system prompt explicitly lists
the rejection categories learned from the first two manual triage
passes (stale snapshots, implementation details,
planned-not-implemented, process rules that belong in the ledger,
not in memory).
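
One way to assemble the two inputs described above is sketched here. The
prompt wording, the function names, and the section labels are all
hypothetical; only the four rejection categories and the idea of sending
the candidate together with a summary of existing active memories come
from the commit message.

```python
# Rejection categories as listed in the commit message.
REJECTION_CATEGORIES = [
    "stale snapshots",
    "implementation details",
    "planned-not-implemented",
    "process rules that belong in the ledger, not in memory",
]


def build_system_prompt(categories=REJECTION_CATEGORIES) -> str:
    """Build a system prompt that names every rejection category.

    The wording is an assumption, not the real prompt.
    """
    lines = [
        "Classify the candidate memory as promote, reject, or needs_human.",
        "Reject candidates that fall into any of these categories:",
    ]
    lines += [f"- {category}" for category in categories]
    return "\n".join(lines)


def build_user_message(candidate: str, active_memories: list[str]) -> str:
    """Pair the candidate with the project's active memories.

    Showing both lets the model flag duplicates and near-duplicates.
    """
    existing = "\n".join(f"- {m}" for m in active_memories) or "(none)"
    return (
        f"EXISTING ACTIVE MEMORIES:\n{existing}\n\n"
        f"CANDIDATE:\n{candidate}"
    )
```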

deploy/dalidou/batch-extract.sh now runs extraction (Step A) then
auto-triage (Step B) in sequence. The nightly cron at 03:00 UTC
will run the full pipeline: backup → cleanup → rsync → extract →
triage. Only needs_human candidates reach the human.

Supports --dry-run to preview verdicts without executing them.
Supports --model override for multi-model triage (e.g. opus for
higher-quality review, or a future Gemini/Ollama backend).
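
For illustration, the two flags could be declared with argparse as below.
This is a minimal sketch, not the script's actual argument handling: the
parse_args helper and the help strings are hypothetical, and the "sonnet"
default is taken from the description of the triage model above.

```python
import argparse


def parse_args(argv=None) -> argparse.Namespace:
    """Parse the two flags named in the commit message (sketch only)."""
    parser = argparse.ArgumentParser(
        description="Auto-triage candidate memories (illustrative sketch)."
    )
    parser.add_argument(
        "--dry-run",
        action="store_true",
        help="print verdicts without executing them via the API",
    )
    parser.add_argument(
        "--model",
        default="sonnet",  # default per the commit message
        help="triage model to use, e.g. opus",
    )
    return parser.parse_args(argv)
```

With this layout, `parse_args(["--dry-run", "--model", "opus"])` yields
`dry_run=True` and `model="opus"`; argparse converts the hyphen in
`--dry-run` to an underscore in the attribute name.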

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-12 12:30:57 -04:00

candidate_queue_2026-04-12.json

feat: Day 3 — auto-triage via LLM second pass

2026-04-12 12:30:57 -04:00

candidate_queue_2026-04-12.txt

feat: Day 3 — auto-triage via LLM second pass

2026-04-12 12:30:57 -04:00

candidate_queue_snapshot.jsonl

feat: first live triage — 16 promoted, 35 rejected from LLM extraction

2026-04-12 06:06:02 -04:00

extractor_labels_2026-04-11.json

feat(eval-loop): Day 1+2 — labeled extractor corpus + baseline scorecard