Anto01 3f23ca1bc6 feat: signal-aggressive extraction + auto vault refresh in nightly cron
Extraction prompt rewritten for signal-aggressive mode. The old prompt
rewarded silence ("durable insight only, empty is correct"), which
caused quiet failures: real project signal (Schott quotes arriving,
stakeholder events, blockers) was dropped as "not architectural enough".

New prompt explicitly lists what to emit:
1. Project activity (mentions with context — quote received, blocker,
   action item)
2. Decisions and choices (architectural commitments, vendor selection)
3. Durable engineering insight (earned knowledge, generalizable)
4. Stakeholder and vendor events (emails sent, meetings scheduled)
5. Preferences and adaptations (how Antoine works)

Philosophy shift: "capture more signal, let triage filter noise"
replaces "extract only durable architectural facts". Auto-triage
already rejects noise well, so moving the filter downstream gives us
visibility into weak signals without polluting active memory.

Added 'episodic' to the candidate types list to support stakeholder
events that are anchored to a specific point in time.
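
For illustration, a candidate emitted under the new prompt might look like
the dict below; the field names and values are hypothetical, not the actual
extractor schema.

# Hypothetical candidate; field names are illustrative, not the real schema.
candidate = {
    "type": "episodic",                      # new candidate type for time-stamped events
    "content": "Schott quote received; follow-up needed on lead time.",
    "source": "daily note",
    "category": "project activity",          # one of the five emit categories above
}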

LLM_EXTRACTOR_VERSION bumped to llm-0.4.0.

Also: cron-backup.sh now runs POST /ingest/sources before extraction
so new PKM files flow in automatically. Fail-open, non-blocking.
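
The refresh hook itself is a shell step in cron-backup.sh; the sketch below
restates the fail-open pattern in Python purely for illustration (only the
/ingest/sources path comes from this change, everything else is assumed).

# Illustration only: the real trigger is a shell step in cron-backup.sh.
import requests

def refresh_sources(base_url: str = "http://localhost:8100") -> None:
    """Ask AtoCore to re-scan PKM sources before extraction; never block the run."""
    try:
        requests.post(f"{base_url}/ingest/sources", timeout=30)
    except requests.RequestException:
        pass  # fail-open: extraction continues even if the refresh call fails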

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-14 10:24:50 -04:00

AtoCore

Personal context engine that enriches LLM interactions with durable memory, structured context, and project knowledge.

Quick Start

pip install -e .
uvicorn src.atocore.main:app --port 8100

Usage

# Ingest markdown files
curl -X POST http://localhost:8100/ingest \
  -H "Content-Type: application/json" \
  -d '{"path": "/path/to/notes"}'

# Build enriched context for a prompt
curl -X POST http://localhost:8100/context/build \
  -H "Content-Type: application/json" \
  -d '{"prompt": "What is the project status?", "project": "myproject"}'

# CLI ingestion
python scripts/ingest_folder.py --path /path/to/notes

# Live operator client
python scripts/atocore_client.py health
python scripts/atocore_client.py audit-query "gigabit" 5

API Endpoints

Method  Path             Description
POST    /ingest          Ingest markdown file or folder
POST    /query           Retrieve relevant chunks
POST    /context/build   Build full context pack
GET     /health          Health check
GET     /debug/context   Inspect last context pack
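
As a quick sanity check of the two POST endpoints above, a minimal Python
client might look like this; the response shape is not documented here, so
the script just prints whatever comes back.

import requests

BASE = "http://localhost:8100"

# Ingest a folder of markdown notes, then build a context pack scoped to one project.
requests.post(f"{BASE}/ingest", json={"path": "/path/to/notes"}, timeout=120).raise_for_status()

resp = requests.post(
    f"{BASE}/context/build",
    json={"prompt": "What is the project status?", "project": "myproject"},
    timeout=60,
)
resp.raise_for_status()
print(resp.json())  # response shape is an assumption; compare with GET /debug/context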

Architecture

FastAPI (port 8100)
  |- Ingestion: markdown -> parse -> chunk -> embed -> store
  |- Retrieval: query -> embed -> vector search -> rank
  |- Context Builder: retrieve -> boost -> budget -> format
  |- SQLite (documents, chunks, memories, projects, interactions)
  '- ChromaDB (vector embeddings)
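
The retrieve -> boost -> budget -> format stage can be pictured with the toy
function below. It uses placeholder logic (pre-scored chunks passed in
directly) rather than the real embedding and vector-search layers, so treat
it as a sketch, not the actual implementation.

# Toy sketch of the rank/budget/format stage; not the code in src/atocore.
from typing import List, Tuple

def build_pack(prompt: str, scored_chunks: List[Tuple[float, str]], budget: int = 3000) -> str:
    """Rank pre-scored chunks, keep as many as fit the character budget, format a pack."""
    ranked = sorted(scored_chunks, key=lambda c: c[0], reverse=True)   # rank/boost step
    kept, used = [], 0
    for _score, text in ranked:                                        # budget step
        if used + len(text) > budget:
            break
        kept.append(text)
        used += len(text)
    return "\n\n".join(["## Context", *kept, "## Prompt", prompt])     # format step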

Configuration

Set via environment variables (prefix ATOCORE_):

Variable                  Default                                 Description
ATOCORE_DEBUG             false                                   Enable debug logging
ATOCORE_PORT              8100                                    Server port
ATOCORE_CHUNK_MAX_SIZE    800                                     Max chunk size (chars)
ATOCORE_CONTEXT_BUDGET    3000                                    Context pack budget (chars)
ATOCORE_EMBEDDING_MODEL   paraphrase-multilingual-MiniLM-L12-v2   Embedding model
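
The ATOCORE_ prefix suggests a settings class along these lines. This is a
sketch assuming pydantic-settings (common in FastAPI projects), not
necessarily the project's actual config module.

# Sketch only; assumes pydantic-settings, which the project may or may not use.
from pydantic_settings import BaseSettings, SettingsConfigDict

class AtoCoreSettings(BaseSettings):
    model_config = SettingsConfigDict(env_prefix="ATOCORE_")

    debug: bool = False
    port: int = 8100
    chunk_max_size: int = 800
    context_budget: int = 3000
    embedding_model: str = "paraphrase-multilingual-MiniLM-L12-v2"

settings = AtoCoreSettings()   # e.g. ATOCORE_PORT=9000 overrides the default port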

Testing

pip install -e ".[dev]"
pytest

Operations

  • scripts/atocore_client.py provides a live API client for project refresh, project-state inspection, and retrieval-quality audits.
  • docs/operations.md captures the current operational priority order: retrieval quality, Wave 2 trusted-operational ingestion, AtoDrive scoping, and restore validation.

Architecture Notes

Implementation-facing architecture notes live under docs/architecture/.

Current additions:

  • docs/architecture/engineering-knowledge-hybrid-architecture.md — 5-layer hybrid model
  • docs/architecture/engineering-ontology-v1.md — V1 object and relationship inventory
  • docs/architecture/engineering-query-catalog.md — 20 v1-required queries
  • docs/architecture/memory-vs-entities.md — canonical home split
  • docs/architecture/promotion-rules.md — Layer 0 to Layer 2 pipeline
  • docs/architecture/conflict-model.md — contradictory facts detection and resolution