Stage 04 · Intelligence

Tool calls and retrieval graph

Files & commands

mcp/server.py · fastmcp · all tools registered
tools/search.py · SearXNG primary, Tavily fallback, raises on dual-fail
tools/fetch.py · trafilatura → clean markdown · optional vault save
tools/vault.py · read by path · write creates/updates note
tools/code.py · sandboxed Python exec
Research mode wired into chat · model receives tool descriptions
tests/test_tools/* · mocked HTTP per tool

rag/indexer.py · chunk + nomic-embed-text + Qdrant upsert
rag/retriever.py · query → search → rerank → top N
rag/graph.py · LangGraph: decompose → retrieve → synthesise
Collection vault · Obsidian notes
Collection web · fetched URLs
Collection memory · user facts
Nightly re-index cron for vault
tests/test_rag/* · indexer chunking · retriever · graph nodes

## Runner — Phase 4 (MCP)
Build the fastmcp server + 4 tools per spec. Search must call
SearXNG via the Tailscale URL set in env. Tavily is fallback.
If both fail, raise — never fall back to a stub. All tools tested
with mocked HTTP.

## Runner — Phase 5 (RAG)  ⚠ HIGHEST RISK
LangGraph retrieval graph. Start by reading the LangGraph quickstart
together. Build indexer first, prove an upsert works, THEN write the
graph. If the graph node signatures fight you, simplify — start with
linear flow before parallel retrieve.

Wildcards

+12 buffer

LangGraph

Newest library in the stack. Budget 28→40 credits. Build linear first, then parallelise.

WATCH

code_exec sandbox

Use a real sandbox (subprocess + tmpdir + timeout). Never exec() in-process.

WATCH

Embedding throughput

First vault index could be slow. Batch upserts, log progress, don't block the worker.