- http_post: POST to external URLs with per-user URL prefix allowlist
(home/{user}/http_allowlist.json); admin-only, confirm-required
- nc_talk_history: read recent NC Talk messages via Basic Auth (requires
nc_username + nc_app_password in channels.json under nextcloud)
- openai_orchestrator: _chat_with_retry() wraps both API calls with
exponential backoff (3 attempts, 1s/2s) on connection errors and
transient status codes (429, 500, 502, 503, 504)
- Docs updated: CLAUDE.md, HELP.md, TODO, MASTER, ROADMAP (50 tools)
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
78 lines
4.0 KiB
Markdown
78 lines
4.0 KiB
Markdown
# Cortex — Roadmap
|
|
|
|
> Phases and priorities. For active tasks see `TODO__Agents.md`.
|
|
> Last updated: 2026-05-09
|
|
|
|
---
|
|
|
|
## Phase 0 — Foundation ✅
|
|
- Syncthing fleet sync (`agents_sync/`) operational
|
|
- MCP tools (`ae_*`) available in all Claude Code sessions
|
|
- Fleet agents running independently on each machine
|
|
|
|
## Phase 1 — Dispatcher Core ✅
|
|
- FastAPI service with streaming SSE responses
|
|
- Claude CLI and Gemini CLI subprocess backends
|
|
- Session context management (rolling window, file persistence)
|
|
- Nextcloud Talk bot (HMAC-signed webhook)
|
|
- Memory distiller (APScheduler — short/mid/long cycles)
|
|
- Local web UI (single-page, mobile-responsive)
|
|
- Auth status monitoring (`/auth/status`, UI banner)
|
|
- Session logging and file browser
|
|
|
|
## Phase 2 — Identity & Multi-User ✅
|
|
- Inara persona formalized (`IDENTITY.md`, `SOUL.md`, `PROTOCOLS.md`, context tiers)
|
|
- Two-level user/persona layout (`home/{user}/persona/{name}/`)
|
|
- Session auth: bcrypt passwords, JWT cookies, invite tokens, Google OAuth
|
|
- Multi-user live: Scott, Holly, Brian
|
|
- Per-user channel config (`channels.json`)
|
|
- Per-user Gemini API key (settings UI)
|
|
- Help & Reference system (shared base + per-persona additions)
|
|
- Lucide icons, persona picker page, session persistence across navigation
|
|
|
|
## Phase 3 — Intelligence Layer (In Progress)
|
|
- ✅ Gemini API orchestrator (tool loop → Claude responder)
|
|
- ✅ Tool suite: web search, AE Journal read/write, tasks, scratch, reminders, cron, system, email_send (+ per-user allowlist), nc_talk_send
|
|
- ✅ Agent mode in UI (async job, poll for result); role-based tool access + confirmation gate
|
|
- ✅ Local LLM backend (Open WebUI/Ollama, per-user multi-model config); inline model edit in registry UI
|
|
- ✅ Proactive cron (`message` / `brief` job types → NC Talk)
|
|
- ✅ Session search (full-text across past session logs)
|
|
- ✅ Distill notifications (NC Talk after mid/long runs)
|
|
- ✅ Local backend for distillation (DISTILL_BACKEND_MID/LONG in .env)
|
|
- ✅ Local orchestrator — OpenAI-compatible ReAct loop; fires when orchestrator role → local model
|
|
- ✅ Web push notifications — VAPID; `web_push` tool; PWA-installable; subscribe via ☰ menu
|
|
- ✅ Proactive notifications — daily reminder check (09:00); `notify()` routes to any configured channel; dedicated settings page
|
|
- ✅ Sub-agent spawning — `spawn_agent` tool; per-host concurrency limit; Gemini API + local OpenAI backends
|
|
- ✅ Web content extraction — `web_read` via trafilatura; strips ads/nav/boilerplate; 128K cap
|
|
- ✅ Session log reader — `session_read(date)` tool; complements `session_search`
|
|
- ✅ `http_post` — POST to external URLs with per-user URL prefix allowlist; admin-only, confirm-required
|
|
- ✅ `nc_talk_history` — read recent NC Talk messages; requires nc_username + nc_app_password in channels.json
|
|
- ✅ Local orchestrator retry — exponential backoff on 429/5xx/connection errors (3 attempts)
|
|
- [ ] Knowledge import — markdown → AE Journals (import script)
|
|
- [ ] Dev agent pipeline — specialist agents + supervisor + approval gate
|
|
- [ ] Gitea webhook integration + Actions CI
|
|
|
|
## Phase 4 — Channel Expansion
|
|
- ✅ Web UI
|
|
- ✅ Nextcloud Talk
|
|
- ✅ Google Chat
|
|
- [ ] WhatsApp (Business API or bridge — investigating)
|
|
- [ ] Webhook triggers from Aether platform events
|
|
|
|
## Phase 5 — Routing Intelligence & Scale
|
|
- [ ] Intelligent model routing (by task type, privacy, context length)
|
|
- [ ] Agent-to-agent task delegation across fleet
|
|
|
|
## Phase 6 — Infrastructure
|
|
- [ ] Server DMZ finalized
|
|
- [ ] WireGuard for all Cortex-accessing devices
|
|
- [ ] Camera/IoT VLAN segmentation
|
|
|
|
---
|
|
|
|
## Deferred / Watching
|
|
- **Speculative decoding** — llama.cpp supports it (E4B + E2B draft ≈ 2x speed); Ollama does not yet
|
|
- **RAG via Open WebUI** — feed Nextcloud docs into local knowledge collections; possible complement to AE Journals search
|
|
- **Multi-host local models** — per-user config already supports multiple hosts; routing logic TBD
|
|
- **WhatsApp** — requires Business API account or a bridge; not started
|