Files
Cortex-Inara/documentation/ROADMAP.md
Scott Idem f8f7cd75da feat: audit log, usage tracking UI, OpenAI orchestrator compaction, onboarding + docs
Tool audit log:
- Every orchestrator tool call logged to home/{user}/tool_audit/YYYY-MM-DD.jsonl
- Files panel sidebar: audit log group (collapsed), date-linked read-only table
- Admin endpoints: /api/audit/files, /api/audit/day, /api/audit/recent, /api/audit/stats
- Engine and model name recorded per entry

OpenAI orchestrator improvements:
- Context budget enforcement: 75% of model context_k (min 16k)
- Message compaction: truncates old tool results when approaching budget
- max_rounds respected per model config (intersected with server cap)

OpenRouter onboarding (setup.html, onboarding.py, app.js, settings.html):
- Step 3 of 3: /setup/model with curated model picker
- Chat banner for users on server-default model (informational, not alarmist)
- Settings quick-link card; /setup/model works standalone for existing users

Model registry + session store:
- set_role_config / get_role_config for per-role tool lists and system_append
- session_store: session rename, session name backfill endpoint

UI updates (app.js, index.html, style.css, local_llm.html):
- Role toggle in context panel
- Off-the-record mode
- Agent notes read-only viewer
- OPERATIONS.md loaded at T2+ in context

Documentation:
- HELP.md: full tool table, per-role tool sets, Agent Notes, usage tracking
- TOOLS.md: Agent Notes section, count corrected to 44
- ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md updated to match reality
- CLAUDE.md: onboarding flow, documentation philosophy sections
- README.md: stack in practice, DeepSeek TUI mention, architecture diagram updated
- TODO__Agents.md: onboarding task completed with deviation notes

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-08 21:26:43 -04:00

3.1 KiB

Cortex — Roadmap

Phases and priorities. For active tasks see TODO__Agents.md. Last updated: 2026-04-29


Phase 0 — Foundation

  • Syncthing fleet sync (agents_sync/) operational
  • MCP tools (ae_*) available in all Claude Code sessions
  • Fleet agents running independently on each machine

Phase 1 — Dispatcher Core

  • FastAPI service with streaming SSE responses
  • Claude CLI and Gemini CLI subprocess backends
  • Session context management (rolling window, file persistence)
  • Nextcloud Talk bot (HMAC-signed webhook)
  • Memory distiller (APScheduler — short/mid/long cycles)
  • Local web UI (single-page, mobile-responsive)
  • Auth status monitoring (/auth/status, UI banner)
  • Session logging and file browser

Phase 2 — Identity & Multi-User

  • Inara persona formalized (IDENTITY.md, SOUL.md, PROTOCOLS.md, context tiers)
  • Two-level user/persona layout (home/{user}/persona/{name}/)
  • Session auth: bcrypt passwords, JWT cookies, invite tokens, Google OAuth
  • Multi-user live: Scott, Holly, Brian
  • Per-user channel config (channels.json)
  • Per-user Gemini API key (settings UI)
  • Help & Reference system (shared base + per-persona additions)
  • Lucide icons, persona picker page, session persistence across navigation

Phase 3 — Intelligence Layer (In Progress)

  • Gemini API orchestrator (tool loop → Claude responder)
  • Tool suite: web search, AE Journal read/write, tasks, scratch, reminders, cron, system, email_send (+ per-user allowlist), nc_talk_send
  • Agent mode in UI (async job, poll for result); role-based tool access + confirmation gate
  • Local LLM backend (Open WebUI/Ollama, per-user multi-model config); inline model edit in registry UI
  • Proactive cron (message / brief job types → NC Talk)
  • Session search (full-text across past session logs)
  • Distill notifications (NC Talk after mid/long runs)
  • Local backend for distillation (DISTILL_BACKEND_MID/LONG in .env)
  • Local orchestrator — ReAct tool loop using local model (High priority — see TODO__Agents.md)
  • Knowledge import — markdown → AE Journals (import script)
  • Dev agent pipeline — specialist agents + supervisor + approval gate
  • Gitea webhook integration + Actions CI

Phase 4 — Channel Expansion

  • Web UI
  • Nextcloud Talk
  • Google Chat
  • WhatsApp (Business API or bridge — investigating)
  • Webhook triggers from Aether platform events

Phase 5 — Routing Intelligence & Scale

  • Intelligent model routing (by task type, privacy, context length)
  • Agent-to-agent task delegation across fleet

Phase 6 — Infrastructure

  • Server DMZ finalized
  • WireGuard for all Cortex-accessing devices
  • Camera/IoT VLAN segmentation

Deferred / Watching

  • Speculative decoding — llama.cpp supports it (E4B + E2B draft ≈ 2x speed); Ollama does not yet
  • RAG via Open WebUI — feed Nextcloud docs into local knowledge collections; possible complement to AE Journals search
  • Multi-host local models — per-user config already supports multiple hosts; routing logic TBD
  • WhatsApp — requires Business API account or a bridge; not started