Local LLM:
- user_settings.py: per-user hosts/models config (local_llm.json)
- routers/local_llm.py + static/local_llm.html: dedicated settings page
- llm_client.py: local OpenAI-compatible backend via httpx
- config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts
- Active model shown near backend toggle (amber hint text)
Memory distillation:
- memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides
- scheduler.py + notification.py: notify NC Talk after mid/long distill
- notification.py: outbound channel abstraction (NC Talk, extensible)
Session search:
- routers/files.py: GET /sessions/search?q= with excerpts grouped by date
- static/index.html + app.js: search UI in file sidebar with highlight
- _esc() helper to prevent XSS in search results
Proactive cron:
- cron_runner.py: new job types — message (send directly) and brief (LLM + send)
- Both support optional per-job channel override
Channels:
- routers/nextcloud_talk.py: consolidated using notification._send_nct_message()
- routers/auth.py: local backend status in /auth/status
- routers/chat.py: /backend returns {primary, fallback, local_model} object
UI / UX:
- Copy button for user messages (matching assistant)
- Autocomplete disabled on sensitive form fields
- settings.html: local model section replaced with link to /settings/local
Docs overhaul:
- MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md
- ARCH__Intelligence_Layer.md replaced with redirect table
- CORTEX.md trimmed to vision only; README updated
- OPEN_WEBUI_API.md added to docs/
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3.4 KiB
3.4 KiB
Cortex / Inara — Master Index
Start here. This document is a map, not a manual. Last updated: 2026-04-03
What It Is
Cortex is a self-hosted personal AI platform. It routes messages from any input channel to AI backends, manages a resident agent (Inara) with persistent memory, and coordinates across a fleet of machines. It is infrastructure, not a product.
Running at: https://cortex.dgrzone.com | systemctl --user restart cortex
Current State
| Component | Status | Notes |
|---|---|---|
| Web UI | ✅ Live | SPA, dark theme, mobile-responsive, session auth |
| Nextcloud Talk bot | ✅ Live | HMAC-signed, per-user routing |
| Google Chat Add-on | ✅ Live | JWT-verified, per-user routing |
| Claude backend | ✅ Live | Primary — via Claude Code CLI |
| Gemini backend | ✅ Live | Fallback — via Gemini CLI |
| Local backend | ✅ Live | Third option — Open WebUI/Ollama on scott_gaming |
| Gemini orchestrator | ✅ Live | Tool loop → Claude response, Agent mode in UI |
| Memory distillation | ✅ Live | Short (daily) / Mid (weekly) / Long (monthly) |
| Multi-user | ✅ Live | Scott, Holly, Brian — each with own personas |
| Session search | ✅ Live | Full-text search across past session logs |
| Proactive cron | ✅ Live | message and brief job types → NC Talk |
Active users / personas: scott/inara, scott/developer, holly/tina, brian/wintermute
Document Map
Project-Level
| Doc | What it covers |
|---|---|
| This file | Index and current state |
CORTEX.md |
Vision, philosophy, "what it is and isn't" |
ROADMAP.md |
Phases — what's done, what's next, what's deferred |
TODO__Agents.md |
Active task list — read before starting work |
Architecture
| Doc | What it covers |
|---|---|
ARCH__SYSTEM.md |
Overall architecture, component map, key design decisions |
ARCH__BACKENDS.md |
LLM backends, routing, fallback, per-user config |
ARCH__PERSONA.md |
Persona system, context tiers, memory distillation |
ARCH__CHANNELS.md |
Input channels — web, NC Talk, Google Chat, cron |
ARCH__FUTURE.md |
Planned: local orchestrator, dev agents, knowledge layer |
Setup & Reference
| Doc | What it covers |
|---|---|
docs/NEXTCLOUD_TALK_BOT.md |
NC Talk bot setup and troubleshooting |
docs/GOOGLE_CHAT_BOT.md |
Google Chat Add-on setup |
docs/OPEN_WEBUI_API.md |
Open WebUI/Ollama API reference for local model work |
Code-Level
| Doc | What it covers |
|---|---|
CLAUDE.md |
Project instructions for Claude Code — directory map, run commands, design decisions |
README.md |
Project root orientation, quick-start, user management |
cortex/static/HELP.md |
In-app help (rendered in UI for all users) |
Quick Reference
Start the service / check logs
systemctl --user restart cortex
journalctl --user -u cortex -f
Syntax check before restart
python3 -m py_compile cortex/<file>.py
Add a user
cd cortex && .venv/bin/python manage_passwords.py invite <username> <email>
Run tests
cd cortex && .venv/bin/python -m pytest tests/ -q