feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul

Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-05 20:53:06 -04:00
parent bd6532e93a
commit a4daebdc9b
33 changed files with 2985 additions and 486 deletions
--- a/documentation/MASTER.md
+++ b/documentation/MASTER.md
@@ -0,0 +1,92 @@
+# Cortex / Inara — Master Index
+
+> Start here. This document is a map, not a manual.
+> Last updated: 2026-04-03
+
+---
+
+## What It Is
+
+Cortex is a self-hosted personal AI platform. It routes messages from any input channel to AI backends, manages a resident agent (Inara) with persistent memory, and coordinates across a fleet of machines. It is infrastructure, not a product.
+
+**Running at:** `https://cortex.dgrzone.com` | `systemctl --user restart cortex`
+
+---
+
+## Current State
+
+| Component | Status | Notes |
+|---|---|---|
+| Web UI | ✅ Live | SPA, dark theme, mobile-responsive, session auth |
+| Nextcloud Talk bot | ✅ Live | HMAC-signed, per-user routing |
+| Google Chat Add-on | ✅ Live | JWT-verified, per-user routing |
+| Claude backend | ✅ Live | Primary — via Claude Code CLI |
+| Gemini backend | ✅ Live | Fallback — via Gemini CLI |
+| Local backend | ✅ Live | Third option — Open WebUI/Ollama on scott_gaming |
+| Gemini orchestrator | ✅ Live | Tool loop → Claude response, Agent mode in UI |
+| Memory distillation | ✅ Live | Short (daily) / Mid (weekly) / Long (monthly) |
+| Multi-user | ✅ Live | Scott, Holly, Brian — each with own personas |
+| Session search | ✅ Live | Full-text search across past session logs |
+| Proactive cron | ✅ Live | `message` and `brief` job types → NC Talk |
+
+**Active users / personas:** scott/inara, scott/developer, holly/tina, brian/wintermute
+
+---
+
+## Document Map
+
+### Project-Level
+| Doc | What it covers |
+|---|---|
+| **This file** | Index and current state |
+| [`CORTEX.md`](../CORTEX.md) | Vision, philosophy, "what it is and isn't" |
+| [`ROADMAP.md`](ROADMAP.md) | Phases — what's done, what's next, what's deferred |
+| [`TODO__Agents.md`](TODO__Agents.md) | Active task list — read before starting work |
+
+### Architecture
+| Doc | What it covers |
+|---|---|
+| [`ARCH__SYSTEM.md`](ARCH__SYSTEM.md) | Overall architecture, component map, key design decisions |
+| [`ARCH__BACKENDS.md`](ARCH__BACKENDS.md) | LLM backends, routing, fallback, per-user config |
+| [`ARCH__PERSONA.md`](ARCH__PERSONA.md) | Persona system, context tiers, memory distillation |
+| [`ARCH__CHANNELS.md`](ARCH__CHANNELS.md) | Input channels — web, NC Talk, Google Chat, cron |
+| [`ARCH__FUTURE.md`](ARCH__FUTURE.md) | Planned: local orchestrator, dev agents, knowledge layer |
+
+### Setup & Reference
+| Doc | What it covers |
+|---|---|
+| [`docs/NEXTCLOUD_TALK_BOT.md`](../docs/NEXTCLOUD_TALK_BOT.md) | NC Talk bot setup and troubleshooting |
+| [`docs/GOOGLE_CHAT_BOT.md`](../docs/GOOGLE_CHAT_BOT.md) | Google Chat Add-on setup |
+| [`docs/OPEN_WEBUI_API.md`](../docs/OPEN_WEBUI_API.md) | Open WebUI/Ollama API reference for local model work |
+
+### Code-Level
+| Doc | What it covers |
+|---|---|
+| [`CLAUDE.md`](../CLAUDE.md) | Project instructions for Claude Code — directory map, run commands, design decisions |
+| [`README.md`](../README.md) | Project root orientation, quick-start, user management |
+| [`cortex/static/HELP.md`](../cortex/static/HELP.md) | In-app help (rendered in UI for all users) |
+
+---
+
+## Quick Reference
+
+**Start the service / check logs**
+```bash
+systemctl --user restart cortex
+journalctl --user -u cortex -f
+```
+
+**Syntax check before restart**
+```bash
+python3 -m py_compile cortex/<file>.py
+```
+
+**Add a user**
+```bash
+cd cortex && .venv/bin/python manage_passwords.py invite <username> <email>
+```
+
+**Run tests**
+```bash
+cd cortex && .venv/bin/python -m pytest tests/ -q
+```