Files

Scott Idem a4daebdc9b feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul

Local LLM:
- user_settings.py: per-user hosts/models config (local_llm.json)
- routers/local_llm.py + static/local_llm.html: dedicated settings page
- llm_client.py: local OpenAI-compatible backend via httpx
- config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts
- Active model shown near backend toggle (amber hint text)

Memory distillation:
- memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides
- scheduler.py + notification.py: notify NC Talk after mid/long distill
- notification.py: outbound channel abstraction (NC Talk, extensible)

Session search:
- routers/files.py: GET /sessions/search?q= with excerpts grouped by date
- static/index.html + app.js: search UI in file sidebar with highlight
- _esc() helper to prevent XSS in search results

Proactive cron:
- cron_runner.py: new job types — message (send directly) and brief (LLM + send)
- Both support optional per-job channel override

Channels:
- routers/nextcloud_talk.py: consolidated using notification._send_nct_message()
- routers/auth.py: local backend status in /auth/status
- routers/chat.py: /backend returns {primary, fallback, local_model} object

UI / UX:
- Copy button for user messages (matching assistant)
- Autocomplete disabled on sensitive form fields
- settings.html: local model section replaced with link to /settings/local

Docs overhaul:
- MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md
- ARCH__Intelligence_Layer.md replaced with redirect table
- CORTEX.md trimmed to vision only; README updated
- OPEN_WEBUI_API.md added to docs/

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-05 20:53:06 -04:00

3.4 KiB

Raw Blame History

Cortex / Inara — Master Index

Start here. This document is a map, not a manual. Last updated: 2026-04-03

What It Is

Cortex is a self-hosted personal AI platform. It routes messages from any input channel to AI backends, manages a resident agent (Inara) with persistent memory, and coordinates across a fleet of machines. It is infrastructure, not a product.

Running at: https://cortex.dgrzone.com | systemctl --user restart cortex

Current State

Component	Status	Notes
Web UI	✅ Live	SPA, dark theme, mobile-responsive, session auth
Nextcloud Talk bot	✅ Live	HMAC-signed, per-user routing
Google Chat Add-on	✅ Live	JWT-verified, per-user routing
Claude backend	✅ Live	Primary — via Claude Code CLI
Gemini backend	✅ Live	Fallback — via Gemini CLI
Local backend	✅ Live	Third option — Open WebUI/Ollama on scott_gaming
Gemini orchestrator	✅ Live	Tool loop → Claude response, Agent mode in UI
Memory distillation	✅ Live	Short (daily) / Mid (weekly) / Long (monthly)
Multi-user	✅ Live	Scott, Holly, Brian — each with own personas
Session search	✅ Live	Full-text search across past session logs
Proactive cron	✅ Live	`message` and `brief` job types → NC Talk

Active users / personas: scott/inara, scott/developer, holly/tina, brian/wintermute

Document Map

Project-Level

Doc	What it covers
This file	Index and current state
`CORTEX.md`	Vision, philosophy, "what it is and isn't"
`ROADMAP.md`	Phases — what's done, what's next, what's deferred
`TODO__Agents.md`	Active task list — read before starting work

Architecture

Doc	What it covers
`ARCH__SYSTEM.md`	Overall architecture, component map, key design decisions
`ARCH__BACKENDS.md`	LLM backends, routing, fallback, per-user config
`ARCH__PERSONA.md`	Persona system, context tiers, memory distillation
`ARCH__CHANNELS.md`	Input channels — web, NC Talk, Google Chat, cron
`ARCH__FUTURE.md`	Planned: local orchestrator, dev agents, knowledge layer

Setup & Reference

Doc	What it covers
`docs/NEXTCLOUD_TALK_BOT.md`	NC Talk bot setup and troubleshooting
`docs/GOOGLE_CHAT_BOT.md`	Google Chat Add-on setup
`docs/OPEN_WEBUI_API.md`	Open WebUI/Ollama API reference for local model work

Code-Level

Doc	What it covers
`CLAUDE.md`	Project instructions for Claude Code — directory map, run commands, design decisions
`README.md`	Project root orientation, quick-start, user management
`cortex/static/HELP.md`	In-app help (rendered in UI for all users)

Quick Reference

Start the service / check logs

systemctl --user restart cortex
journalctl --user -u cortex -f

Syntax check before restart

python3 -m py_compile cortex/<file>.py

Add a user

cd cortex && .venv/bin/python manage_passwords.py invite <username> <email>

Run tests

cd cortex && .venv/bin/python -m pytest tests/ -q

3.4 KiB Raw Blame History