Scott Idem 96b3c796c5 feat: file attachment support in chat (images + text/code files)

Text files (.md, .py, .js, .json, etc.): read client-side and injected
into the message body as a fenced code block — works with all backends
with zero model capability requirements.

Images (PNG/JPG/WebP/GIF, max 5 MB): encoded as base64 data URL on the
client and sent as a separate attachment field. Backend formats them as
OpenAI multimodal content (text + image_url) for local_openai backends.
Claude CLI and Gemini CLI see the text message with a "📎 filename.png"
note; image data is never written to session history.

- index.html: 📎 button + hidden file input in mode-select row;
  attachment-row preview area with thumbnail (images) or filename chip
- app.js: _resolveAttachment(), file reader, clearAttachment();
  sendMessage/sendOrchestrate updated to allow no-text sends when a
  file is pending; attachment spread into chat payload for images
- chat.py: Attachment model; attachment field on ChatRequest;
  llm_attachment extracted in _stream_chat and passed to complete()
- llm_client.py: attachment param through complete()/_dispatch()/_local();
  _local() builds multimodal content array for vision calls
- style.css: #attach-btn, #attachment-row, #attachment-preview, thumb
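
The image path described in this commit can be sketched roughly as follows. This is a minimal illustration, not the code in llm_client.py: the function names `image_to_data_url` and `build_multimodal_content` are hypothetical, and only the 5 MB cap and the text + image_url content shape come from the commit message.

```python
import base64

# 5 MB cap, as stated in the commit message.
MAX_IMAGE_BYTES = 5 * 1024 * 1024


def image_to_data_url(raw: bytes, mime: str) -> str:
    """Encode raw image bytes as a base64 data URL (hypothetical helper)."""
    if len(raw) > MAX_IMAGE_BYTES:
        raise ValueError("image exceeds the 5 MB limit")
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def build_multimodal_content(text: str, data_url: str) -> list:
    """Build an OpenAI-style content array: one text part, one image_url part."""
    return [
        {"type": "text", "text": text},
        {"type": "image_url", "image_url": {"url": data_url}},
    ]
```

The resulting list would be used as the `content` of a user message for an OpenAI-compatible vision model; backends without vision support would receive only the text.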

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-12 21:46:50 -04:00

File	Last commit	Date
cortex	feat: file attachment support in chat (images + text/code files)	2026-05-12 21:46:50 -04:00
docs	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul	2026-04-05 20:53:06 -04:00
documentation	docs: mark Phase 2/3 done; add file_diff, git tools, spawn_agent restrictions	2026-05-12 21:34:26 -04:00
scripts	feat: usage tracking + knowledge import script	2026-05-02 20:38:31 -04:00
.gitignore	chore: remove home/ from git, update gitignore	2026-04-08 19:19:23 -04:00
.stignore	tooling: install script, workspace file, and dev-restart helper	2026-04-08 19:11:27 -04:00
backup.sh	feat: restic backup of home/ with systemd daily timer	2026-04-08 19:36:22 -04:00
CLAUDE.md	docs: update tool count to 62 and current state to 2026-05-12	2026-05-12 00:14:12 -04:00
Cortex_and_Inara.code-workspace	tooling: install script, workspace file, and dev-restart helper	2026-04-08 19:11:27 -04:00
dev-restart.sh	tooling: install script, workspace file, and dev-restart helper	2026-04-08 19:11:27 -04:00
docker-compose.yml	Initial commit — Cortex API + Inara identity	2026-03-06 03:41:00 -05:00
install.py	fix: create cortex config dir and restic password file during install	2026-04-08 21:24:08 -04:00
README.md	docs: comprehensive doc audit — sync all docs to current state	2026-05-09 13:13:45 -04:00

README.md

Cortex / Inara — Project Root

Owner: Scott Idem (One Sky IT / Danger Zone)
Started: 2026-03-04
Status: Active development

"You can't stop the signal."

Cortex is a self-hosted multi-agent AI platform. It supports multiple users, each with their own named AI persona.

Where Cortex Fits

AI tools aren't one-size-fits-all. Cortex exists in a specific niche — it's not trying to be everything.

Cortex is a self-hosted persona platform. It gives you a persistent AI companion with its own identity, memory, and voice — reachable through your chat apps, not just a browser tab. It remembers who you are across days and weeks. It can proactively message you on a schedule. It runs on your own hardware, behind your own auth.

What Cortex is good at

Being a consistent AI presence — same persona, same memory, day after day
Multi-channel access — web, Nextcloud Talk, Google Chat, all routed to the same brain
Proactive work — scheduled messages, reminders, cron jobs that reach out to you
Multi-user households — each person gets their own persona (Scott → Inara, Holly → Tina)
Private, offline-capable — local models via Ollama when you don't want anything leaving the LAN

What Cortex is not

Not a coding assistant. Cortex lives in chat apps, not in your terminal or IDE. Use Claude Code, DeepSeek TUI, Gemini CLI, or Copilot for code-level work — they specialize in reading and editing project files. Cortex can't open a codebase.
Not a generic LLM chat UI. Open WebUI and LibreChat are excellent model-switching frontends. Cortex isn't a frontend — it's a platform with its own identity system, orchestrator, and memory pipeline. Two different jobs.
Not a SaaS product. Nobody else hosts your Cortex instance. Nobody else sees your conversations. The trade-off is you manage the service yourself — systemctl --user restart cortex.
Not an agent framework. LangChain, CrewAI, and similar are libraries for building AI pipelines. Cortex is a running service with concrete personas, not an abstraction layer to build on top of.

The stack in practice

Use Cortex to talk to Inara — daily assistant, memory keeper, scheduled check-ins
Use Claude Code / DeepSeek TUI to work on Cortex — code edits, architecture, debugging
Use Open WebUI when you want to test a new model or run a quick prompt without persona context

Same AI, different interfaces for different jobs.

Quick Orientation

Directory	What it is
`cortex/`	FastAPI service — dispatcher, routing, LLM backends, session management
`home/`	User and persona data (`home/{username}/persona/{name}/`)
`docs/`	Integration reference docs (NC Talk bot, Google Chat bot)
`documentation/`	Architecture decisions, project plans, agent task lists

Multi-User Layout

Persona data lives in a two-level tree modelled on Linux home directories:

home/
  scott/
    persona/
      inara/       ← IDENTITY.md, SOUL.md, MEMORY_*.md, sessions/, TASKS.json, …
  holly/
    persona/
      tina/
  [username]/
    persona/
      [name]/

Each HTTP request includes user and persona fields. The service validates both against the home/ tree before routing. ContextVars ensure per-request isolation in async code.
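
To illustrate why ContextVars give per-request isolation, here is a minimal sketch. The variable names `current_user` and `current_persona` and the `handle_request` function are hypothetical and are not taken from the Cortex source. Each asyncio task runs in its own copy of the context, so values set by one request are not visible to another, even when the two interleave.

```python
import asyncio
from contextvars import ContextVar

# Hypothetical names; the real service may organise this differently.
current_user: ContextVar[str] = ContextVar("current_user")
current_persona: ContextVar[str] = ContextVar("current_persona")


def persona_dir() -> str:
    """Read the request-scoped values without passing them as arguments."""
    return f"home/{current_user.get()}/persona/{current_persona.get()}/"


async def handle_request(user: str, persona: str) -> str:
    current_user.set(user)
    current_persona.set(persona)
    await asyncio.sleep(0)  # yield so the other request runs in between
    return persona_dir()


async def main() -> list:
    # gather() wraps each coroutine in a task with its own context copy.
    return await asyncio.gather(
        handle_request("scott", "inara"),
        handle_request("holly", "tina"),
    )


if __name__ == "__main__":
    print(asyncio.run(main()))
```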

Naming rules (same as Linux usernames): lowercase letters, digits, _, -; must start with a letter or underscore; max 32 characters. Examples: scott, holly, my_ai-v2.
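
The naming rules and the home/ tree check could be expressed along these lines. This is a sketch under stated assumptions: `is_valid_name` and `resolve_persona_dir` are hypothetical helpers, and the regular expression is one reading of the rules above rather than the pattern Cortex actually uses.

```python
import re
from pathlib import Path

# First character: lowercase letter or underscore.
# Remaining characters: lowercase letters, digits, "_" or "-".
# Total length: at most 32.
NAME_RE = re.compile(r"[a-z_][a-z0-9_-]{0,31}")


def is_valid_name(name: str) -> bool:
    return NAME_RE.fullmatch(name) is not None


def resolve_persona_dir(home: Path, user: str, persona: str) -> Path:
    """Validate both names, then confirm the persona directory exists."""
    if not (is_valid_name(user) and is_valid_name(persona)):
        raise ValueError("invalid user or persona name")
    path = home / user / "persona" / persona
    if not path.is_dir():
        raise FileNotFoundError(f"no such persona: {user}/{persona}")
    return path
```

Validating the names before building the path also rejects traversal attempts such as `../etc`, because `.` and `/` are not in the allowed character set.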

Setup / Install

Run install.py on any machine to set up or update Cortex. It is idempotent — safe to re-run.

python3 install.py           # install / update everything
python3 install.py --check   # status check only, no changes

What it does: creates the Python venv, installs dependencies, writes the systemd user service, enables linger, starts/restarts the service, checks LLM CLI auth, and sets up the daily backup timer.

Config: copy cortex/.env.default to cortex/.env and fill in secrets before first run.

Running Cortex

Cortex runs as a systemd user service (no sudo required).

# Start / stop / restart
systemctl --user start cortex
systemctl --user stop cortex
systemctl --user restart cortex

# Status and logs
systemctl --user status cortex
journalctl --user -u cortex -f

# Web UI: http://localhost:8000 (or https://cortex.dgrzone.com over WireGuard)

The service starts automatically at boot via loginctl enable-linger. Service file: ~/.config/systemd/user/cortex.service

Config lives in cortex/config.py and cortex/.env (not tracked — see cortex/.env.default).

Development Workflow

The codebase lives in agents_sync/ and syncs to all fleet machines via Syncthing. Edit code on any machine; use dev-restart.sh to apply changes on the host running the service.

./dev-restart.sh          # restart service, show last 30 log lines
./dev-restart.sh logs     # tail live logs (ctrl-c to stop)
./dev-restart.sh status   # show service status only

Backup

Persona data (home/) is excluded from git and backed up with restic. install.py sets up a systemd timer that runs backup.sh daily at 03:00.

./backup.sh    # run a backup manually

# Inspect snapshots (set env vars or export them)
RESTIC_REPOSITORY=~/backups/cortex-home-restic \
RESTIC_PASSWORD_FILE=~/.config/cortex/restic-password \
restic snapshots

The restic password is generated at ~/.config/cortex/restic-password on first install. Back it up separately — it is required to restore from any snapshot.

Key Documentation

Start here for a full picture: documentation/MASTER.md

File	Purpose
`documentation/MASTER.md`	Index — current state, all doc links, quick reference
`documentation/ROADMAP.md`	Phases — what's done, what's next
`documentation/TODO__Agents.md`	Active task list
`documentation/ARCH__SYSTEM.md`	System architecture and component map
`documentation/ARCH__BACKENDS.md`	LLM backends, routing, fallback
`documentation/ARCH__PERSONA.md`	Persona system, context tiers, memory distillation
`documentation/ARCH__CHANNELS.md`	Input channels — web, NC Talk, Google Chat, cron
`documentation/ARCH__FUTURE.md`	Planned features — local orchestrator, dev agents, knowledge layer
`docs/NEXTCLOUD_TALK_BOT.md`	NC Talk bot setup and troubleshooting
`docs/GOOGLE_CHAT_BOT.md`	Google Chat Add-on setup
`docs/OPEN_WEBUI_API.md`	Open WebUI/Ollama API reference

Architecture at a Glance

[Web UI / NC Talk / Google Chat / Cron / Webhooks]
        ↓
  Cortex Dispatcher  (FastAPI, cortex/)
    ├─ POST /chat                            — direct to LLM (streaming SSE)
    ├─ POST /orchestrate                     — Gemini tool loop → Claude response
    ├─ POST /webhook/nextcloud/{username}    — Nextcloud Talk bot (per-user)
    └─ POST /channels/google-chat/{username} — Google Chat Add-on (per-user)
        ↓
  LLM Backends
  • Claude CLI      — primary, all user-facing responses
  • Gemini CLI      — fallback
  • Gemini API      — orchestrator tool loop (two-brain: Gemini plans, Claude responds)
  • Local OpenAI    — Open WebUI/Ollama on scott_gaming; also runs local orchestrator loop
        ↓
  Persona context loaded from home/{user}/persona/{name}/

See documentation/ARCH__SYSTEM.md for the full architecture breakdown.
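
The primary/fallback relationship between backends can be pictured with a small sketch. The README states only that Claude CLI is primary and Gemini CLI is the fallback; the `complete_with_fallback` function, the `Backend` type, and the error handling below are assumptions for illustration, and the actual routing is described in documentation/ARCH__BACKENDS.md.

```python
from typing import Callable, Sequence, Tuple

# A backend is modelled here as a callable taking a prompt and returning text.
Backend = Callable[[str], str]


class AllBackendsFailed(RuntimeError):
    """Raised when every backend in the chain raised an error."""


def complete_with_fallback(
    prompt: str, backends: Sequence[Tuple[str, Backend]]
) -> Tuple[str, str]:
    """Try each backend in order; return (backend_name, response) from the first success."""
    errors = []
    for name, call in backends:
        try:
            return name, call(prompt)
        except Exception as exc:  # any failure moves on to the next backend
            errors.append(f"{name}: {exc}")
    raise AllBackendsFailed("; ".join(errors))
```

A caller would pass the chain in priority order, for example `[("claude_cli", claude), ("gemini_cli", gemini)]`, where `claude` and `gemini` are whatever callables wrap the respective CLIs.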

Personas

Each persona has its own identity, memory, and session history. They are not tied to a specific LLM model — the name is fixed, the backend varies. Context is loaded at request time from home/{user}/persona/{name}/ via cortex/context_loader.py.

User	Persona	Description
scott	inara	Scott's primary AI assistant
scott	developer	Scott's dev-focused persona
holly	tina	Holly's primary AI assistant
brian	wintermute	Brian's primary AI assistant
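
As a rough picture of loading context at request time, the sketch below concatenates the persona files named in the Multi-User Layout section. It is not the code in cortex/context_loader.py: the function name `load_persona_context`, the file ordering, and the plain concatenation are assumptions, and the real loader applies context tiers described in documentation/ARCH__PERSONA.md.

```python
from pathlib import Path


def load_persona_context(persona_dir: Path) -> str:
    """Concatenate identity, soul, and memory files that exist in persona_dir."""
    parts = []
    # Fixed-name files first; a missing file is skipped.
    for name in ("IDENTITY.md", "SOUL.md"):
        candidate = persona_dir / name
        if candidate.is_file():
            parts.append(candidate.read_text(encoding="utf-8"))
    # Memory files follow in sorted order so the result is deterministic.
    for memory_file in sorted(persona_dir.glob("MEMORY_*.md")):
        parts.append(memory_file.read_text(encoding="utf-8"))
    return "\n\n".join(parts)
```

Because the files are read on every request, edits to a persona's files take effect without restarting the service, which matches the "loaded at request time" behaviour described above.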

Channels

Webhook endpoints are per-user — each user configures their own secrets in home/{username}/channels.json.

Channel	Status	Endpoint / Notes
Web UI	Live	`https://cortex.dgrzone.com` — session auth (login form + JWT cookie)
Nextcloud Talk	Live	`POST /webhook/nextcloud/{username}` — HMAC-signed, async reply
Google Chat	Live	`POST /channels/google-chat/{username}` — Workspace Add-on, JWT auth
Browser Push	Live	VAPID push notifications — subscribe via ☰ menu; proactive reminders + distill alerts

See docs/NEXTCLOUD_TALK_BOT.md and docs/GOOGLE_CHAT_BOT.md for setup instructions.
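
For readers unfamiliar with HMAC-signed webhooks, the check on the Nextcloud Talk endpoint can be sketched as below. The function name `verify_talk_signature` is hypothetical. The scheme shown (HMAC-SHA256 over the random value concatenated with the raw body, sent in the `X-Nextcloud-Talk-Random` and `X-Nextcloud-Talk-Signature` headers) reflects the Talk bot protocol as I understand it and should be confirmed against docs/NEXTCLOUD_TALK_BOT.md.

```python
import hashlib
import hmac


def verify_talk_signature(
    secret: str, random_header: str, body: bytes, signature_header: str
) -> bool:
    """Return True if signature_header matches HMAC-SHA256(secret, random + body)."""
    expected = hmac.new(
        secret.encode("utf-8"),
        random_header.encode("utf-8") + body,
        hashlib.sha256,
    ).hexdigest()
    # Compare as bytes in constant time to avoid leaking timing information.
    return hmac.compare_digest(
        expected.encode("ascii"),
        signature_header.lower().encode("utf-8"),
    )
```

The body must be the raw request bytes, read before any JSON parsing, since re-serialising the payload can change the bytes and invalidate the signature.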

User Management

cd cortex

# Create a user directory and send an invite email
.venv/bin/python manage_passwords.py invite <username> <email>

# Register a Google account for sign-in (run after user completes onboarding)
.venv/bin/python manage_passwords.py google-add <username> <email>

# List users with password, Google, and email status
.venv/bin/python manage_passwords.py list

# Set/check a password directly
.venv/bin/python manage_passwords.py set <username>
.venv/bin/python manage_passwords.py check <username>

New users receive a link to /setup/{token} where they set their own password and create their first persona. Invite tokens expire in 72 hours and are one-time-use.
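
The two invite-token properties (72-hour expiry, one-time use) can be illustrated with an in-memory sketch. The `InviteStore` class is hypothetical; how manage_passwords.py actually generates and stores tokens is not shown in this README.

```python
import secrets
from datetime import datetime, timedelta, timezone
from typing import Dict, Optional, Tuple

INVITE_TTL = timedelta(hours=72)  # expiry window stated in the README


class InviteStore:
    """In-memory invite registry (illustrative; a real store would persist)."""

    def __init__(self) -> None:
        self._invites: Dict[str, Tuple[str, datetime]] = {}

    def issue(self, username: str, now: Optional[datetime] = None) -> str:
        now = now or datetime.now(timezone.utc)
        token = secrets.token_urlsafe(32)  # unguessable, URL-safe token
        self._invites[token] = (username, now + INVITE_TTL)
        return token

    def redeem(self, token: str, now: Optional[datetime] = None) -> Optional[str]:
        """Return the username for a valid token, or None. Tokens work once."""
        now = now or datetime.now(timezone.utc)
        entry = self._invites.pop(token, None)  # pop enforces one-time use
        if entry is None:
            return None
        username, expires_at = entry
        return username if now < expires_at else None
```

In this sketch an expired token is also discarded on the first redeem attempt, so it cannot be retried later.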

To enable a channel for a user, create home/{username}/channels.json — see the relevant doc in docs/.

Testing

cd cortex
.venv/bin/python -m pytest tests/ -q

The suite contains 80 tests covering API endpoints, persona routing, tool functions, and security.

Related Projects

Project	Path
Aether Platform API	`~/OSIT_dev/aether_api_fastapi/`
Aether Frontend	`~/OSIT_dev/aether_app_sveltekit/`
Fleet coordination	`~/agents_sync/`

Languages

Python 69.1%

HTML 14.1%

JavaScript 10.2%

CSS 6.2%

Shell 0.3%
