Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	334e7f0dea	feat: role-based tool access, confirmation gates, and new orchestrator tools - auth_utils: get_user_role() reads role from auth.json (admin\|user, default user) - manage_passwords: new `role` command to promote/demote users (admin-only by convention) - tools/__init__: TOOL_ROLES map, CONFIRM_REQUIRED set, get_tools_for_role(), get_openai_tools_for_role() — both orchestrators now filter tools by caller's role - tools/system: cortex_restart (detached subprocess, 5s delay), cortex_logs (admin-only) - tools/web: http_fetch — direct URL fetch, distinct from web_search - tools/files: file_list (directory listing), file_write (restricted paths, admin-only) - tools/notify: nc_talk_send — proactive outbound via notification.py - orchestrator_engine + openai_orchestrator: user_role param; CONFIRM_REQUIRED tools return a confirmation-request result instead of executing — loop breaks after Claude asks user to confirm in a follow-up message - home/scott/auth.json: role set to admin Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 19:23:53 -04:00
Scott Idem	25182a1765	feat: PWA support — manifest, service worker, icons, public auth exemption Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:46:33 -04:00
Scott Idem	66cb197de0	feat: last-used persona cookie, emoji dropdown, theme support, auth status move - cx_last_persona cookie set on serve_ui; root/login/help/settings redirects use preferred persona from cookie instead of alphabetically first - /api/personas returns [{name, emoji}] objects; persona switcher dropdown renders emoji + name with flex layout and .pd-emoji span - Help, Settings, Model Registry pages apply localStorage theme on load (no flash); CSS variables for dark/light replacing all hardcoded hex values - Claude CLI auth status moved from prominent chat banner to Anthropic provider block in Model Registry — live dot indicator (ok/warn/err) - Auth banner removed from main chat UI (index.html, app.js, style.css) - Add Model collapsed into Models section as <details> to shorten page - Light-mode overrides for provider icons, model badges, ctx-badge, tags (Anthropic/Google/local colors now readable in both themes) - Help page gains table, pre/code, hr styles for HELP.md rendered content Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 22:52:34 -04:00
Scott Idem	2b9dd53566	feat: replace Agent mode with independent Tools toggle - Remove 'agent' from mode dropdown; Chat/Note/OTR remain - Add ⚡ tools toggle button in input bar (persisted in localStorage) When on: routes to POST /orchestrate (Gemini tool loop); send btn → "Run" When off: routes to POST /chat (direct to active role); no change - Role selector and tools toggle are now fully independent: active chat_role sent in orchestrate payload → used for final response - orchestrator_engine.run() accepts response_role param; passes it to complete(role=...) instead of hardcoded model="claude" - OrchestrateRequest gains chat_role field (default "chat") - Migrate stored 'agent' mode/MRU entries to 'chat' on load Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 20:36:15 -04:00
Scott Idem	1cc7988953	feat: add shell_exec tool and fix orchestrator model name resolution - Add shell_exec to orchestrator tool suite (system.py + __init__.py) Runs arbitrary shell commands on the Cortex host with timeout (1–120s), combined stdout/stderr output, optional working_dir, and exit code reporting. Enables system diagnostics (df, ls, ps, journalctl, etc.) from Agent mode. - Fix orchestrator_engine.run() to use model_name from resolved registry entry Previously used settings.orchestrator_model (.env hardcode) regardless of what model was assigned to the orchestrator role. Now accepts model_name param and falls back to settings value only when registry has no model_name. - Update ARCH__FUTURE.md: date, running host, local orchestrator status, model registry V2 progress, added Cortex Mesh concept (section 9) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 20:29:46 -04:00
Scott Idem	8baab874f1	feat: replace backend/slot toggle with role selector The backend toggle now cycles through configured roles (chat, coder, research, distill, etc.) instead of backup model slots within the chat role. Each role uses its own primary→backup chain from the registry. - ChatRequest.slot replaced by chat_role (default "chat") - GET /backend returns available_roles instead of chat_models - _available_roles_for_toggle() builds list from defined_roles, excluding orchestrator (which has its own Agent mode) - Model label on responses now reflects the actual role's assigned model - Toggle is inert when only one role is configured (avoids useless cycling) - Add "Clear browser cache" button to Account Settings (Connected Accounts) - Add _role_model_label() helper for cleaner response tag labeling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 19:23:18 -04:00
Scott Idem	962d58d2e2	feat: model registry Phase 3 — slot-based backend toggle Backend toggle now cycles through chat role models by label instead of cycling service type strings (auto/claude/gemini/local). - model_registry: get_model_for_slot() — resolves a specific priority slot without walking the fallback chain - llm_client: complete() gains slot param; explicit slot selection dispatches directly to that model with no silent fallback - routers/chat.py: ChatRequest.slot; GET /backend returns chat_models [{slot, label, type}] for the UI; _stream_chat uses resolved model label for the response tag when a slot is pinned - app.js: toggle loads chat_models from /backend, cycles by label, sends slot in chat payload; legacy model field removed from payload - app.js: fix Gap B — agent mode placeholder no longer says "Gemini tool loop"; now says "orchestrator" - DESIGN doc: updated to reflect phases 1+2 complete, catalog-as-code decision, Gap A/B documented, Phase 3 implementation details Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 21:43:08 -04:00
Scott Idem	6e56024815	fix: settings page and help docs updated for model registry V2 settings.html: - Remove Gemini API Key section (keys now managed in Model Registry) - Rename "Local Models" → "Model Registry" with updated description covering all providers (Anthropic, Google, local hosts) - Update button text: "Manage local models" → "Manage models" settings.py: remove dead gemini_key template variable lookups HELP.md: - Fix navigation path: ☰ → Account → Model Registry → Manage models - Restructure Model Registry section as ordered steps (1: providers/hosts, 2: add models, 3: assign roles) so dependency order is clear - Add explicit note that accounts/hosts must exist before adding models Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 21:07:05 -04:00
Scott Idem	f08b033d6c	feat: model registry Phase 2 — cloud provider UI (Anthropic + Google) Adds cloud provider management to /settings/models: - Google Accounts section: add/remove Gemini API keys with labels - Add Model form: provider tabs (Local / Google / Anthropic) with catalog dropdowns that auto-fill label and context_k - Provider badges on model rows (Anthropic / Google / Local) - /settings/local now redirects to /settings/models (canonical URL) - save_cloud_model() in model_registry for Anthropic/Google entries - Distill role migration restored in _migrate_from_local_llm - Test fixes: version assertions updated to V2 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 20:41:06 -04:00
Scott Idem	45c95d20ba	feat: model registry V2 — provider-aware schema with multi-account support Adds a providers section to the per-user model registry for Anthropic and Google as first-class providers alongside local hosts. Google accounts (API keys) are now stored as a list so multiple Google accounts can coexist. Changes: - model_registry.py: V2 schema, auto migration V1→V2 (pulls gemini_api_key from auth.json into providers.google.accounts), _resolve_model() merges account API key for gemini_api type models - routers/orchestrator.py: uses model-resolved api_key when orchestrator role resolves to a gemini_api model with account_id - ANTHROPIC_CATALOG and GOOGLE_CATALOG constants for model picker (Phase 2) - New functions: get_google_api_key(), save/remove_google_account(), get_catalog() - Documentation: ARCH__BACKENDS.md updated to V2 schema, DESIGN doc added Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 20:21:04 -04:00
Scott Idem	3b3456600a	feat: store and display backend + host metadata on chat messages Each assistant message in the session JSON now carries: backend, backend_label, host (platform.node()) These fields are shown as model tags in the UI — on live responses and when loading session history. Session log entries (sessions/YYYY-MM-DD.md) include the backend label and host in the turn header. The local (OpenAI-compat) backend strips non-standard fields before sending messages to the API so extra fields don't leak upstream. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 22:16:48 -04:00
Scott Idem	d9a322164a	feat: OpenAI-compatible orchestrator + backend auto-routing - openai_orchestrator.py — new ReAct tool loop engine for any OpenAI-compatible endpoint (OpenRouter, Open WebUI, Ollama, LiteLLM); model handles both tool loop and final response, no Claude handoff needed - tools/__init__.py — auto-derive OpenAI JSON Schema from existing Gemini FunctionDeclarations so tool definitions have a single source of truth - routers/orchestrator.py — route to openai_orchestrator when model registry "orchestrator" role resolves to a local_openai type host - routers/chat.py — pass role to _backend_label(); fix fallback_used logic (only meaningful for explicit backend overrides, not auto-routing) - static/app.js — add null/"auto" to backend cycle; fetch local model hint without overriding the auto default on page load - model_registry.py — _normalize() back-fills host_type on old registry files - requirements.txt — add openai>=1.0.0 - ARCH__BACKENDS.md — document OpenAI-compat backend and routing logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:18:18 -04:00
Scott Idem	a6e404c143	feat: host_type field for OpenRouter / OpenAI-compatible API support Adds host_type ("openwebui" \| "openai") to the host schema so Cortex can talk to both Open WebUI/Ollama and OpenRouter/standard-OpenAI endpoints. Path differences per type: openwebui (default): /api/chat/completions, /api/models openai: /chat/completions, /models model_registry.py: - host_type added to host schema (default "openwebui", backward compat) - save_host() accepts host_type parameter - _resolve_model() passes host_type through with the merged host fields llm_client._local(): - Reads host_type from resolved model_cfg - Selects correct chat completions path accordingly routers/local_llm.py: - save_host route accepts host_type form field - fetch-models uses /models for openai type, /api/models for openwebui - Existing host rows show type selector pre-filled from stored value local_llm.html: - "Add host" form includes type selector To use OpenRouter: - Add host: URL = https://openrouter.ai/api/v1, Type = OpenAI-compatible - API key from openrouter.ai (store in .env or model_registry.json only) - Fetch models or add manually (e.g. anthropic/claude-sonnet-4-5-20251022) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 21:11:22 -04:00
Scott Idem	8570e8d852	fix: backend toggle not sent to server; add per-message model tag Fixes: - app.js was tracking primaryBackend locally but never included model: primaryBackend in the /chat POST body, so the server always used settings.primary_backend regardless of what the user clicked. Now model: primaryBackend is sent on every chat request. - Responses were only annotated when fallback occurred. Now every assistant message shows a small model tag at the bottom right. chat.py: - _backend_label() resolves human-readable name: claude → "Claude", gemini → "Gemini", local → registry label (e.g. "Gemma 4 E4B") or model_name - SSE payload now includes backend_label field app.js: - model: primaryBackend added to /chat fetch body - After every response, appends .model-tag div with backend_label - Fallback shows "⚡ fallback → <label>" in amber; normal is muted - Removed separate system message for fallback (tag covers it) style.css: - .model-tag: small muted text, right-aligned, separated by thin line - .model-tag.fallback: amber (#f59e0b) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 22:10:40 -04:00
Scott Idem	608e1de246	feat: model registry UI — hosts, models, role assignments Replaces the single-host local model settings page with a full model registry interface at /settings/local. Hosts section: - List existing hosts with inline edit + save + remove - Collapsible "Add host" form - Per-host "Fetch models" button Models section: - List all models with label, model name, host, context_k badge, tags - Remove button Add Model section: - Host dropdown, label, model name, context_k, tags (comma-separated) - "Fetch models from host" with auto-fill picker Role Assignments section: - One row per defined role (chat, orchestrator, distill, coder, research) - Primary + backup_1 + backup_2 dropdowns per role - Dropdowns pre-filled from registry on load - AJAX save on change (POST /api/models/role) with toast confirmation - Built-in models (claude_cli, gemini_cli, gemini_api) always available in dropdowns Backend: - All user_settings references replaced with model_registry - host/{id}/remove route added - fetch-models now accepts host_id query param - POST /api/models/role for AJAX role assignment Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:31:32 -04:00
Scott Idem	6a1a1c2686	feat: unified model registry with role-based routing Introduces model_registry.py as the single source of truth for all LLM backend configuration. Replaces scattered backend settings across user_settings, config distill_backend_, and the UI toggle. model_registry.py: - Per-user home/{user}/model_registry.json with version, hosts, models, roles - Models have: type (local_openai\|claude_cli\|gemini_cli\|gemini_api), label, model_name, host_id, context_k (tokens), tags (capability labels) - Roles map to priority chains: primary, backup_1..backup_4 - Built-in IDs (claude_cli, gemini_cli, gemini_api) always resolvable - Auto-migrates existing local_llm.json on first access - CRUD: save_host, remove_host, save_model, remove_model, set_role - get_model_for_role(): registry → .env default → hardcoded fallback config.py: - role_chat/orchestrator/distill/coder/research .env defaults - defined_roles: comma-separated standard role list (extensible) - get_defined_roles() and get_role_default() helper methods llm_client.complete(): - New role= parameter (default "chat") for registry-based routing - model= still accepted for explicit UI toggle override - _claude() and _local() accept model_cfg dict instead of raw string - _local() uses pre-resolved config from registry memory_distiller.py: - distill_mid/long now use role="distill" (no more distill_backend_ .env vars needed) cron_runner.py: - brief jobs use role="chat" routers/chat.py + auth.py: - Use model_registry instead of user_settings for local model info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:25:18 -04:00
Scott Idem	a4daebdc9b	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 20:53:06 -04:00
Scott Idem	bd6532e93a	feat: shared nav bar on Help and Settings pages Replaces the lone "← Back to Cortex" link with a consistent page-nav on both pages: ← Chat \| Help \| Settings \| Sign out Active page is highlighted purple; others are muted gray. Settings page gets a {{ help_href }} template var from settings.py. Help page builds nav links from the existing cfg JS object. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 22:09:08 -04:00
Scott Idem	662924c6a1	fix: pass user to _run_job so get_user_gemini_key resolves correctly NameError: name 'user' is not defined in orchestrator._run_job — user was resolved in the endpoint but not forwarded to the background task. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 21:21:14 -04:00
Scott Idem	93f7f44e51	feat: per-user channel config for Google Chat and Nextcloud Talk - New endpoints: POST /channels/google-chat/{username} and /webhook/nextcloud/{username} - Channel secrets/config live in home/{username}/channels.json (gitignored) - auth_utils: get_user_channels() helper reads channels.json - Both routers load persona, audience/secret, backend, timeout per user; set_context() wires the correct persona before building the system prompt - Removed server-level channel settings from config.py and .env — no user gets a channel until they create their own channels.json - .gitignore: home/**/channels.json added To migrate: update Google Chat Add-on webhook URL to /channels/google-chat/{username} and re-register NC Talk bot at /webhook/nextcloud/{username} Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 13:02:45 -04:00
Scott Idem	7438031797	feat: connected accounts + Gemini API key in account settings UI Settings page gains two new sections: - Connected Accounts: shows linked Google email (read-only) - Gemini API Key: paste personal key from aistudio.google.com, shows masked hint of saved key, remove link to revert to server key POST /settings/gemini-key saves/clears gemini_api_key in auth.json. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:16:37 -04:00
Scott Idem	8aec6aafcc	feat: Google OAuth sign-in + per-user Gemini API key Users with Google accounts can now sign in without a password. Auth flow: - GET /auth/google → Google consent page (CSRF state cookie) - GET /auth/google/callback → exchange code, lookup user, set JWT - auth.json gains google_sub + google_email fields - set_password() no longer overwrites unrelated auth.json fields Admin setup: python manage_passwords.py google-add <username> <email> # add GOOGLE_CLIENT_ID + GOOGLE_CLIENT_SECRET to .env Per-user Gemini key: - get_user_gemini_key() reads gemini_api_key from auth.json - orchestrator_engine.run() accepts gemini_api_key param - orchestrator router passes user's key, falls back to server key login.html: "Sign in with Google" button above the password form. manage_passwords.py list: now shows auth method columns (pw / google). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:01:52 -04:00
Scott Idem	62fde62653	feat: persona-specific favicon + fix favicon.ico 404 app.js updates the <link rel="icon"> to the active persona's emoji on load (CORTEX_EMOJI is already injected server-side). /favicon.ico route added as a fallback for login/settings/help pages that don't have persona context. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:45:36 -04:00
Scott Idem	826bd6cfe3	feat: /{username} persona picker landing page Visiting /scott (or any user root) now shows a clean card page listing all their personas with emoji + name, each linking to /{user}/{persona}. Previously the route was unhandled (404 or wildcard match). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:19:04 -04:00
Scott Idem	c3507f8e11	fix: help page back link preserves active persona Pass ?persona= query param on the help link so the server knows which persona to return to. Previously always defaulted to personas[0], causing navigation back to the wrong persona. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:13:52 -04:00
Scott Idem	fa04b5e6b0	feat: off the record mode (OTR) Adds a third input mode toggle alongside Note and Agent. When active: - Textarea gets a subtle purple tint with dashed border - OTR button highlights purple - Placeholder reads "Off the record — not logged or distilled…" - off_record=True is sent to /chat; session_logger is skipped - In-memory session context is preserved within the session Switching to Note or Agent mode deactivates OTR, and vice versa. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 21:07:21 -04:00
Scott Idem	8487645224	fix: claude auth expiry warning — correct field name and smarter threshold - Fix 'undefined' in auth banner: read access_token_hours_remaining (not hours_remaining) - Fix false-positive warning on fresh tokens: when refresh token present, only warn within 1 hour of expiry (not 24h) since the CLI should auto-rotate but sometimes misses - Emit claude_auth_expired SSE event on 401 so UI shows inline red banner immediately - app.js: handle claude_auth_expired SSE event with persistent top banner + dismiss button Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 23:22:18 -04:00
Scott Idem	0cf0d65e9e	feat: session naming, username/persona rename, help page, contrast fixes - Session name field: PATCH /sessions/{id} endpoint, inline rename button in UI - Persona rename: inline ✏ toggle form in settings, POST /settings/persona/rename - Username rename: inline form in settings, POST /settings/username (renames home dir, forces re-login) - Help page: dedicated /help route replacing modal, collapsible sections - Per-persona isolation: files.py and session_store.py now scope to correct user/persona - Contrast/visibility: muted text bumped to slate-400+, session rename btn at 0.4 opacity Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 23:10:12 -04:00
Scott Idem	1b425a539f	feat: account settings page + dedicated help page - Add /settings page with password change form and personas list - Add /help dedicated page (replaces help modal); renders HELP.md with collapsible sections, dark theme, back link to active persona - Add 👤 account button and convert ? button to link in header - Remove help modal HTML and ~55 lines of modal JS from main app - Register settings and help routers in main.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 21:41:18 -04:00
Scott Idem	c01ef663f5	fix: per-persona session/file isolation + onboarding route order - session_store: store sessions under home/{user}/persona/{name}/session_data/ instead of the shared cortex/data/sessions/ bucket - chat endpoints: add user/persona query params to /sessions, /history/, /sessions/, /note so they resolve the correct persona context - files router: add user/persona query params to /files and /files/{name} so the file browser loads the right persona's files - app.js: pass user/persona on all session, history, and file fetches; move _fileParams to top-level scope so it is available everywhere - onboarding: fix FastAPI route ordering — register /persona before /{token} so the literal path wins and does not get captured as a token value - ui.py: read Emoji field from IDENTITY.md and inject into CORTEX_CONFIG so the header icon reflects each persona's chosen emoji - .gitignore: exclude home/**/session_data/ (runtime state) - migrate scott/inara sessions from cortex/data/sessions/ to session_data/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-22 00:01:07 -04:00
Scott Idem	c2825194d4	docs: update project docs, NC Talk guide, Tina persona, and gitignore - CLAUDE.md: add new auth/onboarding files to directory map, update security section (JWT/bcrypt/invite details), expand recently completed - README.md: fix Web UI auth description, add User Management section - TODO__Agents.md: mark NC Talk docs and auth/onboarding complete, update Holly onboarding plan to reflect single-instance multi-user approach - docs/NEXTCLOUD_TALK_BOT.md: complete guide — occ commands, nginx config, clarify incoming vs outgoing HMAC difference, multi-user note, full troubleshooting table - home/holly/persona/tina/: flesh out all four persona files with real content (DCC name origin, metal music, reading, foster cats, Holly's profile) - .gitignore: exclude home/**/auth.json, invite.json, profile.json Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-21 00:13:35 -04:00
Scott Idem	46b65d087c	feat: persona onboarding — invite tokens, self-service setup, persona creation, switcher New user flow: 1. Admin: python manage_passwords.py invite <username> → generates URL 2. User visits /setup/<token> → sets own password → logged in 3. User redirected to /setup/persona → fills name/emoji/description 4. persona_template.py generates all starter files → lands at /{user}/{persona} Multiple personas: - Header persona name is now a clickable dropdown listing all personas - "New persona" link at bottom → /setup/persona (available to logged-in users) - /api/personas endpoint returns persona list for current session user New files: - persona_template.py: generates IDENTITY/SOUL/PROTOCOLS/USER/HELP.md + data files - routers/onboarding.py: /setup/{token}, /setup/persona GET+POST - static/setup.html: two-step form (password → persona), emoji picker, mobile-friendly Updated: - auth_utils.py: create_invite(), validate_invite(), consume_invite() - manage_passwords.py: invite command with URL output - auth_middleware.py: /setup/* prefix is public (invite tokens need no auth) - routers/ui.py: /api/personas endpoint; post-login redirect if no personas - static/app.js: persona switcher dropdown with navigation + Add persona link - static/style.css: .persona-switcher, .persona-dropdown, mobile adjustments Mobile: login/setup pages are card-centered with responsive padding; dropdown avoids edge-clipping on narrow screens; logout button stays visible. All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 23:10:32 -04:00
Scott Idem	a9bbb668b5	feat: session auth + per-user/persona UI at /{user}/{persona} Replaces nginx basic auth with a proper per-user session system: - auth_utils.py: bcrypt password hashing, JWT cookie creation/decode - auth_middleware.py: validates JWT cookie on all routes except /login, /health, /static/, and webhook endpoints (/channels/, /webhook/) - routers/ui.py: GET /login, POST /login, POST /logout, GET /{username}/{persona} — serves index.html with CORTEX_CONFIG injected - static/login.html: minimal login form (dark theme, matches UI) - main.py: registers SessionAuthMiddleware + ui.router - config.py: jwt_secret, jwt_expire_days settings - manage_passwords.py: CLI tool to set/check/list user passwords - app.js: reads window.CORTEX_CONFIG (user + persona), sends both on every /chat and /orchestrate request; persona name shown in header; logout button (⏏) added to header - requirements.txt: bcrypt, PyJWT, python-multipart - .env.default: JWT_SECRET, JWT_EXPIRE_DAYS documented - tests: client fixture injects JWT cookie; security test assertions updated for URL-normalized path traversal paths (still secure, codes differ) All 80 tests pass. Setup for a new user: python manage_passwords.py set scott python manage_passwords.py set holly Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:54:12 -04:00
Scott Idem	77e770cdb2	feat: multi-user/multi-persona support with two-level home directory layout Restructures persona storage from a flat personas/{name}/ layout to home/{username}/persona/{name}/, mirroring Linux home directories. Changes: - persona.py: two ContextVars (user + persona), Linux-style name validation, set_context(), get_user(), get_persona(), validate(), list_users(), list_user_personas(); persona_path() takes (username, name) - config.py: replaces personas_dir with home_dir + home_root() - git mv personas/inara → home/scott/persona/inara (history preserved) - home/holly/persona/tina/: Holly's persona stub added - cron_runner.py: all storage functions take (username, persona) params - tools/cron.py: stamps user + persona on jobs; APScheduler IDs are {user}:{persona}:{job_id} to prevent collisions across users - memory_distiller.py: distill_short/mid/long take (username, persona); added missing Path + settings imports - scheduler.py: _load_user_crons() iterates home//persona/ (two-level) - routers/chat.py, orchestrator.py: user field added; set_context() called - tests/conftest.py: home_root fixture with two-level structure; patches home_dir instead of personas_dir - tests/test_persona.py: fully rewritten for two-level API - tests/test_api_files.py: updated fixture name and path - .env.default: documents HOME_DIR setting; scrubs stale API key - CLAUDE.md, README.md: directory maps updated for new layout All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:35:40 -04:00
Scott Idem	5cadb836fa	feat: multi-persona support (single Cortex, multiple users) - Add cortex/persona.py: ContextVar-based per-request routing with path traversal protection and persona validation - Migrate inara/ → personas/inara/ (git history preserved via git mv) - config.py: add personas_root(), inara_path() delegates to personas/inara - All 14 settings.inara_path() call sites replaced with persona_path() - ChatRequest + OrchestrateRequest: add persona field (default: "inara") with validation at request entry before any processing - memory_distiller: add optional persona param for future per-persona distill - cron_runner/tools/cron: stamp persona on jobs, prefix APScheduler IDs (persona:job_id) to prevent collisions across personas - scheduler: _load_user_crons() iterates all personas at startup Adding a new persona: create personas/<name>/ with IDENTITY.md + SOUL.md. Auth: handled at nginx level (inject X-Cortex-Persona header per subdomain). Future: persona maps to Aether account_id_random for full integration. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 21:50:02 -04:00
Scott Idem	1b32667872	feat: scratchpad tool + fix Claude auth token expiry warning - Add cortex/tools/scratch.py with scratch_read/write/append/clear tools - Register all four scratch tools in the orchestrator tool registry - Create inara/SCRATCH.md as the backing file (never distilled/archived) - Fix auth.py: expiresAt reflects short-lived access token (~8h) not the 1-year refresh token — suppress expiry warning when refreshToken is present Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 21:10:03 -04:00
Scott Idem	aaac3e1353	feat: persist orchestrator sessions + user service + docs update - Orchestrator now saves turns to session store so history survives page refresh - UI session_id updated from job result; history controls attached to agent turns - Cortex migrated from system service to systemd user service (no more sudo) - Update README.md and CLAUDE.md with correct service commands Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 23:08:38 -04:00
Scott Idem	97438f1a0f	feat: multi-instance support — agent_name and user_name configurable All hardcoded "Inara"/"Scott" strings replaced with settings.agent_name and settings.user_name, read from .env at startup: - config.py: AGENT_NAME and USER_NAME settings (defaults: Inara / Scott) - llm_client.py: conversation labels in prompt builder - session_logger.py: Name: labels in session log markdown - memory_distiller.py: distillation system prompts (mid + long) - routers/nextcloud_talk.py: @mention prefix strip - routers/google_chat.py: greeting message Second instance scaffolding: - holly/: identity directory with placeholder files (USER_NAME=Holly, AGENT_NAME to be chosen by Holly) - cortex/.env.holly: config for Holly's instance on port 8001 - cortex-holly.service: systemd unit for the second instance No behavioural change to the Inara/Scott instance — defaults unchanged. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 20:13:11 -04:00
Scott Idem	f935fc4a7f	feat: session delete + touch-friendly message controls Session delete: - DELETE /sessions/{session_id} endpoint (chat.py + session_store.py) - × button on each session item in the panel (hover-reveal on desktop) - Clears UI if the active session is deleted Touch accessibility: - @media (hover: none) rule makes msg-actions always visible on touch devices - msg-act-btn tap targets enlarged to 36px min-height, readable font size - session-delete-btn also always visible and finger-sized on touch Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 19:43:20 -04:00
Scott Idem	ed472ce9a0	feat: Intelligence Layer Phase 1 — orchestrator service Adds the Gemini API orchestrator (ReAct tool loop → Claude responder): Orchestrator engine + router: - orchestrator_engine.py: Gemini API tool loop, Claude CLI handoff - routers/orchestrator.py: POST /orchestrate (async job queue), GET /orchestrate/{job_id} Tools (cortex/tools/): - web.py: DuckDuckGo web search (no key required) - ae_knowledge.py: ae_journal_search + ae_journal_entry_create (AE V3 API) - ae_tasks.py: ae_task_list (reads agents_sync Kanban filesystem) - files.py: file_read (path-allowlisted to safe dirs) Config + deps: - config.py: orchestrator, DuckDuckGo, and AE API settings - requirements.txt: google-genai, duckduckgo-search - .env.default: reference config with all new keys documented Docs: - CLAUDE.md, README.md, documentation/ added to repo - Port references updated 7331 → 8000 throughout - Default model updated to gemini-2.5-flash Tested: ae_task_list, ae_journal_search, web_search all working end-to-end. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 19:37:49 -04:00
Scott Idem	fe6561bf6a	feat: add Gemini auth check to token warning banner /auth/status now returns per-backend status: Claude warns on <24h expiry, Gemini warns only when oauth_creds.json is missing or has no refresh_token (access token rotates automatically so expiry_date is not a useful signal). Banner shows warnings for both backends when needed, and the hint text names the specific CLI commands to run. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:29:25 -04:00
Scott Idem	48a6734ec3	feat: Claude CLI OAuth token expiry warning New GET /auth/status endpoint reads ~/.claude/.credentials.json and returns hours remaining + warning flag. UI shows a dismissible amber banner when < 24h remain, turning red if expired. Checked on page load and every 30 minutes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:06:28 -04:00
Scott Idem	4253e69c0b	Add auto memory distillation scheduler (APScheduler) - scheduler.py: AsyncIOScheduler with three cron jobs short daily 03:00 (no LLM, always fast) mid weekly Sun 03:30 (LLM) long monthly 1st 04:00 (LLM — off by default) - config.py: AUTO_DISTILL, AUTO_DISTILL_SHORT/MID/LONG .env flags - main.py: start/stop scheduler in FastAPI lifespan - routers/distill.py: GET /distill/status — next run times + config - requirements.txt: apscheduler>=3.10 - HELP.md: updated planned items, added /distill/status to API table Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 22:31:38 -04:00
Scott Idem	0ebfbc6590	Refactor UI into separate CSS/JS, add help modal and HELP.md - static/index.html: reduced to 127-line HTML shell - static/style.css: all styles extracted (~900 lines) + help modal styles + shared markdown rendering for file-preview and help-modal-body including tables (previously missing) - static/app.js: all JS extracted (~900 lines) + help modal fetch/render - index.html: adds ? help button + help modal HTML - inara/HELP.md: comprehensive reference doc covering all features, keyboard shortcuts, API endpoints, memory system, planned items - routers/files.py: HELP.md added to ALLOWED set - context_loader.py: HELP.md loaded at tier 2+ (after PROTOCOLS.md) so Inara can reference it when helping Scott with the interface Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:52:54 -04:00
Scott Idem	ce3c1f5f7f	Add tiered memory system with manual distillation - config.py: memory_budget_long/mid/short settings (overridable in .env) - memory_distiller.py: distill_short (no LLM), distill_mid, distill_long (LLM) - routers/distill.py: POST /distill/{short,mid,long,all} endpoints - context_loader.py: rewrote to load long→mid→short order with include_* toggles - routers/chat.py: ChatRequest gains include_long/mid/short fields - routers/files.py: MEMORY_LONG/MID/SHORT.md added to ALLOWED set - main.py: register distill router - static/index.html: context bar — tier selector, L/M/S memory toggles, distill buttons with status feedback; send includes tier + memory flags - inara/MEMORY_LONG.md: migrated from MEMORY.md + Cortex/Talk bot notes - inara/MEMORY_MID.md, MEMORY_SHORT.md: stubs ready for distillation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:22:32 -04:00
Scott Idem	3455c7a09c	Add SSE real-time Talk activity, file editor UI, and identity file API - event_bus.py: in-process asyncio pub/sub (one Queue per SSE client) - nextcloud_talk.py: publishes nct_message/nct_response events to bus - chat.py: GET /events SSE endpoint streams Talk activity to browser - routers/files.py: whitelist-protected GET/PUT for Inara identity .md files - main.py: register files router - static/index.html: real-time Talk feed, blue badge on Sessions btn, Files modal with preview/edit toggle and Ctrl+S save Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:10:07 -04:00
Scott Idem	fe854ee534	Add Nextcloud Talk bot integration (Inara) - New routers/nextcloud_talk.py: webhook handler verifies incoming HMAC, calls LLM via BackgroundTasks, posts reply with correctly computed signature (random + message_text, not raw body) - llm_client.py: read Claude OAuth token live from ~/.claude/.credentials.json to avoid stale systemd env tokens; strip conflicting ANTHROPIC_API_KEY - config.py: add nextcloud_url, nextcloud_talk_bot_secret, nextcloud_talk_timeout settings - main.py: register nextcloud_talk router, add logging setup - docs/NEXTCLOUD_TALK_BOT.md: installation guide + HMAC signing gotcha Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-16 23:04:26 -04:00
Scott Idem	8add4ffd02	Add edit/delete history, named sessions, scroll fix, systemd service - Edit/delete individual messages from session context with inline editing (Ctrl+Enter saves, Escape cancels); changes sync to backend via PUT /history - PUT /history/{session_id} endpoint to replace full message list - Named sessions: readable slugs (e.g. quiet-spring) instead of UUID fragments - Scroll no longer snaps to bottom when user has scrolled up to read history - cortex.service: systemd unit for auto-start and restart-on-failure Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-10 23:38:39 -04:00
Scott Idem	2f675ee4bf	Initial commit — Cortex API + Inara identity Cortex: FastAPI backend serving Inara via Claude/Gemini CLI backends. Includes SSE streaming chat, session persistence, Google Chat webhook handler, and Docker support. Inara: Identity files (persona, soul, protocols, memory, context tiers) mounted read-only into the container at runtime. Features in initial cut: - /chat endpoint with SSE keepalive + LLM fallback - Session store with rolling history window - Markdown rendering, copy-to-clipboard, links open in new tab - Stacked right-column input controls (height selector, enter toggle, note mode with public/private) — semi-hidden until textarea grows - /note endpoint for injecting public context into session history - Docker Compose config (local dev runs natively; Docker for server) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 03:41:00 -05:00

49 Commits