Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	348ca120c1	feat: full channels.json UI + http_allowlist settings Notifications page: - NC Talk section expanded: url, bot_secret, notification_room, nc_username, nc_app_password — all fields from channels.json now editable - Per-channel sections use <details>/<summary> collapsibles; auto-open when values are present - Secrets use type=password with "leave blank to keep" semantics - Google Chat outbound webhook in its own collapsible section Account settings: - HTTP POST Allowlist section added (same textarea pattern as email allowlist) - POST /settings/http-allowlist route saves home/{user}/http_allowlist.json - Example placeholder shows ha.dgrzone.com and n8n patterns Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-09 13:57:18 -04:00
Scott Idem	19475610be	feat: move Notifications to its own settings sub-page Adds GET /settings/notifications (dedicated page with channel form + two test buttons) and updates POST /settings/notifications to render that page. Settings page now shows a compact link card instead of the full form. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 23:43:52 -04:00
Scott Idem	3c7ecf4e4f	feat: notification test endpoints — POST /api/push/test and /api/push/reminders/check - POST /api/push/test: sends "Test notification from Cortex" via the user's configured notification channel (web_push / NCT / email / etc.) - POST /api/push/reminders/check: runs the daily reminder check immediately for the current user, returns reminders_found count Both require an active session cookie. Useful for verifying channel setup without waiting for the 09:00 scheduler job. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 23:34:58 -04:00
Scott Idem	64020ad982	feat: proactive notifications — web_push channel + daily reminder check Routes web_push through notification.py alongside NCT/email/Google Chat, and fires daily reminder summaries via the scheduler. - notification.py: _notify_web_push() + "web_push" case in notify(); all four channels (web_push/email/nextcloud/google_chat) now routable - scheduler.py: _run_reminder_check() daily at 09:00 — reads due reminders per persona via set_context(), formats up to 3 entries, calls notify() - routers/settings.py: "web_push" added to valid notification_channel values - static/settings.html: "Browser Push Notification" option in channel selector - TODO__Agents.md: proactive notifications section marked complete Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 23:28:49 -04:00
Scott Idem	47d23a7b2f	feat: per-model max_rounds for Gemini orchestrator engine Mirrors the pattern already in openai_orchestrator.py. The Gemini engine was still hardcoded to the global orchestrator_max_rounds setting. - orchestrator_engine.py: max_rounds param on run() and _run_from_contents(); effective_limit = min(per_model_limit, global_limit); stored in checkpoint so resume() respects it across confirmation gates - routers/orchestrator.py: passes orch_model.get("max_rounds") to run() - tools/agents.py: passes model_cfg.get("max_rounds") for gemini_api spawns Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 22:54:37 -04:00
Scott Idem	09d775b47b	feat: spawn_agent tool + host max_concurrent + docs Adds a synchronous sub-agent spawning tool that lets the orchestrator delegate tasks to a specific role's model and tool set. - cortex/tools/agents.py: spawn_agent(task, role, tier, timeout, max_rounds) - Supports local_openai and gemini_api model types - Per-host asyncio semaphore (keyed by host_id or model type) - asyncio.wait_for() enforces timeout; admin-only tool - cortex/model_registry.py: max_concurrent field in host schema (default 3, clamped 1-20); backfilled on _normalize() for existing hosts - cortex/routers/local_llm.py + local_llm.html: "Max parallel" number input in host add/edit forms - cortex/tools/__init__.py: spawn_agent registered in TOOL_CATEGORIES["Agents"], _CALLABLES, TOOL_ROLES (admin), and _ALL_DECLARATIONS - Docs: TOOLS.md count 44→45, spawn_agent section; HELP.md tool table updated; ARCH__FUTURE.md Round 2 completed items; TODO__Agents.md spawn_agent checked; CLAUDE.md tool count and list updated Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 22:48:21 -04:00
Scott Idem	6ad7597db8	feat: per-role inject_datetime toggle for system prompt Each role can now disable the current date/time header injected into the system prompt. Default is true (all existing roles unchanged). Useful for pure processing roles (summarizer, classifier, translator) where temporal context is irrelevant or could cause unexpected model behavior. Changes: - model_registry: set_role_config/get_role_config gain inject_datetime field - context_loader: load_context gains inject_datetime param (default True) - orchestrator router: passes inject_datetime from role_cfg to load_context - local_llm router: reads inject_datetime from POST body, passes to registry; role_config_data_js includes the field - local_llm.html: checkbox in role config panel; populate on open, save on submit Session logs still timestamp every turn (HH:MM header in YYYY-MM-DD.md files) regardless of this setting — the toggle only affects the system prompt header. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:53:35 -04:00
Scott Idem	f8f7cd75da	feat: audit log, usage tracking UI, OpenAI orchestrator compaction, onboarding + docs Tool audit log: - Every orchestrator tool call logged to home/{user}/tool_audit/YYYY-MM-DD.jsonl - Files panel sidebar: audit log group (collapsed), date-linked read-only table - Admin endpoints: /api/audit/files, /api/audit/day, /api/audit/recent, /api/audit/stats - Engine and model name recorded per entry OpenAI orchestrator improvements: - Context budget enforcement: 75% of model context_k (min 16k) - Message compaction: truncates old tool results when approaching budget - max_rounds respected per model config (intersected with server cap) OpenRouter onboarding (setup.html, onboarding.py, app.js, settings.html): - Step 3 of 3: /setup/model with curated model picker - Chat banner for users on server-default model (informational, not alarmist) - Settings quick-link card; /setup/model works standalone for existing users Model registry + session store: - set_role_config / get_role_config for per-role tool lists and system_append - session_store: session rename, session name backfill endpoint UI updates (app.js, index.html, style.css, local_llm.html): - Role toggle in context panel - Off-the-record mode - Agent notes read-only viewer - OPERATIONS.md loaded at T2+ in context Documentation: - HELP.md: full tool table, per-role tool sets, Agent Notes, usage tracking - TOOLS.md: Agent Notes section, count corrected to 44 - ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md updated to match reality - CLAUDE.md: onboarding flow, documentation philosophy sections - README.md: stack in practice, DeepSeek TUI mention, architecture diagram updated - TODO__Agents.md: onboarding task completed with deviation notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:26:43 -04:00
Scott Idem	c02d2462b0	feat: agent notes, OpenRouter onboarding, usage tracking, per-role tools docs Agent notes tool (cortex/tools/agent_notes.py): - Private durable notepad for the orchestrator — not user-visible - agent_notes_read/write/append/clear with 3 rolling backups - Per-persona isolation via ContextVars; no TOOL_ROLES gating needed - PROTOCOLS.md updated to make this a core proactive tool OpenRouter guided onboarding: - Setup Step 3 (/setup/model) — OpenRouter quick-connect with curated model list - Amber banner in chat for users on server-default model - Settings quick-link card (/settings/models OpenRouter section) - POST /setup/model/skip for users who want to bypass Step 3 - Holly pre-configured: DeepSeek V4 Flash (OpenRouter) → Gemma Medium (local) → claude_cli Usage tracking: - cortex/routers/usage.py — GET /api/usage, /api/usage/summary, /api/usage/all (admin) Documentation: - HELP.md: Tools section rewritten — full tool table by category, per-role tool sets explained - TOOLS.md: Agent Notes section added; count corrected to 44 - ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md, CLAUDE.md, README.md updated - TODO__Agents.md: onboarding task checked off with deviation notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:25:31 -04:00
Scott Idem	02accefe8f	feat: audit log in Files panel sidebar Adds an "Audit Log" section (collapsed by default) at the bottom of the Files panel showing tool_audit/YYYY-MM-DD.jsonl files for the current user. - GET /api/audit/files — lists available dates (newest first, any auth user) - GET /api/audit/day — returns entries for one date as JSON (any auth user) - tool_audit.read_day() — reads a single day's JSONL file chronologically - Clicking a date renders a read-only table: time / tool / status / args / result - Status cells are colour-coded (green ok, red error, amber denied) - Edit/Raw/Preview/Save buttons are hidden in audit view, restored on file switch - Audit group starts collapsed; expands on click like other file groups Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 20:36:08 -04:00
Scott Idem	584ae679a6	feat: tool call audit log Every orchestrator tool invocation is recorded to home/{user}/tool_audit/YYYY-MM-DD.jsonl. Each entry captures: timestamp, user, tool, args (truncated), status (ok/error/denied), result length, and a 300-char result snippet. - tool_audit.py: JSONL writer with per-file asyncio locks; read_recent / read_recent_all_users helpers - tools/__init__.py: hook in call_tool() — fire-and-forget record on every dispatch - routers/audit.py: GET /api/audit/recent and /api/audit/stats (admin-only) - tools/files.py: add home_root() to file_read allowed roots so agents can read audit JSONL Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:55:59 -04:00
Scott Idem	ddf44a2aee	feat: web push notifications (VAPID) - push_utils.py: subscription storage + send helper (auto-prunes 410 endpoints) - routers/push.py: GET /api/push/vapid-key (public), POST/DELETE /api/push/subscribe - sw.js: push event listener shows notification; notificationclick focuses/opens tab - app.js: subscribe/unsubscribe flow + "Enable notifications" toggle in settings dropdown - tools/notify.py: web_push orchestrator tool (user-level, no admin required) - VAPID keys in .env; pywebpush added to requirements.txt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:38:58 -04:00
Scott Idem	0b96772fa6	fix: show session friendly name in resume message and status bar /history/{session_id} now returns a 'name' field alongside messages. resumeSession() uses data.name first, then the sessionNames map, then raw ID as fallback — so named sessions display correctly even on page load before the sessions panel has been opened. 'Resumed session X' message also now shows the friendly name. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:14:59 -04:00
Scott Idem	508fb638ad	feat: distill safeguards — rolling backups + sanity checks Before any memory file is overwritten, _rotate_backup() keeps 2 rolling backups: MEMORY_.bak1.md (most recent) and MEMORY_.bak2.md (older). _sanity_check() now also guards against size anomalies: the new content must be between 40% and 250% of the old file size — anything outside that range looks like truncation or runaway output and aborts the write. Existing checks (min length, refusal phrases) still apply. Backup files exposed in the Files panel (ALLOWED set) so they can be reviewed and manually restored if needed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 18:54:27 -04:00
Scott Idem	0ffcd57c95	fix: multi-user distillation + datetime in context + session log labels Distillation was silently operating on scott/inara for all users due to ContextVar defaults. All three distill endpoints now require ?user=&persona= query params and validate them via persona.validate(). Memory distiller signatures changed from Optional to required positional args — no more global settings fallback. Scheduler now iterates all users/personas instead of hardcoding the primary user. - context_loader: inject current date/time as first system prompt section - session_logger: use get_user()/get_persona() from context instead of settings globals so Holly/Brian sessions show correct speaker labels - memory_distiller: system prompts now reference u.title()/p.title() instead of settings.user_name/settings.agent_name - distill router: Query(...) enforces params; _resolve() validates persona - scheduler: _all_personas() helper iterates every user/persona for distill - app.js: runDistill() now appends ?user=&persona= via _fileParams Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 18:44:51 -04:00
Scott Idem	eab92d876d	refactor: split tool declarations into domain files + role config UI tools/__init__.py shrinks from 1,137 → 250 lines. Each domain file now owns both its callables and its FunctionDeclarations (DECLARATIONS list), so adding a new tool only touches one file. New TOOL_CATEGORIES dict exported from __init__ — used by the UI for grouped tool checkboxes. Role config UI (Settings → Model Registry → Role Assignments): - ⚙ button per role expands an inline configure panel - Textarea for system_append (injected into system prompt for this role) - Grouped checkboxes for tool allow-list (all checked = no restriction) - POST /api/models/role-config saves both fields; updates ROLE_CONFIG_DATA in-page so re-open reflects current state without a page reload Backend: - model_registry.set_role_config() writes system_append + tools to registry - TOOL_CATEGORIES exported from tools/__init__ for UI rendering - TOOLS.md header updated: 30 → 39 tools (ae_journal_* and cortex_* additions) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 20:40:50 -04:00
Scott Idem	49123cdd5c	feat: per-role tool lists and system prompt overlays Each role in model_registry.json can now carry two optional keys: system_append — injected into the system prompt at position 7 (after memory, closest to the turn) for the active chat_role tools — explicit tool allow-list; intersected with the user's access-level filter so it can only restrict, never elevate No changes needed for existing users — missing keys fall back to current behavior. Add keys to a role to give it a specialty focus: "coder": { "primary": "claude_cli", "system_append": "You are in code-specialist mode...", "tools": ["web_search", "file_read", "shell_exec", "scratch_write"] } Changes: - model_registry.py: get_role_config() returns system_append + tools - context_loader.py: role_append param appended as "--- Role Context ---" - tools/__init__.py: get_tools_for_role/get_openai_tools_for_role accept optional tool_list and intersect with access-level filter - orchestrator_engine.py: tool_list threaded through run/resume/checkpoint - openai_orchestrator.py: tool_list threaded through run/resume/checkpoint; _build_client now calls get_openai_tools_for_role instead of returning unfiltered OPENAI_TOOL_SCHEMAS - routers/orchestrator.py: pulls role_cfg for chat_role, passes both role_append and tool_list to context loader and engine Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 20:00:38 -04:00
Scott Idem	6405dd338d	feat: proper confirmation-resume flow + per-user tool policy Fixes the broken confirmation gate where users had no way to approve or deny a blocked tool call in the web UI. Changes: - orchestrator_engine.py: add OrchestrateCheckpoint dataclass, extract loop into _run_from_contents(), add resume() function - openai_orchestrator.py: same treatment — _run_from_messages(), resume() - routers/orchestrator.py: POST /{job_id}/confirm and /deny endpoints, separate _checkpoints store, _resume_job() + _finalize_job() helpers, "awaiting_confirmation" job status with pending_confirmation payload - auth_utils.py: get_tool_policy() and save_tool_policy() helpers reading home/{user}/tool_policy.json (allow/deny lists) - routers/orchestrator.py: loads tool_policy per user and passes confirm_allow/confirm_deny to both engines - app.js: poll loop handles awaiting_confirmation — shows Confirm/Deny buttons inline, resumes polling after user action - settings.html + settings.py: Tool Permissions section with allow/deny textareas, POST /settings/tool-policy route - style.css: .confirm-gate, .confirm-btn, .deny-btn styles Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 19:14:53 -04:00
Scott Idem	bce7de647c	feat: proactive notifications — email, NC Talk, Google Chat per user notification.py now handles all three outbound channels. Email defaults to the user's login address (google_email from auth.json); an optional override can be set in channels.json. Google Chat uses an incoming webhook URL. NC Talk was already wired, just needs notification_room set. Settings page gains a Notifications section: channel dropdown, optional email override, NC room token, and Google Chat webhook URL. All stored in per-user channels.json. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 22:32:22 -04:00
Scott Idem	db3dd465b2	feat: email allowlist management in Settings + Files panel Settings page gets an editable textarea (POST /settings/email-allowlist) so users can view and update their per-user regex allowlist without touching the raw JSON file. Files panel gains a "Settings" group containing email_allowlist.json as a raw JSON editor backup — served from home/{user}/ via files.py USER_FILES. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 21:56:45 -04:00
Scott Idem	a5658eb3c4	feat: edit existing model entries in the Model Registry - Inline edit form per model row (label, model name/ID, host/account, context, tags) - Fetch models button in edit form for local models — same live-picker UX as Add Model - POST /settings/local/models/{id}/edit route in local_llm.py - Admin role badge (ADMIN/USER pill) in Account Settings page - HELP.md updated: new tools table with admin/confirm markers, PWA install section - TODO updated: tool expansions marked done, distill review and Unsloth resolved, role-based access and admin badge added to completed Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 21:08:09 -04:00
Scott Idem	334e7f0dea	feat: role-based tool access, confirmation gates, and new orchestrator tools - auth_utils: get_user_role() reads role from auth.json (admin\|user, default user) - manage_passwords: new `role` command to promote/demote users (admin-only by convention) - tools/__init__: TOOL_ROLES map, CONFIRM_REQUIRED set, get_tools_for_role(), get_openai_tools_for_role() — both orchestrators now filter tools by caller's role - tools/system: cortex_restart (detached subprocess, 5s delay), cortex_logs (admin-only) - tools/web: http_fetch — direct URL fetch, distinct from web_search - tools/files: file_list (directory listing), file_write (restricted paths, admin-only) - tools/notify: nc_talk_send — proactive outbound via notification.py - orchestrator_engine + openai_orchestrator: user_role param; CONFIRM_REQUIRED tools return a confirmation-request result instead of executing — loop breaks after Claude asks user to confirm in a follow-up message - home/scott/auth.json: role set to admin Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 19:23:53 -04:00
Scott Idem	25182a1765	feat: PWA support — manifest, service worker, icons, public auth exemption Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 18:46:33 -04:00
Scott Idem	66cb197de0	feat: last-used persona cookie, emoji dropdown, theme support, auth status move - cx_last_persona cookie set on serve_ui; root/login/help/settings redirects use preferred persona from cookie instead of alphabetically first - /api/personas returns [{name, emoji}] objects; persona switcher dropdown renders emoji + name with flex layout and .pd-emoji span - Help, Settings, Model Registry pages apply localStorage theme on load (no flash); CSS variables for dark/light replacing all hardcoded hex values - Claude CLI auth status moved from prominent chat banner to Anthropic provider block in Model Registry — live dot indicator (ok/warn/err) - Auth banner removed from main chat UI (index.html, app.js, style.css) - Add Model collapsed into Models section as <details> to shorten page - Light-mode overrides for provider icons, model badges, ctx-badge, tags (Anthropic/Google/local colors now readable in both themes) - Help page gains table, pre/code, hr styles for HELP.md rendered content Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 22:52:34 -04:00
Scott Idem	2b9dd53566	feat: replace Agent mode with independent Tools toggle - Remove 'agent' from mode dropdown; Chat/Note/OTR remain - Add ⚡ tools toggle button in input bar (persisted in localStorage) When on: routes to POST /orchestrate (Gemini tool loop); send btn → "Run" When off: routes to POST /chat (direct to active role); no change - Role selector and tools toggle are now fully independent: active chat_role sent in orchestrate payload → used for final response - orchestrator_engine.run() accepts response_role param; passes it to complete(role=...) instead of hardcoded model="claude" - OrchestrateRequest gains chat_role field (default "chat") - Migrate stored 'agent' mode/MRU entries to 'chat' on load Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 20:36:15 -04:00
Scott Idem	1cc7988953	feat: add shell_exec tool and fix orchestrator model name resolution - Add shell_exec to orchestrator tool suite (system.py + __init__.py) Runs arbitrary shell commands on the Cortex host with timeout (1–120s), combined stdout/stderr output, optional working_dir, and exit code reporting. Enables system diagnostics (df, ls, ps, journalctl, etc.) from Agent mode. - Fix orchestrator_engine.run() to use model_name from resolved registry entry Previously used settings.orchestrator_model (.env hardcode) regardless of what model was assigned to the orchestrator role. Now accepts model_name param and falls back to settings value only when registry has no model_name. - Update ARCH__FUTURE.md: date, running host, local orchestrator status, model registry V2 progress, added Cortex Mesh concept (section 9) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 20:29:46 -04:00
Scott Idem	8baab874f1	feat: replace backend/slot toggle with role selector The backend toggle now cycles through configured roles (chat, coder, research, distill, etc.) instead of backup model slots within the chat role. Each role uses its own primary→backup chain from the registry. - ChatRequest.slot replaced by chat_role (default "chat") - GET /backend returns available_roles instead of chat_models - _available_roles_for_toggle() builds list from defined_roles, excluding orchestrator (which has its own Agent mode) - Model label on responses now reflects the actual role's assigned model - Toggle is inert when only one role is configured (avoids useless cycling) - Add "Clear browser cache" button to Account Settings (Connected Accounts) - Add _role_model_label() helper for cleaner response tag labeling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 19:23:18 -04:00
Scott Idem	962d58d2e2	feat: model registry Phase 3 — slot-based backend toggle Backend toggle now cycles through chat role models by label instead of cycling service type strings (auto/claude/gemini/local). - model_registry: get_model_for_slot() — resolves a specific priority slot without walking the fallback chain - llm_client: complete() gains slot param; explicit slot selection dispatches directly to that model with no silent fallback - routers/chat.py: ChatRequest.slot; GET /backend returns chat_models [{slot, label, type}] for the UI; _stream_chat uses resolved model label for the response tag when a slot is pinned - app.js: toggle loads chat_models from /backend, cycles by label, sends slot in chat payload; legacy model field removed from payload - app.js: fix Gap B — agent mode placeholder no longer says "Gemini tool loop"; now says "orchestrator" - DESIGN doc: updated to reflect phases 1+2 complete, catalog-as-code decision, Gap A/B documented, Phase 3 implementation details Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 21:43:08 -04:00
Scott Idem	6e56024815	fix: settings page and help docs updated for model registry V2 settings.html: - Remove Gemini API Key section (keys now managed in Model Registry) - Rename "Local Models" → "Model Registry" with updated description covering all providers (Anthropic, Google, local hosts) - Update button text: "Manage local models" → "Manage models" settings.py: remove dead gemini_key template variable lookups HELP.md: - Fix navigation path: ☰ → Account → Model Registry → Manage models - Restructure Model Registry section as ordered steps (1: providers/hosts, 2: add models, 3: assign roles) so dependency order is clear - Add explicit note that accounts/hosts must exist before adding models Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 21:07:05 -04:00
Scott Idem	f08b033d6c	feat: model registry Phase 2 — cloud provider UI (Anthropic + Google) Adds cloud provider management to /settings/models: - Google Accounts section: add/remove Gemini API keys with labels - Add Model form: provider tabs (Local / Google / Anthropic) with catalog dropdowns that auto-fill label and context_k - Provider badges on model rows (Anthropic / Google / Local) - /settings/local now redirects to /settings/models (canonical URL) - save_cloud_model() in model_registry for Anthropic/Google entries - Distill role migration restored in _migrate_from_local_llm - Test fixes: version assertions updated to V2 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 20:41:06 -04:00
Scott Idem	45c95d20ba	feat: model registry V2 — provider-aware schema with multi-account support Adds a providers section to the per-user model registry for Anthropic and Google as first-class providers alongside local hosts. Google accounts (API keys) are now stored as a list so multiple Google accounts can coexist. Changes: - model_registry.py: V2 schema, auto migration V1→V2 (pulls gemini_api_key from auth.json into providers.google.accounts), _resolve_model() merges account API key for gemini_api type models - routers/orchestrator.py: uses model-resolved api_key when orchestrator role resolves to a gemini_api model with account_id - ANTHROPIC_CATALOG and GOOGLE_CATALOG constants for model picker (Phase 2) - New functions: get_google_api_key(), save/remove_google_account(), get_catalog() - Documentation: ARCH__BACKENDS.md updated to V2 schema, DESIGN doc added Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 20:21:04 -04:00
Scott Idem	3b3456600a	feat: store and display backend + host metadata on chat messages Each assistant message in the session JSON now carries: backend, backend_label, host (platform.node()) These fields are shown as model tags in the UI — on live responses and when loading session history. Session log entries (sessions/YYYY-MM-DD.md) include the backend label and host in the turn header. The local (OpenAI-compat) backend strips non-standard fields before sending messages to the API so extra fields don't leak upstream. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 22:16:48 -04:00
Scott Idem	d9a322164a	feat: OpenAI-compatible orchestrator + backend auto-routing - openai_orchestrator.py — new ReAct tool loop engine for any OpenAI-compatible endpoint (OpenRouter, Open WebUI, Ollama, LiteLLM); model handles both tool loop and final response, no Claude handoff needed - tools/__init__.py — auto-derive OpenAI JSON Schema from existing Gemini FunctionDeclarations so tool definitions have a single source of truth - routers/orchestrator.py — route to openai_orchestrator when model registry "orchestrator" role resolves to a local_openai type host - routers/chat.py — pass role to _backend_label(); fix fallback_used logic (only meaningful for explicit backend overrides, not auto-routing) - static/app.js — add null/"auto" to backend cycle; fetch local model hint without overriding the auto default on page load - model_registry.py — _normalize() back-fills host_type on old registry files - requirements.txt — add openai>=1.0.0 - ARCH__BACKENDS.md — document OpenAI-compat backend and routing logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:18:18 -04:00
Scott Idem	a6e404c143	feat: host_type field for OpenRouter / OpenAI-compatible API support Adds host_type ("openwebui" \| "openai") to the host schema so Cortex can talk to both Open WebUI/Ollama and OpenRouter/standard-OpenAI endpoints. Path differences per type: openwebui (default): /api/chat/completions, /api/models openai: /chat/completions, /models model_registry.py: - host_type added to host schema (default "openwebui", backward compat) - save_host() accepts host_type parameter - _resolve_model() passes host_type through with the merged host fields llm_client._local(): - Reads host_type from resolved model_cfg - Selects correct chat completions path accordingly routers/local_llm.py: - save_host route accepts host_type form field - fetch-models uses /models for openai type, /api/models for openwebui - Existing host rows show type selector pre-filled from stored value local_llm.html: - "Add host" form includes type selector To use OpenRouter: - Add host: URL = https://openrouter.ai/api/v1, Type = OpenAI-compatible - API key from openrouter.ai (store in .env or model_registry.json only) - Fetch models or add manually (e.g. anthropic/claude-sonnet-4-5-20251022) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-06 21:11:22 -04:00
Scott Idem	8570e8d852	fix: backend toggle not sent to server; add per-message model tag Fixes: - app.js was tracking primaryBackend locally but never included model: primaryBackend in the /chat POST body, so the server always used settings.primary_backend regardless of what the user clicked. Now model: primaryBackend is sent on every chat request. - Responses were only annotated when fallback occurred. Now every assistant message shows a small model tag at the bottom right. chat.py: - _backend_label() resolves human-readable name: claude → "Claude", gemini → "Gemini", local → registry label (e.g. "Gemma 4 E4B") or model_name - SSE payload now includes backend_label field app.js: - model: primaryBackend added to /chat fetch body - After every response, appends .model-tag div with backend_label - Fallback shows "⚡ fallback → <label>" in amber; normal is muted - Removed separate system message for fallback (tag covers it) style.css: - .model-tag: small muted text, right-aligned, separated by thin line - .model-tag.fallback: amber (#f59e0b) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 22:10:40 -04:00
Scott Idem	608e1de246	feat: model registry UI — hosts, models, role assignments Replaces the single-host local model settings page with a full model registry interface at /settings/local. Hosts section: - List existing hosts with inline edit + save + remove - Collapsible "Add host" form - Per-host "Fetch models" button Models section: - List all models with label, model name, host, context_k badge, tags - Remove button Add Model section: - Host dropdown, label, model name, context_k, tags (comma-separated) - "Fetch models from host" with auto-fill picker Role Assignments section: - One row per defined role (chat, orchestrator, distill, coder, research) - Primary + backup_1 + backup_2 dropdowns per role - Dropdowns pre-filled from registry on load - AJAX save on change (POST /api/models/role) with toast confirmation - Built-in models (claude_cli, gemini_cli, gemini_api) always available in dropdowns Backend: - All user_settings references replaced with model_registry - host/{id}/remove route added - fetch-models now accepts host_id query param - POST /api/models/role for AJAX role assignment Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:31:32 -04:00
Scott Idem	6a1a1c2686	feat: unified model registry with role-based routing Introduces model_registry.py as the single source of truth for all LLM backend configuration. Replaces scattered backend settings across user_settings, config distill_backend_, and the UI toggle. model_registry.py: - Per-user home/{user}/model_registry.json with version, hosts, models, roles - Models have: type (local_openai\|claude_cli\|gemini_cli\|gemini_api), label, model_name, host_id, context_k (tokens), tags (capability labels) - Roles map to priority chains: primary, backup_1..backup_4 - Built-in IDs (claude_cli, gemini_cli, gemini_api) always resolvable - Auto-migrates existing local_llm.json on first access - CRUD: save_host, remove_host, save_model, remove_model, set_role - get_model_for_role(): registry → .env default → hardcoded fallback config.py: - role_chat/orchestrator/distill/coder/research .env defaults - defined_roles: comma-separated standard role list (extensible) - get_defined_roles() and get_role_default() helper methods llm_client.complete(): - New role= parameter (default "chat") for registry-based routing - model= still accepted for explicit UI toggle override - _claude() and _local() accept model_cfg dict instead of raw string - _local() uses pre-resolved config from registry memory_distiller.py: - distill_mid/long now use role="distill" (no more distill_backend_ .env vars needed) cron_runner.py: - brief jobs use role="chat" routers/chat.py + auth.py: - Use model_registry instead of user_settings for local model info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:25:18 -04:00
Scott Idem	a4daebdc9b	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 20:53:06 -04:00
Scott Idem	bd6532e93a	feat: shared nav bar on Help and Settings pages Replaces the lone "← Back to Cortex" link with a consistent page-nav on both pages: ← Chat \| Help \| Settings \| Sign out Active page is highlighted purple; others are muted gray. Settings page gets a {{ help_href }} template var from settings.py. Help page builds nav links from the existing cfg JS object. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 22:09:08 -04:00
Scott Idem	662924c6a1	fix: pass user to _run_job so get_user_gemini_key resolves correctly NameError: name 'user' is not defined in orchestrator._run_job — user was resolved in the endpoint but not forwarded to the background task. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 21:21:14 -04:00
Scott Idem	93f7f44e51	feat: per-user channel config for Google Chat and Nextcloud Talk - New endpoints: POST /channels/google-chat/{username} and /webhook/nextcloud/{username} - Channel secrets/config live in home/{username}/channels.json (gitignored) - auth_utils: get_user_channels() helper reads channels.json - Both routers load persona, audience/secret, backend, timeout per user; set_context() wires the correct persona before building the system prompt - Removed server-level channel settings from config.py and .env — no user gets a channel until they create their own channels.json - .gitignore: home/**/channels.json added To migrate: update Google Chat Add-on webhook URL to /channels/google-chat/{username} and re-register NC Talk bot at /webhook/nextcloud/{username} Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 13:02:45 -04:00
Scott Idem	7438031797	feat: connected accounts + Gemini API key in account settings UI Settings page gains two new sections: - Connected Accounts: shows linked Google email (read-only) - Gemini API Key: paste personal key from aistudio.google.com, shows masked hint of saved key, remove link to revert to server key POST /settings/gemini-key saves/clears gemini_api_key in auth.json. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:16:37 -04:00
Scott Idem	8aec6aafcc	feat: Google OAuth sign-in + per-user Gemini API key Users with Google accounts can now sign in without a password. Auth flow: - GET /auth/google → Google consent page (CSRF state cookie) - GET /auth/google/callback → exchange code, lookup user, set JWT - auth.json gains google_sub + google_email fields - set_password() no longer overwrites unrelated auth.json fields Admin setup: python manage_passwords.py google-add <username> <email> # add GOOGLE_CLIENT_ID + GOOGLE_CLIENT_SECRET to .env Per-user Gemini key: - get_user_gemini_key() reads gemini_api_key from auth.json - orchestrator_engine.run() accepts gemini_api_key param - orchestrator router passes user's key, falls back to server key login.html: "Sign in with Google" button above the password form. manage_passwords.py list: now shows auth method columns (pw / google). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:01:52 -04:00
Scott Idem	62fde62653	feat: persona-specific favicon + fix favicon.ico 404 app.js updates the <link rel="icon"> to the active persona's emoji on load (CORTEX_EMOJI is already injected server-side). /favicon.ico route added as a fallback for login/settings/help pages that don't have persona context. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:45:36 -04:00
Scott Idem	826bd6cfe3	feat: /{username} persona picker landing page Visiting /scott (or any user root) now shows a clean card page listing all their personas with emoji + name, each linking to /{user}/{persona}. Previously the route was unhandled (404 or wildcard match). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:19:04 -04:00
Scott Idem	c3507f8e11	fix: help page back link preserves active persona Pass ?persona= query param on the help link so the server knows which persona to return to. Previously always defaulted to personas[0], causing navigation back to the wrong persona. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 23:13:52 -04:00
Scott Idem	fa04b5e6b0	feat: off the record mode (OTR) Adds a third input mode toggle alongside Note and Agent. When active: - Textarea gets a subtle purple tint with dashed border - OTR button highlights purple - Placeholder reads "Off the record — not logged or distilled…" - off_record=True is sent to /chat; session_logger is skipped - In-memory session context is preserved within the session Switching to Note or Agent mode deactivates OTR, and vice versa. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 21:07:21 -04:00
Scott Idem	8487645224	fix: claude auth expiry warning — correct field name and smarter threshold - Fix 'undefined' in auth banner: read access_token_hours_remaining (not hours_remaining) - Fix false-positive warning on fresh tokens: when refresh token present, only warn within 1 hour of expiry (not 24h) since the CLI should auto-rotate but sometimes misses - Emit claude_auth_expired SSE event on 401 so UI shows inline red banner immediately - app.js: handle claude_auth_expired SSE event with persistent top banner + dismiss button Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 23:22:18 -04:00
Scott Idem	0cf0d65e9e	feat: session naming, username/persona rename, help page, contrast fixes - Session name field: PATCH /sessions/{id} endpoint, inline rename button in UI - Persona rename: inline ✏ toggle form in settings, POST /settings/persona/rename - Username rename: inline form in settings, POST /settings/username (renames home dir, forces re-login) - Help page: dedicated /help route replacing modal, collapsible sections - Per-persona isolation: files.py and session_store.py now scope to correct user/persona - Contrast/visibility: muted text bumped to slate-400+, session rename btn at 0.4 opacity Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 23:10:12 -04:00
Scott Idem	1b425a539f	feat: account settings page + dedicated help page - Add /settings page with password change form and personas list - Add /help dedicated page (replaces help modal); renders HELP.md with collapsible sections, dark theme, back link to active persona - Add 👤 account button and convert ? button to link in header - Remove help modal HTML and ~55 lines of modal JS from main app - Register settings and help routers in main.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 21:41:18 -04:00

1 2

70 Commits