Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	3716e5974f	feat: Phase 3 model toggle — cycle chat-role slot models in UI Replaces the role-cycle toggle with a slot model toggle in the Context & Memory panel. The active model label is shown on the button; clicking cycles through Primary → Backup 1 → Backup 2 slots configured for the Chat role. - app.js: remove activeRole()/availableRoles role-cycling; add activeChatModel()/chatModels slot cycling; update send/orchestrate payloads to send slot + chat_role:"chat"; fix updateSendBtnTitle and startRunTimer to use activeChatModel() - chat.py: add slot field to ChatRequest; pass slot= to complete(); resolve backend_label from slot config; add _chat_slot_models() helper; include chat_models in GET /backend response - HELP.md: update Model toggle description, tool count (62/16), Backends section, API chat payload example Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-12 21:32:43 -04:00
Scott Idem	85792a7bcf	feat: per-role inject_mode, OTR fixes, hover metadata, send/stop tooltip - inject_mode: per-role toggle (parallel to inject_datetime) gates the "Current mode: Off The Record" line in the system prompt; wired through model_registry, context_loader, chat router, orchestrator router, and local_llm settings UI - OTR orchestrator fix: OrchestrateRequest now carries off_record; _finalize_job stores it per message and gates log_turn on it; JS orchestrate payload sends off_record correctly - Per-message hover metadata: removed always-visible .model-tag; replaced with .msg-meta strip in the action bar (hover-only); shows model label, host, fallback indicator, and OTR badge; stored in session JSON - Send/stop button tooltip: shows role + model and (when tools on) separate orchestrator model + engine label; live elapsed timer on stop button via startRunTimer/stopRunTimer - OrchestratorResult.backend_label: new field; openai_orchestrator fills it; finalize_job propagates it to job dict and session messages - GET /backend: exposes orchestrator_model label so the frontend tooltip can show both models separately - TODO: session delete confirmation added Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-09 16:12:03 -04:00
Scott Idem	128d8a7c1e	feat: inject session mode into persona system prompt context_loader.load_context() now accepts a mode param ("chat"\|"otr"). In OTR mode, the --- System --- block gains a second line: Current mode: Off The Record — this conversation is private and will not be logged or included in memory distillation routers/chat.py passes mode="otr" when req.off_record is True. Normal chat and all orchestrator calls stay at mode="chat" (no change to the System block). The System block consolidates date/time and mode in one place, matching the existing timestamp pattern. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-09 14:35:09 -04:00
Scott Idem	f8f7cd75da	feat: audit log, usage tracking UI, OpenAI orchestrator compaction, onboarding + docs Tool audit log: - Every orchestrator tool call logged to home/{user}/tool_audit/YYYY-MM-DD.jsonl - Files panel sidebar: audit log group (collapsed), date-linked read-only table - Admin endpoints: /api/audit/files, /api/audit/day, /api/audit/recent, /api/audit/stats - Engine and model name recorded per entry OpenAI orchestrator improvements: - Context budget enforcement: 75% of model context_k (min 16k) - Message compaction: truncates old tool results when approaching budget - max_rounds respected per model config (intersected with server cap) OpenRouter onboarding (setup.html, onboarding.py, app.js, settings.html): - Step 3 of 3: /setup/model with curated model picker - Chat banner for users on server-default model (informational, not alarmist) - Settings quick-link card; /setup/model works standalone for existing users Model registry + session store: - set_role_config / get_role_config for per-role tool lists and system_append - session_store: session rename, session name backfill endpoint UI updates (app.js, index.html, style.css, local_llm.html): - Role toggle in context panel - Off-the-record mode - Agent notes read-only viewer - OPERATIONS.md loaded at T2+ in context Documentation: - HELP.md: full tool table, per-role tool sets, Agent Notes, usage tracking - TOOLS.md: Agent Notes section, count corrected to 44 - ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md updated to match reality - CLAUDE.md: onboarding flow, documentation philosophy sections - README.md: stack in practice, DeepSeek TUI mention, architecture diagram updated - TODO__Agents.md: onboarding task completed with deviation notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:26:43 -04:00
Scott Idem	0b96772fa6	fix: show session friendly name in resume message and status bar /history/{session_id} now returns a 'name' field alongside messages. resumeSession() uses data.name first, then the sessionNames map, then raw ID as fallback — so named sessions display correctly even on page load before the sessions panel has been opened. 'Resumed session X' message also now shows the friendly name. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:14:59 -04:00
Scott Idem	8baab874f1	feat: replace backend/slot toggle with role selector The backend toggle now cycles through configured roles (chat, coder, research, distill, etc.) instead of backup model slots within the chat role. Each role uses its own primary→backup chain from the registry. - ChatRequest.slot replaced by chat_role (default "chat") - GET /backend returns available_roles instead of chat_models - _available_roles_for_toggle() builds list from defined_roles, excluding orchestrator (which has its own Agent mode) - Model label on responses now reflects the actual role's assigned model - Toggle is inert when only one role is configured (avoids useless cycling) - Add "Clear browser cache" button to Account Settings (Connected Accounts) - Add _role_model_label() helper for cleaner response tag labeling Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 19:23:18 -04:00
Scott Idem	962d58d2e2	feat: model registry Phase 3 — slot-based backend toggle Backend toggle now cycles through chat role models by label instead of cycling service type strings (auto/claude/gemini/local). - model_registry: get_model_for_slot() — resolves a specific priority slot without walking the fallback chain - llm_client: complete() gains slot param; explicit slot selection dispatches directly to that model with no silent fallback - routers/chat.py: ChatRequest.slot; GET /backend returns chat_models [{slot, label, type}] for the UI; _stream_chat uses resolved model label for the response tag when a slot is pinned - app.js: toggle loads chat_models from /backend, cycles by label, sends slot in chat payload; legacy model field removed from payload - app.js: fix Gap B — agent mode placeholder no longer says "Gemini tool loop"; now says "orchestrator" - DESIGN doc: updated to reflect phases 1+2 complete, catalog-as-code decision, Gap A/B documented, Phase 3 implementation details Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-27 21:43:08 -04:00
Scott Idem	3b3456600a	feat: store and display backend + host metadata on chat messages Each assistant message in the session JSON now carries: backend, backend_label, host (platform.node()) These fields are shown as model tags in the UI — on live responses and when loading session history. Session log entries (sessions/YYYY-MM-DD.md) include the backend label and host in the turn header. The local (OpenAI-compat) backend strips non-standard fields before sending messages to the API so extra fields don't leak upstream. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 22:16:48 -04:00
Scott Idem	d9a322164a	feat: OpenAI-compatible orchestrator + backend auto-routing - openai_orchestrator.py — new ReAct tool loop engine for any OpenAI-compatible endpoint (OpenRouter, Open WebUI, Ollama, LiteLLM); model handles both tool loop and final response, no Claude handoff needed - tools/__init__.py — auto-derive OpenAI JSON Schema from existing Gemini FunctionDeclarations so tool definitions have a single source of truth - routers/orchestrator.py — route to openai_orchestrator when model registry "orchestrator" role resolves to a local_openai type host - routers/chat.py — pass role to _backend_label(); fix fallback_used logic (only meaningful for explicit backend overrides, not auto-routing) - static/app.js — add null/"auto" to backend cycle; fetch local model hint without overriding the auto default on page load - model_registry.py — _normalize() back-fills host_type on old registry files - requirements.txt — add openai>=1.0.0 - ARCH__BACKENDS.md — document OpenAI-compat backend and routing logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:18:18 -04:00
Scott Idem	8570e8d852	fix: backend toggle not sent to server; add per-message model tag Fixes: - app.js was tracking primaryBackend locally but never included model: primaryBackend in the /chat POST body, so the server always used settings.primary_backend regardless of what the user clicked. Now model: primaryBackend is sent on every chat request. - Responses were only annotated when fallback occurred. Now every assistant message shows a small model tag at the bottom right. chat.py: - _backend_label() resolves human-readable name: claude → "Claude", gemini → "Gemini", local → registry label (e.g. "Gemma 4 E4B") or model_name - SSE payload now includes backend_label field app.js: - model: primaryBackend added to /chat fetch body - After every response, appends .model-tag div with backend_label - Fallback shows "⚡ fallback → <label>" in amber; normal is muted - Removed separate system message for fallback (tag covers it) style.css: - .model-tag: small muted text, right-aligned, separated by thin line - .model-tag.fallback: amber (#f59e0b) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 22:10:40 -04:00
Scott Idem	6a1a1c2686	feat: unified model registry with role-based routing Introduces model_registry.py as the single source of truth for all LLM backend configuration. Replaces scattered backend settings across user_settings, config distill_backend_, and the UI toggle. model_registry.py: - Per-user home/{user}/model_registry.json with version, hosts, models, roles - Models have: type (local_openai\|claude_cli\|gemini_cli\|gemini_api), label, model_name, host_id, context_k (tokens), tags (capability labels) - Roles map to priority chains: primary, backup_1..backup_4 - Built-in IDs (claude_cli, gemini_cli, gemini_api) always resolvable - Auto-migrates existing local_llm.json on first access - CRUD: save_host, remove_host, save_model, remove_model, set_role - get_model_for_role(): registry → .env default → hardcoded fallback config.py: - role_chat/orchestrator/distill/coder/research .env defaults - defined_roles: comma-separated standard role list (extensible) - get_defined_roles() and get_role_default() helper methods llm_client.complete(): - New role= parameter (default "chat") for registry-based routing - model= still accepted for explicit UI toggle override - _claude() and _local() accept model_cfg dict instead of raw string - _local() uses pre-resolved config from registry memory_distiller.py: - distill_mid/long now use role="distill" (no more distill_backend_ .env vars needed) cron_runner.py: - brief jobs use role="chat" routers/chat.py + auth.py: - Use model_registry instead of user_settings for local model info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:25:18 -04:00
Scott Idem	a4daebdc9b	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 20:53:06 -04:00
Scott Idem	fa04b5e6b0	feat: off the record mode (OTR) Adds a third input mode toggle alongside Note and Agent. When active: - Textarea gets a subtle purple tint with dashed border - OTR button highlights purple - Placeholder reads "Off the record — not logged or distilled…" - off_record=True is sent to /chat; session_logger is skipped - In-memory session context is preserved within the session Switching to Note or Agent mode deactivates OTR, and vice versa. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 21:07:21 -04:00
Scott Idem	0cf0d65e9e	feat: session naming, username/persona rename, help page, contrast fixes - Session name field: PATCH /sessions/{id} endpoint, inline rename button in UI - Persona rename: inline ✏ toggle form in settings, POST /settings/persona/rename - Username rename: inline form in settings, POST /settings/username (renames home dir, forces re-login) - Help page: dedicated /help route replacing modal, collapsible sections - Per-persona isolation: files.py and session_store.py now scope to correct user/persona - Contrast/visibility: muted text bumped to slate-400+, session rename btn at 0.4 opacity Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 23:10:12 -04:00
Scott Idem	c01ef663f5	fix: per-persona session/file isolation + onboarding route order - session_store: store sessions under home/{user}/persona/{name}/session_data/ instead of the shared cortex/data/sessions/ bucket - chat endpoints: add user/persona query params to /sessions, /history/, /sessions/, /note so they resolve the correct persona context - files router: add user/persona query params to /files and /files/{name} so the file browser loads the right persona's files - app.js: pass user/persona on all session, history, and file fetches; move _fileParams to top-level scope so it is available everywhere - onboarding: fix FastAPI route ordering — register /persona before /{token} so the literal path wins and does not get captured as a token value - ui.py: read Emoji field from IDENTITY.md and inject into CORTEX_CONFIG so the header icon reflects each persona's chosen emoji - .gitignore: exclude home/**/session_data/ (runtime state) - migrate scott/inara sessions from cortex/data/sessions/ to session_data/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-22 00:01:07 -04:00
Scott Idem	77e770cdb2	feat: multi-user/multi-persona support with two-level home directory layout Restructures persona storage from a flat personas/{name}/ layout to home/{username}/persona/{name}/, mirroring Linux home directories. Changes: - persona.py: two ContextVars (user + persona), Linux-style name validation, set_context(), get_user(), get_persona(), validate(), list_users(), list_user_personas(); persona_path() takes (username, name) - config.py: replaces personas_dir with home_dir + home_root() - git mv personas/inara → home/scott/persona/inara (history preserved) - home/holly/persona/tina/: Holly's persona stub added - cron_runner.py: all storage functions take (username, persona) params - tools/cron.py: stamps user + persona on jobs; APScheduler IDs are {user}:{persona}:{job_id} to prevent collisions across users - memory_distiller.py: distill_short/mid/long take (username, persona); added missing Path + settings imports - scheduler.py: _load_user_crons() iterates home//persona/ (two-level) - routers/chat.py, orchestrator.py: user field added; set_context() called - tests/conftest.py: home_root fixture with two-level structure; patches home_dir instead of personas_dir - tests/test_persona.py: fully rewritten for two-level API - tests/test_api_files.py: updated fixture name and path - .env.default: documents HOME_DIR setting; scrubs stale API key - CLAUDE.md, README.md: directory maps updated for new layout All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:35:40 -04:00
Scott Idem	5cadb836fa	feat: multi-persona support (single Cortex, multiple users) - Add cortex/persona.py: ContextVar-based per-request routing with path traversal protection and persona validation - Migrate inara/ → personas/inara/ (git history preserved via git mv) - config.py: add personas_root(), inara_path() delegates to personas/inara - All 14 settings.inara_path() call sites replaced with persona_path() - ChatRequest + OrchestrateRequest: add persona field (default: "inara") with validation at request entry before any processing - memory_distiller: add optional persona param for future per-persona distill - cron_runner/tools/cron: stamp persona on jobs, prefix APScheduler IDs (persona:job_id) to prevent collisions across personas - scheduler: _load_user_crons() iterates all personas at startup Adding a new persona: create personas/<name>/ with IDENTITY.md + SOUL.md. Auth: handled at nginx level (inject X-Cortex-Persona header per subdomain). Future: persona maps to Aether account_id_random for full integration. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 21:50:02 -04:00
Scott Idem	f935fc4a7f	feat: session delete + touch-friendly message controls Session delete: - DELETE /sessions/{session_id} endpoint (chat.py + session_store.py) - × button on each session item in the panel (hover-reveal on desktop) - Clears UI if the active session is deleted Touch accessibility: - @media (hover: none) rule makes msg-actions always visible on touch devices - msg-act-btn tap targets enlarged to 36px min-height, readable font size - session-delete-btn also always visible and finger-sized on touch Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 19:43:20 -04:00
Scott Idem	ce3c1f5f7f	Add tiered memory system with manual distillation - config.py: memory_budget_long/mid/short settings (overridable in .env) - memory_distiller.py: distill_short (no LLM), distill_mid, distill_long (LLM) - routers/distill.py: POST /distill/{short,mid,long,all} endpoints - context_loader.py: rewrote to load long→mid→short order with include_* toggles - routers/chat.py: ChatRequest gains include_long/mid/short fields - routers/files.py: MEMORY_LONG/MID/SHORT.md added to ALLOWED set - main.py: register distill router - static/index.html: context bar — tier selector, L/M/S memory toggles, distill buttons with status feedback; send includes tier + memory flags - inara/MEMORY_LONG.md: migrated from MEMORY.md + Cortex/Talk bot notes - inara/MEMORY_MID.md, MEMORY_SHORT.md: stubs ready for distillation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:22:32 -04:00
Scott Idem	3455c7a09c	Add SSE real-time Talk activity, file editor UI, and identity file API - event_bus.py: in-process asyncio pub/sub (one Queue per SSE client) - nextcloud_talk.py: publishes nct_message/nct_response events to bus - chat.py: GET /events SSE endpoint streams Talk activity to browser - routers/files.py: whitelist-protected GET/PUT for Inara identity .md files - main.py: register files router - static/index.html: real-time Talk feed, blue badge on Sessions btn, Files modal with preview/edit toggle and Ctrl+S save Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:10:07 -04:00
Scott Idem	8add4ffd02	Add edit/delete history, named sessions, scroll fix, systemd service - Edit/delete individual messages from session context with inline editing (Ctrl+Enter saves, Escape cancels); changes sync to backend via PUT /history - PUT /history/{session_id} endpoint to replace full message list - Named sessions: readable slugs (e.g. quiet-spring) instead of UUID fragments - Scroll no longer snaps to bottom when user has scrolled up to read history - cortex.service: systemd unit for auto-start and restart-on-failure Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-10 23:38:39 -04:00
Scott Idem	2f675ee4bf	Initial commit — Cortex API + Inara identity Cortex: FastAPI backend serving Inara via Claude/Gemini CLI backends. Includes SSE streaming chat, session persistence, Google Chat webhook handler, and Docker support. Inara: Identity files (persona, soul, protocols, memory, context tiers) mounted read-only into the container at runtime. Features in initial cut: - /chat endpoint with SSE keepalive + LLM fallback - Session store with rolling history window - Markdown rendering, copy-to-clipboard, links open in new tab - Stacked right-column input controls (height selector, enter toggle, note mode with public/private) — semi-hidden until textarea grows - /note endpoint for injecting public context into session history - Docker Compose config (local dev runs natively; Docker for server) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 03:41:00 -05:00

22 Commits