Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	ddf44a2aee	feat: web push notifications (VAPID) - push_utils.py: subscription storage + send helper (auto-prunes 410 endpoints) - routers/push.py: GET /api/push/vapid-key (public), POST/DELETE /api/push/subscribe - sw.js: push event listener shows notification; notificationclick focuses/opens tab - app.js: subscribe/unsubscribe flow + "Enable notifications" toggle in settings dropdown - tools/notify.py: web_push orchestrator tool (user-level, no admin required) - VAPID keys in .env; pywebpush added to requirements.txt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:38:58 -04:00
Scott Idem	6a1a1c2686	feat: unified model registry with role-based routing Introduces model_registry.py as the single source of truth for all LLM backend configuration. Replaces scattered backend settings across user_settings, config distill_backend_, and the UI toggle. model_registry.py: - Per-user home/{user}/model_registry.json with version, hosts, models, roles - Models have: type (local_openai\|claude_cli\|gemini_cli\|gemini_api), label, model_name, host_id, context_k (tokens), tags (capability labels) - Roles map to priority chains: primary, backup_1..backup_4 - Built-in IDs (claude_cli, gemini_cli, gemini_api) always resolvable - Auto-migrates existing local_llm.json on first access - CRUD: save_host, remove_host, save_model, remove_model, set_role - get_model_for_role(): registry → .env default → hardcoded fallback config.py: - role_chat/orchestrator/distill/coder/research .env defaults - defined_roles: comma-separated standard role list (extensible) - get_defined_roles() and get_role_default() helper methods llm_client.complete(): - New role= parameter (default "chat") for registry-based routing - model= still accepted for explicit UI toggle override - _claude() and _local() accept model_cfg dict instead of raw string - _local() uses pre-resolved config from registry memory_distiller.py: - distill_mid/long now use role="distill" (no more distill_backend_ .env vars needed) cron_runner.py: - brief jobs use role="chat" routers/chat.py + auth.py: - Use model_registry instead of user_settings for local model info Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 21:25:18 -04:00
Scott Idem	a4daebdc9b	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 20:53:06 -04:00
Scott Idem	93f7f44e51	feat: per-user channel config for Google Chat and Nextcloud Talk - New endpoints: POST /channels/google-chat/{username} and /webhook/nextcloud/{username} - Channel secrets/config live in home/{username}/channels.json (gitignored) - auth_utils: get_user_channels() helper reads channels.json - Both routers load persona, audience/secret, backend, timeout per user; set_context() wires the correct persona before building the system prompt - Removed server-level channel settings from config.py and .env — no user gets a channel until they create their own channels.json - .gitignore: home/**/channels.json added To migrate: update Google Chat Add-on webhook URL to /channels/google-chat/{username} and re-register NC Talk bot at /webhook/nextcloud/{username} Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-29 13:02:45 -04:00
Scott Idem	8aec6aafcc	feat: Google OAuth sign-in + per-user Gemini API key Users with Google accounts can now sign in without a password. Auth flow: - GET /auth/google → Google consent page (CSRF state cookie) - GET /auth/google/callback → exchange code, lookup user, set JWT - auth.json gains google_sub + google_email fields - set_password() no longer overwrites unrelated auth.json fields Admin setup: python manage_passwords.py google-add <username> <email> # add GOOGLE_CLIENT_ID + GOOGLE_CLIENT_SECRET to .env Per-user Gemini key: - get_user_gemini_key() reads gemini_api_key from auth.json - orchestrator_engine.run() accepts gemini_api_key param - orchestrator router passes user's key, falls back to server key login.html: "Sign in with Google" button above the password form. manage_passwords.py list: now shows auth method columns (pw / google). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:01:52 -04:00
Scott Idem	69f38ca7dc	feat: SMTP email support for invite links + profile.json for user email storage - email_utils.py: send_email() via smtplib.SMTP_SSL (port 465, same server as AE API); send_invite_email() renders plain-text + HTML invite template - config.py: smtp_server, smtp_port, smtp_username, smtp_password, smtp_from_email, smtp_from_name, cortex_base_url settings - manage_passwords.py: - profile.json helpers (get/set email stored in home/{username}/profile.json) - invite command now accepts optional email arg, sends invite automatically; falls back to stored email; prints link either way - new 'email' command to store/update a user's email address - 'list' command now shows email alongside password status - .env.default: SMTP_* and CORTEX_BASE_URL documented Usage after adding SMTP_PASSWORD to .env: python manage_passwords.py invite holly holly@example.com → generates token, stores email, sends invite, prints link as fallback All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 23:19:09 -04:00
Scott Idem	a9bbb668b5	feat: session auth + per-user/persona UI at /{user}/{persona} Replaces nginx basic auth with a proper per-user session system: - auth_utils.py: bcrypt password hashing, JWT cookie creation/decode - auth_middleware.py: validates JWT cookie on all routes except /login, /health, /static/, and webhook endpoints (/channels/, /webhook/) - routers/ui.py: GET /login, POST /login, POST /logout, GET /{username}/{persona} — serves index.html with CORTEX_CONFIG injected - static/login.html: minimal login form (dark theme, matches UI) - main.py: registers SessionAuthMiddleware + ui.router - config.py: jwt_secret, jwt_expire_days settings - manage_passwords.py: CLI tool to set/check/list user passwords - app.js: reads window.CORTEX_CONFIG (user + persona), sends both on every /chat and /orchestrate request; persona name shown in header; logout button (⏏) added to header - requirements.txt: bcrypt, PyJWT, python-multipart - .env.default: JWT_SECRET, JWT_EXPIRE_DAYS documented - tests: client fixture injects JWT cookie; security test assertions updated for URL-normalized path traversal paths (still secure, codes differ) All 80 tests pass. Setup for a new user: python manage_passwords.py set scott python manage_passwords.py set holly Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:54:12 -04:00
Scott Idem	77e770cdb2	feat: multi-user/multi-persona support with two-level home directory layout Restructures persona storage from a flat personas/{name}/ layout to home/{username}/persona/{name}/, mirroring Linux home directories. Changes: - persona.py: two ContextVars (user + persona), Linux-style name validation, set_context(), get_user(), get_persona(), validate(), list_users(), list_user_personas(); persona_path() takes (username, name) - config.py: replaces personas_dir with home_dir + home_root() - git mv personas/inara → home/scott/persona/inara (history preserved) - home/holly/persona/tina/: Holly's persona stub added - cron_runner.py: all storage functions take (username, persona) params - tools/cron.py: stamps user + persona on jobs; APScheduler IDs are {user}:{persona}:{job_id} to prevent collisions across users - memory_distiller.py: distill_short/mid/long take (username, persona); added missing Path + settings imports - scheduler.py: _load_user_crons() iterates home//persona/ (two-level) - routers/chat.py, orchestrator.py: user field added; set_context() called - tests/conftest.py: home_root fixture with two-level structure; patches home_dir instead of personas_dir - tests/test_persona.py: fully rewritten for two-level API - tests/test_api_files.py: updated fixture name and path - .env.default: documents HOME_DIR setting; scrubs stale API key - CLAUDE.md, README.md: directory maps updated for new layout All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:35:40 -04:00
Scott Idem	5cadb836fa	feat: multi-persona support (single Cortex, multiple users) - Add cortex/persona.py: ContextVar-based per-request routing with path traversal protection and persona validation - Migrate inara/ → personas/inara/ (git history preserved via git mv) - config.py: add personas_root(), inara_path() delegates to personas/inara - All 14 settings.inara_path() call sites replaced with persona_path() - ChatRequest + OrchestrateRequest: add persona field (default: "inara") with validation at request entry before any processing - memory_distiller: add optional persona param for future per-persona distill - cron_runner/tools/cron: stamp persona on jobs, prefix APScheduler IDs (persona:job_id) to prevent collisions across personas - scheduler: _load_user_crons() iterates all personas at startup Adding a new persona: create personas/<name>/ with IDENTITY.md + SOUL.md. Auth: handled at nginx level (inject X-Cortex-Persona header per subdomain). Future: persona maps to Aether account_id_random for full integration. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 21:50:02 -04:00
Scott Idem	97438f1a0f	feat: multi-instance support — agent_name and user_name configurable All hardcoded "Inara"/"Scott" strings replaced with settings.agent_name and settings.user_name, read from .env at startup: - config.py: AGENT_NAME and USER_NAME settings (defaults: Inara / Scott) - llm_client.py: conversation labels in prompt builder - session_logger.py: Name: labels in session log markdown - memory_distiller.py: distillation system prompts (mid + long) - routers/nextcloud_talk.py: @mention prefix strip - routers/google_chat.py: greeting message Second instance scaffolding: - holly/: identity directory with placeholder files (USER_NAME=Holly, AGENT_NAME to be chosen by Holly) - cortex/.env.holly: config for Holly's instance on port 8001 - cortex-holly.service: systemd unit for the second instance No behavioural change to the Inara/Scott instance — defaults unchanged. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 20:13:11 -04:00
Scott Idem	ed472ce9a0	feat: Intelligence Layer Phase 1 — orchestrator service Adds the Gemini API orchestrator (ReAct tool loop → Claude responder): Orchestrator engine + router: - orchestrator_engine.py: Gemini API tool loop, Claude CLI handoff - routers/orchestrator.py: POST /orchestrate (async job queue), GET /orchestrate/{job_id} Tools (cortex/tools/): - web.py: DuckDuckGo web search (no key required) - ae_knowledge.py: ae_journal_search + ae_journal_entry_create (AE V3 API) - ae_tasks.py: ae_task_list (reads agents_sync Kanban filesystem) - files.py: file_read (path-allowlisted to safe dirs) Config + deps: - config.py: orchestrator, DuckDuckGo, and AE API settings - requirements.txt: google-genai, duckduckgo-search - .env.default: reference config with all new keys documented Docs: - CLAUDE.md, README.md, documentation/ added to repo - Port references updated 7331 → 8000 throughout - Default model updated to gemini-2.5-flash Tested: ae_task_list, ae_journal_search, web_search all working end-to-end. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 19:37:49 -04:00
Scott Idem	ce1561572a	config: default scheduler timezone to America/New_York	2026-03-17 22:42:47 -04:00
Scott Idem	b123dc3117	Fix scheduler timezone: use ZoneInfo(settings.scheduler_timezone) not 'local'	2026-03-17 22:38:11 -04:00
Scott Idem	4253e69c0b	Add auto memory distillation scheduler (APScheduler) - scheduler.py: AsyncIOScheduler with three cron jobs short daily 03:00 (no LLM, always fast) mid weekly Sun 03:30 (LLM) long monthly 1st 04:00 (LLM — off by default) - config.py: AUTO_DISTILL, AUTO_DISTILL_SHORT/MID/LONG .env flags - main.py: start/stop scheduler in FastAPI lifespan - routers/distill.py: GET /distill/status — next run times + config - requirements.txt: apscheduler>=3.10 - HELP.md: updated planned items, added /distill/status to API table Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 22:31:38 -04:00
Scott Idem	ce3c1f5f7f	Add tiered memory system with manual distillation - config.py: memory_budget_long/mid/short settings (overridable in .env) - memory_distiller.py: distill_short (no LLM), distill_mid, distill_long (LLM) - routers/distill.py: POST /distill/{short,mid,long,all} endpoints - context_loader.py: rewrote to load long→mid→short order with include_* toggles - routers/chat.py: ChatRequest gains include_long/mid/short fields - routers/files.py: MEMORY_LONG/MID/SHORT.md added to ALLOWED set - main.py: register distill router - static/index.html: context bar — tier selector, L/M/S memory toggles, distill buttons with status feedback; send includes tier + memory flags - inara/MEMORY_LONG.md: migrated from MEMORY.md + Cortex/Talk bot notes - inara/MEMORY_MID.md, MEMORY_SHORT.md: stubs ready for distillation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:22:32 -04:00
Scott Idem	fe854ee534	Add Nextcloud Talk bot integration (Inara) - New routers/nextcloud_talk.py: webhook handler verifies incoming HMAC, calls LLM via BackgroundTasks, posts reply with correctly computed signature (random + message_text, not raw body) - llm_client.py: read Claude OAuth token live from ~/.claude/.credentials.json to avoid stale systemd env tokens; strip conflicting ANTHROPIC_API_KEY - config.py: add nextcloud_url, nextcloud_talk_bot_secret, nextcloud_talk_timeout settings - main.py: register nextcloud_talk router, add logging setup - docs/NEXTCLOUD_TALK_BOT.md: installation guide + HMAC signing gotcha Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-16 23:04:26 -04:00
Scott Idem	2f675ee4bf	Initial commit — Cortex API + Inara identity Cortex: FastAPI backend serving Inara via Claude/Gemini CLI backends. Includes SSE streaming chat, session persistence, Google Chat webhook handler, and Docker support. Inara: Identity files (persona, soul, protocols, memory, context tiers) mounted read-only into the container at runtime. Features in initial cut: - /chat endpoint with SSE keepalive + LLM fallback - Session store with rolling history window - Markdown rendering, copy-to-clipboard, links open in new tab - Stacked right-column input controls (height selector, enter toggle, note mode with public/private) — semi-hidden until textarea grows - /note endpoint for injecting public context into session history - Docker Compose config (local dev runs natively; Docker for server) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 03:41:00 -05:00

17 Commits