Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	0c1cf3989a	feat: aider multi-provider credentials + test suite green (182/182) aider_run multi-provider credentials (tools/aider.py): - _resolve_credentials() — general credential resolver; replaces the previous OpenRouter-only injection; resolution priority: Anthropic model hint → explicit host_label → model prefix (openrouter/, groq/, deepseek/*, …) → OpenRouter default → Anthropic API key → any keyed cloud host → local/generic host - _host_flags() — generates --api-key slug=key for known cloud providers (OpenRouter, OpenAI, Groq, Together, Fireworks, X.ai, DeepSeek, Mistral); generates --openai-api-base + --openai-api-key for generic/local hosts (Open WebUI, Ollama); appends /api suffix for openwebui host_type; auto-prefixes model with 'openai/' for generic endpoints when model has no / prefix - Anthropic API keys from providers.anthropic.credentials (not a host entry) - host_label param added to aider_run and FunctionDeclaration — pick a configured host by partial label match (e.g. 'OpenRouter', 'Local', 'scott-lt-i7-rtx') - 16 unit tests for _resolve_credentials covering all resolution paths main.py: move @app.get("/health") before app.include_router(ui.router) — the /{username} catch-all in ui.router was swallowing the /health path Test suite: 37 pre-existing failures → 182/182 passing - test_tools.py: _task_list() missing priority arg (6 callsites); cron ID regex c_\w+ → c_[\w-]+ (token_urlsafe includes '-', causing intermittent truncation) - test_webhooks.py: rewritten for per-user channel config architecture — patch routers.nextcloud_talk/google_chat.get_user_channels instead of removed settings fields; corrected endpoints /webhook/nextcloud/scott and /channels/google-chat/scott; non-empty cfg dicts so falsy-guard passes - test_health.py: test_unknown_route_404 now uses 3-segment path (/{u}/{p}/x) since single-segment paths hit the /{username} UI catch-all - test_api_files.py: removed '../config.py' from not-in-allowed test (ASGI normalizes it to /config.py which hits /{username} catch-all, not files router) - test_security.py: same webhook patch target fix; per-user endpoint URLs Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-03 23:00:45 -04:00
Scott Idem	70665fadff	feat: schedules UI, task cron type, monthly/yearly schedules, AE DB tools, integrations page - Schedules web UI (/settings/crons): list, add, edit, pause/resume, delete jobs - cron task type: full orchestrator tool loop on a schedule, result → notification channel - parse_schedule: monthly/yearly formats (monthly:DD:HH:MM, yearly:MM:DD:HH:MM) - HA inbound webhook tools toggle: orchestrator loop vs. direct LLM, configurable in UI - ae_db_query/describe/show_view: SELECT-only Aether MariaDB access (admin, per-user creds) - /settings/integrations: admin-only page for Aether DB credentials - Schedules nav link added to all settings pages - pymysql added to requirements - Docs updated: HELP.md, MASTER.md, CLAUDE.md Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 21:06:43 -04:00
Scott Idem	69ec2f667d	feat: tool risk policy UI + wiring through all orchestrators - New /settings/tools page: max_risk selector (low/medium/high) + per-tool override dropdowns (Default / Force include / Force exclude) for all 58 tools grouped by category with color-coded risk badges; JS updates Auto status live - get_tools_for_role() + get_openai_tools_for_role() now accept max_risk, whitelist, blacklist; _apply_risk_policy() handles the filtering logic - get_risk_policy() helper in auth_utils reads from tool_policy.json - Risk policy wired through orchestrator.py, openai_orchestrator.py, orchestrator_engine.py, nextcloud_talk.py, homeassistant.py - Tools nav link added to settings.html and notifications.html - CLAUDE.md and ARCH__SYSTEM.md updated: tool count 50→58, risk system docs, tool access control three-layer model documented Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-11 22:45:04 -04:00
Scott Idem	1d361fe809	feat: NCT orchestrator support + Home Assistant webhook nextcloud_talk.py: - Fix missing import hmac / import hashlib (NameError bug in _verify_signature) - Add orchestrator routing when channels.json "tools": true — sends "⏳ Working on it…" immediately, then runs the full tool loop and replies with the result; checkpoint case gets a web UI confirmation note - Read tier and role from channel config (defaults: default_tier / "chat") - Pass cfg through to _process_message homeassistant.py (new): - POST /webhook/ha/{username}/{webhook_id} - Auth: webhook_id path segment matched against channels.json - Accepts JSON or form-encoded body from HA automations - Builds natural-language task from payload (uses "message" key if present, otherwise serialises full body as context) - Same orchestrator/direct dispatch as NCT - Delivers response via notify() — NC Talk, web push, or configured channel - Session key: ha_{username} for continuity across HA events - Registered in main.py; /webhook/ prefix already public in auth_middleware channels.json schema addition: "homeassistant": { "webhook_id": "your-secret-id", "persona": "inara", "tier": 2, "role": "chat", "tools": false } Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-11 19:45:59 -04:00
Scott Idem	f8f7cd75da	feat: audit log, usage tracking UI, OpenAI orchestrator compaction, onboarding + docs Tool audit log: - Every orchestrator tool call logged to home/{user}/tool_audit/YYYY-MM-DD.jsonl - Files panel sidebar: audit log group (collapsed), date-linked read-only table - Admin endpoints: /api/audit/files, /api/audit/day, /api/audit/recent, /api/audit/stats - Engine and model name recorded per entry OpenAI orchestrator improvements: - Context budget enforcement: 75% of model context_k (min 16k) - Message compaction: truncates old tool results when approaching budget - max_rounds respected per model config (intersected with server cap) OpenRouter onboarding (setup.html, onboarding.py, app.js, settings.html): - Step 3 of 3: /setup/model with curated model picker - Chat banner for users on server-default model (informational, not alarmist) - Settings quick-link card; /setup/model works standalone for existing users Model registry + session store: - set_role_config / get_role_config for per-role tool lists and system_append - session_store: session rename, session name backfill endpoint UI updates (app.js, index.html, style.css, local_llm.html): - Role toggle in context panel - Off-the-record mode - Agent notes read-only viewer - OPERATIONS.md loaded at T2+ in context Documentation: - HELP.md: full tool table, per-role tool sets, Agent Notes, usage tracking - TOOLS.md: Agent Notes section, count corrected to 44 - ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md updated to match reality - CLAUDE.md: onboarding flow, documentation philosophy sections - README.md: stack in practice, DeepSeek TUI mention, architecture diagram updated - TODO__Agents.md: onboarding task completed with deviation notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:26:43 -04:00
Scott Idem	584ae679a6	feat: tool call audit log Every orchestrator tool invocation is recorded to home/{user}/tool_audit/YYYY-MM-DD.jsonl. Each entry captures: timestamp, user, tool, args (truncated), status (ok/error/denied), result length, and a 300-char result snippet. - tool_audit.py: JSONL writer with per-file asyncio locks; read_recent / read_recent_all_users helpers - tools/__init__.py: hook in call_tool() — fire-and-forget record on every dispatch - routers/audit.py: GET /api/audit/recent and /api/audit/stats (admin-only) - tools/files.py: add home_root() to file_read allowed roots so agents can read audit JSONL Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:55:59 -04:00
Scott Idem	ddf44a2aee	feat: web push notifications (VAPID) - push_utils.py: subscription storage + send helper (auto-prunes 410 endpoints) - routers/push.py: GET /api/push/vapid-key (public), POST/DELETE /api/push/subscribe - sw.js: push event listener shows notification; notificationclick focuses/opens tab - app.js: subscribe/unsubscribe flow + "Enable notifications" toggle in settings dropdown - tools/notify.py: web_push orchestrator tool (user-level, no admin required) - VAPID keys in .env; pywebpush added to requirements.txt Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 19:38:58 -04:00
Scott Idem	a4daebdc9b	feat: local LLM multi-model, session search, cron proactive types, notifications, docs overhaul Local LLM: - user_settings.py: per-user hosts/models config (local_llm.json) - routers/local_llm.py + static/local_llm.html: dedicated settings page - llm_client.py: local OpenAI-compatible backend via httpx - config.py: LOCAL_API_URL/KEY/MODEL + per-backend timeouts - Active model shown near backend toggle (amber hint text) Memory distillation: - memory_distiller.py: DISTILL_BACKEND_MID/LONG .env overrides - scheduler.py + notification.py: notify NC Talk after mid/long distill - notification.py: outbound channel abstraction (NC Talk, extensible) Session search: - routers/files.py: GET /sessions/search?q= with excerpts grouped by date - static/index.html + app.js: search UI in file sidebar with highlight - _esc() helper to prevent XSS in search results Proactive cron: - cron_runner.py: new job types — message (send directly) and brief (LLM + send) - Both support optional per-job channel override Channels: - routers/nextcloud_talk.py: consolidated using notification._send_nct_message() - routers/auth.py: local backend status in /auth/status - routers/chat.py: /backend returns {primary, fallback, local_model} object UI / UX: - Copy button for user messages (matching assistant) - Autocomplete disabled on sensitive form fields - settings.html: local model section replaced with link to /settings/local Docs overhaul: - MASTER.md hub + ARCH__SYSTEM/BACKENDS/PERSONA/CHANNELS/FUTURE.md - ARCH__Intelligence_Layer.md replaced with redirect table - CORTEX.md trimmed to vision only; README updated - OPEN_WEBUI_API.md added to docs/ Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-05 20:53:06 -04:00
Scott Idem	8aec6aafcc	feat: Google OAuth sign-in + per-user Gemini API key Users with Google accounts can now sign in without a password. Auth flow: - GET /auth/google → Google consent page (CSRF state cookie) - GET /auth/google/callback → exchange code, lookup user, set JWT - auth.json gains google_sub + google_email fields - set_password() no longer overwrites unrelated auth.json fields Admin setup: python manage_passwords.py google-add <username> <email> # add GOOGLE_CLIENT_ID + GOOGLE_CLIENT_SECRET to .env Per-user Gemini key: - get_user_gemini_key() reads gemini_api_key from auth.json - orchestrator_engine.run() accepts gemini_api_key param - orchestrator router passes user's key, falls back to server key login.html: "Sign in with Google" button above the password form. manage_passwords.py list: now shows auth method columns (pw / google). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-27 21:01:52 -04:00
Scott Idem	1b425a539f	feat: account settings page + dedicated help page - Add /settings page with password change form and personas list - Add /help dedicated page (replaces help modal); renders HELP.md with collapsible sections, dark theme, back link to active persona - Add 👤 account button and convert ? button to link in header - Remove help modal HTML and ~55 lines of modal JS from main app - Register settings and help routers in main.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 21:41:18 -04:00
Scott Idem	8c61c28b7d	fix: mount /static before ui.router to prevent wildcard route catching static files The ui.router's /{username}/{persona} wildcard was matching /static/style.css (username="static", persona="style.css") because app.mount("/static") was registered after app.include_router(ui.router). FastAPI processes routes in registration order, so /static must be mounted first. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 23:28:13 -04:00
Scott Idem	46b65d087c	feat: persona onboarding — invite tokens, self-service setup, persona creation, switcher New user flow: 1. Admin: python manage_passwords.py invite <username> → generates URL 2. User visits /setup/<token> → sets own password → logged in 3. User redirected to /setup/persona → fills name/emoji/description 4. persona_template.py generates all starter files → lands at /{user}/{persona} Multiple personas: - Header persona name is now a clickable dropdown listing all personas - "New persona" link at bottom → /setup/persona (available to logged-in users) - /api/personas endpoint returns persona list for current session user New files: - persona_template.py: generates IDENTITY/SOUL/PROTOCOLS/USER/HELP.md + data files - routers/onboarding.py: /setup/{token}, /setup/persona GET+POST - static/setup.html: two-step form (password → persona), emoji picker, mobile-friendly Updated: - auth_utils.py: create_invite(), validate_invite(), consume_invite() - manage_passwords.py: invite command with URL output - auth_middleware.py: /setup/* prefix is public (invite tokens need no auth) - routers/ui.py: /api/personas endpoint; post-login redirect if no personas - static/app.js: persona switcher dropdown with navigation + Add persona link - static/style.css: .persona-switcher, .persona-dropdown, mobile adjustments Mobile: login/setup pages are card-centered with responsive padding; dropdown avoids edge-clipping on narrow screens; logout button stays visible. All 80 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 23:10:32 -04:00
Scott Idem	a9bbb668b5	feat: session auth + per-user/persona UI at /{user}/{persona} Replaces nginx basic auth with a proper per-user session system: - auth_utils.py: bcrypt password hashing, JWT cookie creation/decode - auth_middleware.py: validates JWT cookie on all routes except /login, /health, /static/, and webhook endpoints (/channels/, /webhook/) - routers/ui.py: GET /login, POST /login, POST /logout, GET /{username}/{persona} — serves index.html with CORTEX_CONFIG injected - static/login.html: minimal login form (dark theme, matches UI) - main.py: registers SessionAuthMiddleware + ui.router - config.py: jwt_secret, jwt_expire_days settings - manage_passwords.py: CLI tool to set/check/list user passwords - app.js: reads window.CORTEX_CONFIG (user + persona), sends both on every /chat and /orchestrate request; persona name shown in header; logout button (⏏) added to header - requirements.txt: bcrypt, PyJWT, python-multipart - .env.default: JWT_SECRET, JWT_EXPIRE_DAYS documented - tests: client fixture injects JWT cookie; security test assertions updated for URL-normalized path traversal paths (still secure, codes differ) All 80 tests pass. Setup for a new user: python manage_passwords.py set scott python manage_passwords.py set holly Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 22:54:12 -04:00
Scott Idem	ed472ce9a0	feat: Intelligence Layer Phase 1 — orchestrator service Adds the Gemini API orchestrator (ReAct tool loop → Claude responder): Orchestrator engine + router: - orchestrator_engine.py: Gemini API tool loop, Claude CLI handoff - routers/orchestrator.py: POST /orchestrate (async job queue), GET /orchestrate/{job_id} Tools (cortex/tools/): - web.py: DuckDuckGo web search (no key required) - ae_knowledge.py: ae_journal_search + ae_journal_entry_create (AE V3 API) - ae_tasks.py: ae_task_list (reads agents_sync Kanban filesystem) - files.py: file_read (path-allowlisted to safe dirs) Config + deps: - config.py: orchestrator, DuckDuckGo, and AE API settings - requirements.txt: google-genai, duckduckgo-search - .env.default: reference config with all new keys documented Docs: - CLAUDE.md, README.md, documentation/ added to repo - Port references updated 7331 → 8000 throughout - Default model updated to gemini-2.5-flash Tested: ae_task_list, ae_journal_search, web_search all working end-to-end. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 19:37:49 -04:00
Scott Idem	48a6734ec3	feat: Claude CLI OAuth token expiry warning New GET /auth/status endpoint reads ~/.claude/.credentials.json and returns hours remaining + warning flag. UI shows a dismissible amber banner when < 24h remain, turning red if expired. Checked on page load and every 30 minutes. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 23:06:28 -04:00
Scott Idem	4253e69c0b	Add auto memory distillation scheduler (APScheduler) - scheduler.py: AsyncIOScheduler with three cron jobs short daily 03:00 (no LLM, always fast) mid weekly Sun 03:30 (LLM) long monthly 1st 04:00 (LLM — off by default) - config.py: AUTO_DISTILL, AUTO_DISTILL_SHORT/MID/LONG .env flags - main.py: start/stop scheduler in FastAPI lifespan - routers/distill.py: GET /distill/status — next run times + config - requirements.txt: apscheduler>=3.10 - HELP.md: updated planned items, added /distill/status to API table Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 22:31:38 -04:00
Scott Idem	ce3c1f5f7f	Add tiered memory system with manual distillation - config.py: memory_budget_long/mid/short settings (overridable in .env) - memory_distiller.py: distill_short (no LLM), distill_mid, distill_long (LLM) - routers/distill.py: POST /distill/{short,mid,long,all} endpoints - context_loader.py: rewrote to load long→mid→short order with include_* toggles - routers/chat.py: ChatRequest gains include_long/mid/short fields - routers/files.py: MEMORY_LONG/MID/SHORT.md added to ALLOWED set - main.py: register distill router - static/index.html: context bar — tier selector, L/M/S memory toggles, distill buttons with status feedback; send includes tier + memory flags - inara/MEMORY_LONG.md: migrated from MEMORY.md + Cortex/Talk bot notes - inara/MEMORY_MID.md, MEMORY_SHORT.md: stubs ready for distillation Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:22:32 -04:00
Scott Idem	3455c7a09c	Add SSE real-time Talk activity, file editor UI, and identity file API - event_bus.py: in-process asyncio pub/sub (one Queue per SSE client) - nextcloud_talk.py: publishes nct_message/nct_response events to bus - chat.py: GET /events SSE endpoint streams Talk activity to browser - routers/files.py: whitelist-protected GET/PUT for Inara identity .md files - main.py: register files router - static/index.html: real-time Talk feed, blue badge on Sessions btn, Files modal with preview/edit toggle and Ctrl+S save Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 21:10:07 -04:00
Scott Idem	fe854ee534	Add Nextcloud Talk bot integration (Inara) - New routers/nextcloud_talk.py: webhook handler verifies incoming HMAC, calls LLM via BackgroundTasks, posts reply with correctly computed signature (random + message_text, not raw body) - llm_client.py: read Claude OAuth token live from ~/.claude/.credentials.json to avoid stale systemd env tokens; strip conflicting ANTHROPIC_API_KEY - config.py: add nextcloud_url, nextcloud_talk_bot_secret, nextcloud_talk_timeout settings - main.py: register nextcloud_talk router, add logging setup - docs/NEXTCLOUD_TALK_BOT.md: installation guide + HMAC signing gotcha Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-16 23:04:26 -04:00
Scott Idem	2f675ee4bf	Initial commit — Cortex API + Inara identity Cortex: FastAPI backend serving Inara via Claude/Gemini CLI backends. Includes SSE streaming chat, session persistence, Google Chat webhook handler, and Docker support. Inara: Identity files (persona, soul, protocols, memory, context tiers) mounted read-only into the container at runtime. Features in initial cut: - /chat endpoint with SSE keepalive + LLM fallback - Session store with rolling history window - Markdown rendering, copy-to-clipboard, links open in new tab - Stacked right-column input controls (height selector, enter toggle, note mode with public/private) — semi-hidden until textarea grows - /note endpoint for injecting public context into session history - Docker Compose config (local dev runs natively; Docker for server) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 03:41:00 -05:00

20 Commits