Cortex-Inara

Author	SHA1	Message	Date
Scott Idem	7b443b40a4	feat: http_post tool, nc_talk_history tool, local orchestrator retry - http_post: POST to external URLs with per-user URL prefix allowlist (home/{user}/http_allowlist.json); admin-only, confirm-required - nc_talk_history: read recent NC Talk messages via Basic Auth (requires nc_username + nc_app_password in channels.json under nextcloud) - openai_orchestrator: _chat_with_retry() wraps both API calls with exponential backoff (3 attempts, 1s/2s) on connection errors and transient status codes (429, 500, 502, 503, 504) - Docs updated: CLAUDE.md, HELP.md, TODO, MASTER, ROADMAP (50 tools) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-09 13:38:38 -04:00
Scott Idem	a99ebb8c30	feat: retry button for orchestrator errors + explicit client timeout Extract orchestrator inner loop into _doOrchestrate() so the retry button can re-run without re-adding the user message to DOM or history — same pattern as the existing chat retry. Also set AsyncOpenAI(timeout=settings.timeout_local) so slow remote models (OpenRouter/DeepSeek) get the same 300s budget as local chat calls instead of the SDK default which varies by connection. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-09 12:39:34 -04:00
Scott Idem	f8f7cd75da	feat: audit log, usage tracking UI, OpenAI orchestrator compaction, onboarding + docs Tool audit log: - Every orchestrator tool call logged to home/{user}/tool_audit/YYYY-MM-DD.jsonl - Files panel sidebar: audit log group (collapsed), date-linked read-only table - Admin endpoints: /api/audit/files, /api/audit/day, /api/audit/recent, /api/audit/stats - Engine and model name recorded per entry OpenAI orchestrator improvements: - Context budget enforcement: 75% of model context_k (min 16k) - Message compaction: truncates old tool results when approaching budget - max_rounds respected per model config (intersected with server cap) OpenRouter onboarding (setup.html, onboarding.py, app.js, settings.html): - Step 3 of 3: /setup/model with curated model picker - Chat banner for users on server-default model (informational, not alarmist) - Settings quick-link card; /setup/model works standalone for existing users Model registry + session store: - set_role_config / get_role_config for per-role tool lists and system_append - session_store: session rename, session name backfill endpoint UI updates (app.js, index.html, style.css, local_llm.html): - Role toggle in context panel - Off-the-record mode - Agent notes read-only viewer - OPERATIONS.md loaded at T2+ in context Documentation: - HELP.md: full tool table, per-role tool sets, Agent Notes, usage tracking - TOOLS.md: Agent Notes section, count corrected to 44 - ARCH__SYSTEM.md, ARCH__BACKENDS.md, MASTER.md updated to match reality - CLAUDE.md: onboarding flow, documentation philosophy sections - README.md: stack in practice, DeepSeek TUI mention, architecture diagram updated - TODO__Agents.md: onboarding task completed with deviation notes Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-08 21:26:43 -04:00
Scott Idem	a75546485b	feat: context budget enforcement + compaction in OpenAI orchestrator Protects all models in the Primary/Backup chain regardless of context window: - _context_budget(): 75% of model_cfg["context_k"] * 1000 (default 32k if unset) - _estimate_tokens(): char count / 4 + 3k overhead for tool schemas - _compact_messages(): truncates old tool results to 400 chars, keeps last 6 intact (~2 recent rounds), logs chars saved per compaction pass - Compaction runs before every API call; log line now shows estimated token count - Malformed tool call args logged with model/args detail instead of silent {} - finish_reason check accepts "stop" and None alongside "tool_calls" (some models return wrong reason even when tool_calls are present) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 22:01:54 -04:00
Scott Idem	7d221863dc	feat: engine/model in audit log + docs update - tool_audit: ContextVars (engine, model) set at orchestrator run start; fields added to every entry - orchestrator_engine: tool_audit.set_context("gemini", model_name) at run() start - openai_orchestrator: tool_audit.set_context("openai", model label) at run() start - audit table: Model column between Status and Args - HELP.md: push notifications section, audit log in Files section, tool count 30→40, new API endpoints - TODO__Agents.md: web_push and audit log marked complete with full detail Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-05 20:42:32 -04:00
Scott Idem	49123cdd5c	feat: per-role tool lists and system prompt overlays Each role in model_registry.json can now carry two optional keys: system_append — injected into the system prompt at position 7 (after memory, closest to the turn) for the active chat_role tools — explicit tool allow-list; intersected with the user's access-level filter so it can only restrict, never elevate No changes needed for existing users — missing keys fall back to current behavior. Add keys to a role to give it a specialty focus: "coder": { "primary": "claude_cli", "system_append": "You are in code-specialist mode...", "tools": ["web_search", "file_read", "shell_exec", "scratch_write"] } Changes: - model_registry.py: get_role_config() returns system_append + tools - context_loader.py: role_append param appended as "--- Role Context ---" - tools/__init__.py: get_tools_for_role/get_openai_tools_for_role accept optional tool_list and intersect with access-level filter - orchestrator_engine.py: tool_list threaded through run/resume/checkpoint - openai_orchestrator.py: tool_list threaded through run/resume/checkpoint; _build_client now calls get_openai_tools_for_role instead of returning unfiltered OPENAI_TOOL_SCHEMAS - routers/orchestrator.py: pulls role_cfg for chat_role, passes both role_append and tool_list to context loader and engine Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-01 20:00:38 -04:00
Scott Idem	6405dd338d	feat: proper confirmation-resume flow + per-user tool policy Fixes the broken confirmation gate where users had no way to approve or deny a blocked tool call in the web UI. Changes: - orchestrator_engine.py: add OrchestrateCheckpoint dataclass, extract loop into _run_from_contents(), add resume() function - openai_orchestrator.py: same treatment — _run_from_messages(), resume() - routers/orchestrator.py: POST /{job_id}/confirm and /deny endpoints, separate _checkpoints store, _resume_job() + _finalize_job() helpers, "awaiting_confirmation" job status with pending_confirmation payload - auth_utils.py: get_tool_policy() and save_tool_policy() helpers reading home/{user}/tool_policy.json (allow/deny lists) - routers/orchestrator.py: loads tool_policy per user and passes confirm_allow/confirm_deny to both engines - app.js: poll loop handles awaiting_confirmation — shows Confirm/Deny buttons inline, resumes polling after user action - settings.html + settings.py: Tool Permissions section with allow/deny textareas, POST /settings/tool-policy route - style.css: .confirm-gate, .confirm-btn, .deny-btn styles Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-30 19:14:53 -04:00
Scott Idem	334e7f0dea	feat: role-based tool access, confirmation gates, and new orchestrator tools - auth_utils: get_user_role() reads role from auth.json (admin\|user, default user) - manage_passwords: new `role` command to promote/demote users (admin-only by convention) - tools/__init__: TOOL_ROLES map, CONFIRM_REQUIRED set, get_tools_for_role(), get_openai_tools_for_role() — both orchestrators now filter tools by caller's role - tools/system: cortex_restart (detached subprocess, 5s delay), cortex_logs (admin-only) - tools/web: http_fetch — direct URL fetch, distinct from web_search - tools/files: file_list (directory listing), file_write (restricted paths, admin-only) - tools/notify: nc_talk_send — proactive outbound via notification.py - orchestrator_engine + openai_orchestrator: user_role param; CONFIRM_REQUIRED tools return a confirmation-request result instead of executing — loop breaks after Claude asks user to confirm in a follow-up message - home/scott/auth.json: role set to admin Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 19:23:53 -04:00
Scott Idem	27ca7c7efd	fix: apply host_type path correction in OpenAI orchestrator The AsyncOpenAI client always appends /chat/completions to base_url. Open WebUI's endpoint is at /api/chat/completions, so for openwebui host_type the base_url must include the /api prefix — same logic as _local() in llm_client.py. Also strip non-standard metadata fields (backend, host, etc.) from session_messages before passing them to the API. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 23:16:33 -04:00
Scott Idem	d9a322164a	feat: OpenAI-compatible orchestrator + backend auto-routing - openai_orchestrator.py — new ReAct tool loop engine for any OpenAI-compatible endpoint (OpenRouter, Open WebUI, Ollama, LiteLLM); model handles both tool loop and final response, no Claude handoff needed - tools/__init__.py — auto-derive OpenAI JSON Schema from existing Gemini FunctionDeclarations so tool definitions have a single source of truth - routers/orchestrator.py — route to openai_orchestrator when model registry "orchestrator" role resolves to a local_openai type host - routers/chat.py — pass role to _backend_label(); fix fallback_used logic (only meaningful for explicit backend overrides, not auto-routing) - static/app.js — add null/"auto" to backend cycle; fetch local model hint without overriding the auto default on page load - model_registry.py — _normalize() back-fills host_type on old registry files - requirements.txt — add openai>=1.0.0 - ARCH__BACKENDS.md — document OpenAI-compat backend and routing logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-08 19:18:18 -04:00

10 Commits