boocode

Author	SHA1	Message	Date
indifferentketchup	a7104691aa	v1.12.2: live tok/s + ctx display next to status indicator ChatThroughput renders inline beside StatusDot while streaming or tool_running. Subscribes to existing usage frames via sessionEvents. Hides when status drops to idle/error or data is older than 10s. Addresses the 2026-05-21 spike's UX gap where slow streams looked identical to dead streams — now there's a live token velocity readout that immediately distinguishes the two. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-21 20:45:53 +00:00
indifferentketchup	dc43dd44f9	v1.11: opencode-style compaction port - compaction.ts: usable/isOverflow/estimate/turns/select/buildPrompt/process - compaction-prompt.ts: SUMMARY_TEMPLATE verbatim from opencode - schema: messages.{compacted_at,summary,tail_start_id} + chats.needs_compaction - inference: auto-trigger on overflow, pre-fetch compaction before next turn - /compact slash command rewired to new path - WS: chat_status working/idle around compaction + compacted frame - frontend: SummaryCard + sonner toast on compacted - 24 unit tests for pure functions	2026-05-20 19:05:35 +00:00
indifferentketchup	5c61cc7281	v1.8.2: tool loop cap-hit summary + tool call UI compaction Old hardcoded MAX_TOOL_LOOP_DEPTH=15 replaced by per-agent max_tool_calls (1-100, AGENTS.md frontmatter) with defaults: 30 for read-only-only agents, 10 for agents that include any non-read-only tool, 15 for raw chat. When the loop hits cap, fire one final summary call with tools disabled, stream the wrap-up into the in-flight assistant message, then insert a system sentinel with metadata.kind='cap_hit'. The sentinel renders an amber bubble with a Continue button (latest sentinel only) that POSTs to a new /api/chats/:id/continue route to extend. Hard ceiling: 3 cap-hits per chat (2 continues max) — third sentinel reports can_continue=false. Error frames carry a machine-readable reason code alongside human error text. Failed messages persist the reason via metadata.kind='error' so the bubble renders specifics on reload (WS error frame is one-shot). Tool call UI rewired: ToolCallLine renders inline (↳ name args spinner/check/✗, expand-on-tap for args+result); ToolCallGroup collapses 3+ consecutive same-tool runs into a compact card. MessageList owns a three-pass pre-render (flatten + fold tool results onto matching runs by id + group same-tool runs + number sentinels). MessageBubble drops tool rendering and adds the sentinel / error-reason branches. ToolCallCard deleted. Roadmap follow-up logged: add explicit max_tool_calls: 30 to the 6 agents in /data/AGENTS.md and /opt/boocode/AGENTS.md post-ship for discoverability (defaults handle behavior identically). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-17 10:31:32 +00:00
indifferentketchup	12d91c9a12	v1.8.1: global agents + parser robustness + WS reconnect toast Builtins move out of code into /data/AGENTS.md (always-on, mounted ro into the container); per-project AGENTS.md is now an optional override. agents.ts merges global + project entries with project-wins-by-name and caches per-source mtimes (60s TTL). Parser switches to per-block try/catch and returns AgentsResponse { agents, errors[] } so one malformed block no longer fails the file. AgentPicker shows a non-blocking amber chip listing skipped blocks and only fires a gray toast when zero agents loaded. WS reconnect UX (useUserEvents + useSessionStream) now silent on the first disconnect; createWsReconnectToast escalates to gray after 3 failures or 15 s, then to red with a Retry Now action after 60 s. useSessionStream also gained the exponential-backoff reconnect it was missing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 23:16:02 +00:00
indifferentketchup	2f6be39efd	chore: surface swallowed errors + remove dead session_renamed paths Swallowed-error logging (audit Feature 3): - file_index.ts:36-37 (git mtime probes): comment — best-effort, project may not be a git repo. - useUserEvents.ts:44 / 53 (ws.close on error / unmount): comments — best-effort, socket may already be closing. - RightRail.tsx:38 (localStorage write): comment — best-effort, quota or private mode. - App.tsx:21 (api.sessions.get for RightRail projectId): replaced silent catch with console.warn. - Session.tsx:38, 41 (session fetch + project list for breadcrumb): replaced silent catches with console.warn. H1: ProjectSidebar.tsx:189 — dropped the local sessionEvents.emit ({type:'session_renamed'}) after PATCH. Server publishes via broker.publishUser since v1.4; useUserEvents forwards. H2: useSessionStream.ts session_renamed case removed (dead — no server code path publishes session_renamed on the per-session WS channel; only user channel via broker.publishUser). Also dropped the session_renamed variant from WsFrame (in apps/web/src/api/types.ts) to keep the discriminated-union switch exhaustive. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 04:35:49 +00:00
indifferentketchup	c35ec65fc4	batch4: chats-in-sessions, force-send, /compact, right-rail file browser Session 1:N Chat data model with backfill. Workspace switches to client-side multi-tab pane management. Right-rail file browser with float-over viewer and click-drag line selection replaces FileBrowserPane. Adds /compact streaming summarizer (respects compact markers in context builder), force-send (cancels in-flight, persists partial as 'cancelled', awaits cancellation completion via deferred Promise + 5s timeout), message queue, stop generation, chat auto-rename, session archive/unarchive with Closed Sessions section on repo landing page. CHECK constraints on sessions.status, messages.role, messages.status with KEEP IN SYNC comments tying to MESSAGE_ROLES / MESSAGE_STATUSES const arrays. Deletes dead pane routes/hook and the api.panes.* client block. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 20:39:48 +00:00
indifferentketchup	2464d23bb6	v1.1 batch 1: markdown, message actions, tok/s+ctx, AI naming Four features land together on this branch: 1. Markdown rendering — assistant messages go through react-markdown + remark-gfm. Fenced code blocks render via existing CodeBlock (with copy button); inline `code` is styled inline. User messages stay plain text. No raw HTML (no rehype-raw). 2. Per-message Copy + Regenerate. New endpoint POST /api/sessions/:id/messages/:message_id/regenerate validates the target (404/400/409), atomically deletes the target plus any later messages in the session, inserts a fresh streaming assistant row, and enqueues a normal inference run. The DELETE bound uses a SQL subquery (`created_at >= (SELECT created_at FROM messages WHERE id = $1)`) instead of a JS round-trip so postgres TIMESTAMPTZ µs precision is preserved — otherwise sub-ms clock_timestamp() differences between the user row and the assistant row collapsed to the same JS Date, pulling the triggering user message into the >= bound. New `messages_deleted` WS frame so already-connected clients prune the stale tail without needing a full snapshot resend. 3. tok/s + ctx counter. Five new nullable message columns: tokens_used, ctx_used, ctx_max, started_at, finished_at. started_at is set right before the OpenAI call in services/inference.ts (not in the route, not in the frame handler); finished_at + tokens_used + ctx_used + ctx_max are committed in the same UPDATE that flips status to 'complete'. The inference request now opts into stream_options.include_usage so the final chunk carries usage; defensive parsing also picks up timings.n_ctx when llama.cpp emits it (currently absent for our llama-swap models, so ctx_max stays NULL and the UI just shows `<used> ctx`). message_complete frame extended with tokens_used / ctx_used / ctx_max / started_at / finished_at / model. Frontend StatsLine in MessageBubble computes tok/s client-side from the timestamps and renders muted mono text below the body of completed assistant messages. 4. AI chat naming after the first turn. Backend services/auto_name.ts runs via setImmediate after the top-level inference resolves; it checks that there is exactly one completed assistant message and that the session has not been user-renamed (`name IS NULL OR name = '' OR name = 'New session'`), then fires a single non-streaming chat completion with the spec prompt. Qwen3 chat templates emit chain-of- thought into reasoning_content and burn the entire max_tokens budget without producing visible output, so the request includes `chat_template_kwargs: { enable_thinking: false }` and max_tokens=30. Title is trimmed, quote-stripped, "Title:" prefix dropped, and truncated to 60 chars before a guarded UPDATE on sessions.name. New `session_renamed` WS frame propagates to the open session view directly and to the project's session list via a tiny module-scope event bus (apps/web/src/hooks/sessionEvents.ts) — kept dumb: one event type, two methods, no library. Cleanups: dropped the now-unused splitCodeBlocks export from CodeBlock.tsx (react-markdown supersedes it), and added a long-form NOTE in auto_name.ts documenting the enable_thinking + max_tokens pattern for any future Qwen- family non-streaming utility calls (planned: fork-message, agent-routing, web-search summarization). Schema bootstrap remains idempotent (ADD COLUMN IF NOT EXISTS). Auth, broker, clock_timestamp() conventions, and zod validation all unchanged. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 22:52:40 +00:00
indifferentketchup	a7f218e182	initial	2026-05-14 19:24:50 +00:00

8 Commits