SneakyCode

Author	SHA1	Message	Date
Phillip Tarrant	f0d8ef8f0a	feat: add thinking mode toggle to suppress reasoning-only response loops Adds `llm.thinking` config option (default: true) that when disabled: - Injects /no_think into the last user message for Qwen 3.x compatibility - Sends chat_template_kwargs in API payload for backends that support it - Silently and immediately nudges on reasoning-only responses instead of showing warnings and wasting retry iterations Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 19:34:36 -05:00
Phillip Tarrant	220c6613e4	feat: add extra_body config for model-specific API parameters Allows passing arbitrary parameters (e.g., enable_thinking, reasoning_effort) to the LLM API request body via config.yaml, solving reasoning-only response loops with models like Qwen 3.x without requiring code changes per model. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 19:15:35 -05:00
Phillip Tarrant	3f9012e6c2	feat: implement tweaks plan - modals, smart shell, spinner, /models, debug log, skills Phase 1: Permission modal dialog, session resume modal, HistoryInput with up/down arrow cycling, remove "You:" echo from chat log, LLM client cleanup on unmount. Phase 2: Smart shell auto-approve using allowed/denied command lists from ToolsConfig, animated thinking spinner with live token count in status bar. Phase 3: /models slash command (list + switch), CLI directory positional argument, JSONL debug logger with rotation. Phase 4: Skills system with SkillsManager, load_skill LLM tool, /skills listing, skill invocation via slash commands, system prompt integration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 15:46:44 -05:00
Phillip Tarrant	76ba490aa2	Add Phase 7: polish and hardening — retry, truncation, sessions, shutdown - Config extensions: retry backoff, truncation threshold, session persistence - LLM retry with exponential backoff + jitter on transient errors (5xx, connection) - Conversation truncation: drops oldest messages preserving first user + recent N - Session persistence: auto-save/restore with atomic writes, cleanup of old files - Graceful shutdown: SIGTERM handler, cancel() on AgentLoop, save-on-exit - Partial message recovery on mid-stream interruption - New slash commands: /save, /session - 18 new tests (5 retry, 5 truncation, 4 session, 4 integration workflows) - README.md and docs/tools.md documentation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 10:20:16 -05:00
Phillip Tarrant	91187a0728	Add Phase 5: ReAct-style agent loop with tool execution Implement the core autonomy layer — AgentLoop streams LLM responses, parses tool calls, executes them with permission checks, feeds results back, and repeats until the task completes or finish is called. - Add FinishTool for explicit loop termination - Add tools parameter to LLMClient.stream_chat() for function calling - Add compact tool result display (status line, not full output) - Refactor REPL to delegate to AgentLoop.run_turn() - Fix Ollama null content rejection (always send content as string) - Add finish to auto_approve permissions - 9 unit tests for agent loop (34 total, zero regressions) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 08:37:22 -05:00
Phillip Tarrant	adbb442ce5	Add Phase 3: LLM integration with Ollama streaming and preflight checks Wire the REPL to a local Ollama instance via streaming HTTP (SSE). LLMClient handles async streaming chat, StreamHandler renders live Markdown via Rich and accumulates tool call fragments. Startup now runs a preflight check that verifies Ollama is reachable and the configured model is pulled, exiting with a clear message on failure. Also adds .gitignore and updates config to use qwen3.5. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 07:27:56 -05:00
Phillip Tarrant	5aff2183d6	init commit	2026-03-11 07:21:21 -05:00

7 Commits