Design Philosophy

The Core Principle

Model provides intelligence, harness provides reliability — this is not a marketing slogan. It is an architectural constraint that shapes every line of code in this system.

The principle emerges from a fundamental observation: large language models are remarkably capable at reasoning, planning, and generating appropriate responses, but they lack the properties required for safe, reliable system operation. They have no concept of timeouts, no understanding of resource constraints, no built-in safety guarantees, and no memory across sessions. Meanwhile, traditional software systems excel at these reliability properties but cannot reason about user intent or choose appropriate actions.

This system sits at this boundary. The model handles the cognitive work — understanding what the user wants, deciding which tools to call, interpreting results, planning the next step. The harness handles the reliability work — ensuring operations execute within bounds, protecting against resource exhaustion, maintaining session state, and enforcing security boundaries.

This separation is not merely convenient. It is necessary because the model can be wrong, but the harness must never fail.

The Pilot and Autopilot Analogy

Consider an airplane: the pilot provides judgment, decision-making, and situational awareness — reading the instruments, assessing weather conditions, deciding on a course of action. The autopilot provides reliability — maintaining altitude, following the flight plan, preventing the plane from stalling or overspeeding, handling turbulence within safe parameters.

Neither the pilot nor the autopilot alone can safely operate an aircraft. The pilot without the autopilot would quickly fatigue and make errors. The autopilot without the pilot cannot handle unexpected situations, assess the reasonableness of a flight path, or make judgment calls.

This is precisely the relationship between the model and the harness:

Aspect	Pilot (Model)	Autopilot (Harness)
Role	Decision-making, judgment	Bounded execution, safety guarantees
Strength	Flexibility, context understanding	Predictability, reliability
Limitation	Can make poor decisions	Cannot reason about intent
Failure Mode	Wrong tool choice, hallucination	Should never fail
Trust Model	Probabilistic	Deterministic

The model (pilot) decides what to do — which files to edit, which commands to run, which approach to try. The harness (autopilot) ensures the execution stays within safe bounds — the tool completes within timeout, the output fits in context, the operation is permitted by policy.

What we chose this over alternatives: A monolithic system where the model directly controls execution without a harness is equivalent to a plane without an autopilot — it may work in ideal conditions but will fail in edge cases. A system where the harness makes all decisions is equivalent to an autopilot that never defers to the pilot — it cannot respond to novel situations.

What the Model Provides: Intelligence

When we say the model provides "intelligence," we mean a specific set of capabilities that the system delegates to the LLM:

Intent Understanding

The model interprets natural language input and extracts what the user is actually trying to accomplish. "Make this function faster" is not a precise instruction — the model must understand what "faster" means in context, which function is being referenced, and what performance characteristics currently exist. This is an intelligence task that no amount of rule-based parsing can replace.

Tool Selection

Given an available toolset (Read, Write, Edit, Glob, Grep, Bash, plus any MCP tools), the model decides which tools to call and in what order. This requires understanding tool capabilities, matching them to the current task, and forming a plan. The harness provides the list of available tools, but the model determines which ones to use.

Result Interpretation

Tool outputs are not self-explanatory. A grep search returns lines of text; the model must interpret whether those results indicate success, failure, or partial progress. A file write operation returns a simple confirmation; the model must decide whether to verify the write, proceed to the next step, or report completion to the user.

Next-Step Planning

The agent loop is iterative. After each tool execution, the model must decide: continue with the next step, try a different approach, ask for clarification, or declare the task complete. This is a planning and reasoning task that requires understanding the overall goal and current progress.

These four capabilities — intent understanding, tool selection, result interpretation, and next-step planning — constitute what we mean by "intelligence" in this architectural context. They are the cognitive work that the LLM performs, and they are impossible to implement with deterministic code.

What the Harness Provides: Reliability

Reliability is not a single property but a collection of guarantees that traditional software systems have long provided:

Permission Control

The model may request any operation, but the harness decides which operations actually execute. Permission control is not advisory — it is a hard gate that blocks operations the user has not approved. The model might ask to delete all files in the project, but the harness intercepts this request and only proceeds if the user's permission policy allows it.

This is the most critical reliability property because the model operates with the user's full privileges. Without permission control, an AI agent would be indistinguishable from a malicious script.

Timeout Protection

The model has no concept of time. It might call a grep search over a million files and wait indefinitely for results. It might execute a shell command that never terminates. The harness enforces timeout bounds — if a tool doesn't complete within the configured duration, it is terminated and an error is returned to the model.

Without timeout protection, a single misbehaving tool could hang the entire session indefinitely.

Output Truncation

Tool outputs can be arbitrarily large. Reading a binary file, running a command that produces megabytes of output, or grep searching a massive codebase could produce results larger than the context window. The harness truncates tool outputs to prevent them from consuming the entire available context.

This is not merely a performance optimization — it is a correctness guarantee. Without truncation, the system could fail to provide the model with the information it needs to complete the task.

Session Persistence

The model is stateless between API calls. The harness maintains conversation history across turns and can persist sessions to disk for later resumption. This allows the user to disconnect and reconnect without losing progress on a long-running task.

Error Recovery

Tools can fail for reasons unrelated to the model's reasoning — network timeouts, file permission issues, invalid tool inputs. The harness catches these errors, formats them appropriately, and presents them to the model in a way that enables recovery. The model can decide to retry, try a different approach, or report the error to the user.

Why This Separation Matters

The separation of intelligence and reliability is not an implementation detail — it is a fundamental architectural requirement that stems from the nature of current AI systems.

The Model Can Be Wrong

LLMs are probabilistic systems. They can hallucinate, misinterpret instructions, choose suboptimal tools, and make mistakes. A model might:

Call the wrong tool for the task
Misinterpret a tool's output
Enter an infinite loop of tool calls
Misunderstand the user's intent

These failures are expected and should be handled gracefully. The harness cannot prevent model errors, but it can contain their impact.

The Harness Must Never Fail

Unlike the model, the harness is deterministic software. It should never:

Crash during permission checks
Fail to enforce timeouts
Lose user data due to session failures
Allow unauthorized operations
Produce unpredictable behavior

These are the same reliability guarantees that any production system must provide. The harness's job is to maintain these guarantees regardless of what the model does.

The Interface Between Them

The model and harness communicate through a well-defined interface. The model receives tool definitions (name, description, input schema) and decides which tools to call. The harness executes tools and returns structured results (text output, errors, success/failure status).

This interface is intentionally narrow. The model cannot bypass the harness, cannot inspect harness internals, and cannot modify harness behavior. The harness is a hard boundary, not a soft layer.

Comparison with Other Approaches

Paradigm	Model Role	Harness Role	Automation Level	Safety Model
Chatbot	Generates text responses only	None (user executes manually)	Minimum	Maximum (user controls all actions)
Script Runner	Generates script arguments	Executes predefined scripts	Medium	User supervision
IDE Assistant	Full access to development environment	Minimal (user supervises)	Maximum	User supervision
claude-code-Go	Controls execution flow, chooses tools	Enforces safety bounds, resource limits	High	Automatic permission enforcement + optional human-in-the-loop

Why we chose this over a chatbot approach: A pure chatbot requires the user to manually execute every action. This provides maximum safety but zero automation. The system needs to take direct actions to be useful as a coding assistant.

Why we chose this over a script runner approach: A script runner limits the model to choosing arguments for predefined scripts. This constrains the model's flexibility — it cannot adapt to novel situations or combine tools in unexpected ways. The model must control the execution flow.

Why we chose this over a full IDE approach: In a full IDE, the AI has complete access to the development environment with safety managed through user supervision. This places too much burden on the user and cannot scale to automated workflows. The harness provides defense in depth without requiring constant attention.

Why we chose this over a bare framework: Many "agent frameworks" provide just the agent loop and leave safety, persistence, and permission to the user. This is appropriate for experimentation but unsuitable for production use. This system is an agent product that ships with all reliability features included.

Design Decisions

Decision	Why We Chose This	Trade-off
Three-tier permission model	Binary allow/deny is insufficient for varied development workflows. A developer might want automatic file editing in a workspace while requiring confirmation for shell commands.	More complex configuration, but enables practical use
Glob rule matching for paths	Allows fine-grained control over which files can be modified. A policy can allow `/src/` while blocking `/config/`.	Requires users to understand glob patterns
250-character tool descriptions	Forces discipline in capability communication; prevents context bloat. The model receives concise, actionable tool definitions.	May omit useful tool details
MAX_TURNS = 50	Prevents resource exhaustion from infinite loops. The model has no intrinsic understanding of resource constraints.	Complex tasks may need more iterations
Three-tier message compaction	Preserves semantic content while managing token limits. Full messages until threshold, then summaries.	Compression is lossy; some context may be lost
Pure Go implementation	No runtime dependencies; single-binary deployment; predictable memory usage and low latency. Aligns with local-first philosophy.	Less flexible than interpreted languages
MCP as external adapter	Protocol-agnostic tool discovery; ecosystem integration. Enables extending capabilities without modifying core code.	MCP servers add external dependencies

The Philosophy in Practice

When building features, the intelligence/reliability separation serves as a decision-making framework:

If a capability requires understanding user intent → model responsibility
If a capability requires choosing between options → model responsibility
If a capability requires guaranteeing safety → harness responsibility
If a capability requires resource management → harness responsibility
If a capability requires persistence across sessions → harness responsibility

This framework isn't perfect — some capabilities straddle the boundary. But it provides a consistent heuristic for architectural decisions.

Architecture Overview — System components and data flow
Agent Loop — State machine mechanics and execution cycle
Permission System — Three-tier model and policy configuration

Design Philosophy ​

The Core Principle ​

The Pilot and Autopilot Analogy ​

What the Model Provides: Intelligence ​

Intent Understanding ​

Tool Selection ​

Result Interpretation ​

Next-Step Planning ​

What the Harness Provides: Reliability ​

Permission Control ​

Timeout Protection ​

Output Truncation ​

Session Persistence ​

Error Recovery ​

Why This Separation Matters ​

The Model Can Be Wrong ​

The Harness Must Never Fail ​

The Interface Between Them ​

Comparison with Other Approaches ​

Design Decisions ​

The Philosophy in Practice ​

Related Documentation ​