Agent Loop
Streams model output, detects tool calls, executes work, observes the result, and feeds evidence back into the next turn.
Open agent infrastructure
Agent Harness is the operating layer around an LLM: tools, skills, memory, permissions, observation, and coordination that turn model output into reliable, reviewable work.
An Agent Harness is the complete infrastructure that wraps around a language model and lets it behave like a working agent. The model provides intelligence; the harness provides controlled action, observation, memory, and operating boundaries.
OpenHarness frames this idea as lightweight core infrastructure for tool-use, skills, memory, and multi-agent coordination. This site summarizes that pattern for builders evaluating how agent systems should be assembled.
In practice, the agent harness is the difference between a model that can answer and a system that can inspect files, call tools, enforce approvals, preserve context, and recover from failed steps.
Core components
Streams model output, detects tool calls, executes work, observes the result, and feeds evidence back into the next turn.
Exposes file, shell, search, web, MCP, and domain-specific tools through predictable interfaces the agent can call.
Loads specialized instructions, hooks, prompts, and extensions only when they matter for the current task.
Maintains project instructions, session history, persistent notes, compaction, and resumable task state.
Applies permission modes, path rules, command rules, approvals, and pre/post tool hooks before side effects happen.
Delegates tasks, tracks background workers, manages team state, and merges results into the primary agent session.
Architecture flow
Reference implementation
HKUDS/OpenHarness is an MIT-licensed Python implementation of agent harness infrastructure. It supports provider setup, a terminal UI, tool execution, skills, memory, permissions, MCP integration, and multi-agent coordination primitives.
FAQ
No. The LLM is the reasoning engine. The harness is the surrounding runtime that gives it tools, state, rules, and feedback.
Start with tool boundaries, permission behavior, observability, memory strategy, and whether the loop can recover from failed actions.
Tool contracts, permission policy, memory retention, logs, retries, and the release process around the harness all matter before adding more model complexity.
No. It is an independent Agent Harness keyword reference that links to OpenHarness as a public source.