Agent System¶

OtterBot's multi-agent architecture uses a hierarchy of specialized AI agents, each with distinct roles, tools, and capabilities.

Agent Roles & Hierarchy¶

Every agent instance is created from a template in the Agent Registry. Each template defines a role, which determines the agent's position in the hierarchy:

Role	Count	Responsibility
`coo`	Exactly 1	Primary orchestrator. Receives CEO goals, creates projects, writes charters, delegates to Team Leads. Maintains conversation context per chat thread.
`team_lead`	1 per project	Project manager. Breaks directives into Kanban tasks, spawns workers, monitors progress, reports results to the COO.
`admin_assistant`	Exactly 1	Personal productivity agent. Manages todos, reads/sends email via Gmail, and manages Google Calendar events. Reports to the CEO alongside the COO.
`scheduler`	Exactly 1	Background agent that runs recurring tasks on configurable intervals (e.g., daily summaries, periodic monitoring). Operates autonomously.
`worker`	1 per task	Individual contributor. Executes a single task using its assigned tools. Reports completion back to its Team Lead.
`specialist_agent`	1 per specialist	Extension agent spawned by a Specialist. Has access to an isolated knowledge store, custom tools, and data ingestion pipelines. Queries are routed through the COO.

Info

The CEO is you. The human user is always the CEO -- the top of the hierarchy. All work flows down from your goals and back up as results.

Built-in Templates¶

OtterBot ships with 19 built-in agent templates. These cannot be modified or deleted, but they can be cloned and customized.

Management Templates¶

COO¶

Role: COO

Chief Operating Officer. Receives goals from the CEO and delegates to Team Leads.

Capabilities: management, delegation, coordination, github

Tools: run_command, create_project, send_directive, update_charter, update_project_status, get_project_status, manage_models, manage_search, web_search, manage_packages, delegate_to_admin, memory_save, github_list_issues, github_get_issue, github_list_prs, github_get_pr

Default Model: claude-sonnet-4-5-20250929 (Anthropic)

Team Lead¶

Role: Team Lead

Manages a team of workers. Breaks directives into tasks and assigns them.

Capabilities: management, planning, coordination, github, issues, pull-requests

Tools: search_registry, spawn_worker, web_search, report_to_coo, create_task, update_task, list_tasks, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Default Model: claude-sonnet-4-5-20250929 (Anthropic)

Worker Templates¶

Coder¶

Role: Worker

Writes, edits, and debugs code across multiple languages.

Capabilities: code, typescript, python, debugging, github, issues, pull-requests

Tools: file_read, file_write, shell_exec, install_package, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Researcher¶

Role: Worker

Searches the web, reads content, and synthesizes findings.

Capabilities: research, analysis, summarization, github, issues, pull-requests

Tools: web_search, web_browse, file_read, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Reviewer¶

Role: Worker

Reviews code for quality, correctness, and best practices.

Capabilities: code-review, testing, quality, github, issues, pull-requests

Tools: file_read, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Writer¶

Role: Worker

Creates documentation, specifications, and written content.

Capabilities: writing, documentation, specs

Tools: file_read, file_write

Planner¶

Role: Worker

Designs architectures, plans implementations, and decomposes problems.

Capabilities: planning, architecture, decomposition

Tools: file_read, file_write

Security Reviewer¶

Role: Worker

Audits code for security vulnerabilities and compliance issues.

Capabilities: security, code-review, vulnerability-analysis, github, issues, pull-requests

Tools: file_read, shell_exec, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Tester¶

Role: Worker

Writes and runs tests, identifies edge cases, and ensures quality.

Capabilities: testing, test-writing, qa, edge-cases, github, issues, pull-requests

Tools: file_read, file_write, shell_exec, install_package, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Browser Agent¶

Role: Worker

Navigates websites, scrapes content, fills forms, and interacts with web applications.

Capabilities: browser, web-scraping, form-filling, web-interaction

Tools: web_browse, file_read, file_write

OpenCode Coder¶

Role: Worker

Delegates coding tasks to the OpenCode CLI via a managed PTY session.

Capabilities: code, opencode, autonomous-coding, refactoring, github, issues, pull-requests

Tools: opencode_task, file_read, shell_exec, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Claude Code Coder¶

Role: Worker

Delegates coding tasks to the Claude Code CLI via a managed PTY session.

Capabilities: code, claude-code, autonomous-coding, refactoring, github, issues, pull-requests

Tools: opencode_task, file_read, shell_exec, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Codex Coder¶

Role: Worker

Delegates coding tasks to the OpenAI Codex CLI via a managed PTY session.

Capabilities: code, codex, autonomous-coding, refactoring, github, issues, pull-requests

Default Model: gpt-5.3-codex-medium (OpenAI)

Tools: opencode_task, file_read, shell_exec, github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

Gemini CLI Coder¶

Role: Worker

Delegates coding tasks to the Google Gemini CLI via a managed PTY session.

Capabilities: code, gemini-cli, autonomous-coding, refactoring

Default Model: gemini-2.5-flash (Google)

Tools: opencode_task, file_read, shell_exec

Triage¶

Role: Worker

Triages GitHub issues by analyzing, categorizing, and prioritizing them. Used in the code review pipeline.

Capabilities: github, issues, pull-requests

Tools: github_get_issue, github_list_issues, github_get_pr, github_list_prs, github_comment, github_create_pr

SSH Administrator¶

Role: Worker

Manages remote servers via SSH — executes commands, manages connections, and performs system administration tasks.

Capabilities: ssh, remote-management, system-administration

Tools: ssh_exec, ssh_list_keys, ssh_list_hosts, ssh_connect

Tool Builder¶

Role: Worker

Creates custom JavaScript tools at runtime that other agents can use.

Capabilities: tool-building, javascript, api-integration

Tools: file_read, shell_exec, create_custom_tool, list_custom_tools, update_custom_tool, test_custom_tool

Demo Recorder¶

Role: Worker

Records video demos of running web applications with optional voiceover narration. Produces YouTube-ready MP4 videos.

Capabilities: demo, video, recording, screen-recording, voiceover, narration

Tools: demo_record, web_browse, file_read, file_write

Specialized Templates¶

Admin Assistant¶

Role: Admin Assistant

Personal productivity agent. Manages todos, reads/sends email, and manages calendar events.

Capabilities: todos, email, calendar, productivity

Tools: todo_list, todo_create, todo_update, todo_delete, email_list, email_read, email_send, email_reply, email_folder, email_archive, calendar_list_events, calendar_create_event, calendar_update_event, calendar_delete_event, calendar_list_calendars, memory_save, get_current_time, schedule_list, schedule_create, schedule_update, schedule_delete

Skills-Derived Capabilities

Agent template capabilities and tools are now derived from assigned skills, not stored directly on the registry entry. When you assign skills to a template, the tools and capabilities defined by those skills are automatically merged into the agent's configuration.

Agent Lifecycle¶

Every agent instance transitions through a defined set of statuses:

Idle -----> Thinking -----> Acting -----> Done
|                             |  |
|          (on error)         |  +--> Awaiting Input
+---------> Error <-----------+

Status	Description
`Idle`	Agent is created but not yet processing. Waiting for a message or task.
`Thinking`	Running LLM inference. Streaming tokens to the frontend via `agent:stream` or `coo:stream`.
`Acting`	Executing a tool call (file I/O, shell command, web browse, etc.). Tool invocations are broadcast via `agent:tool-call`.
`Awaiting Input`	Agent is waiting for user input (e.g., coding agent permission request or interactive prompt).
`Done`	Task completed successfully. Results reported to parent agent.
`Error`	An error occurred during thinking or acting. Error details logged.

The BaseAgent class manages this lifecycle, including LLM streaming with configurable timeouts (30s for first chunk, 120s between chunks) and a serialized message queue to prevent concurrent processing.

Tool Capabilities¶

Workers use tools to interact with the environment. Each tool is injected via a ToolContext based on the agent's template configuration.

Tool	Description	Used By
`file_read`	Read file contents from the workspace	Coder, Researcher, Reviewer, Writer, Planner, Security Reviewer, Tester, Browser Agent
`file_write`	Create or overwrite files in the workspace	Coder, Writer, Planner, Browser Agent
`shell_exec`	Execute shell commands (respects `SUDO_MODE`)	Coder, Security Reviewer, Tester, Tool Builder
`web_search`	Search the web using the configured provider	Researcher, COO, Team Lead
`web_browse`	Navigate and extract content from web pages (Playwright)	Researcher, Browser Agent
`install_package`	Install apt or npm packages in the container	Coder, Tester
`opencode_task`	Delegate a coding task to an external CLI agent (OpenCode, Claude Code, Codex, or Gemini CLI) via PTY	OpenCode Coder, Claude Code Coder, Codex Coder, Gemini CLI Coder
`demo_record`	Record video demos of web applications with Playwright video capture, optional TTS voiceover, and FFmpeg post-processing to MP4	Demo Recorder
`create_custom_tool`	Create a custom JavaScript tool at runtime, sandboxed via `isolated-vm`	Tool Builder
`list_custom_tools`	List all custom tools	Tool Builder
`update_custom_tool`	Update an existing custom tool	Tool Builder
`test_custom_tool`	Test a custom tool in the sandbox	Tool Builder
`get_current_time`	Get the current date and time	Admin Assistant
`todo_list`	List personal todo items	Admin Assistant
`todo_create`	Create a new todo item	Admin Assistant
`todo_update`	Update a todo item	Admin Assistant
`todo_delete`	Delete a todo item	Admin Assistant
`email_list`	List email messages from the inbox	Admin Assistant
`email_read`	Read a specific email message by ID	Admin Assistant
`email_send`	Send a new email via SMTP	Admin Assistant
`email_reply`	Reply to an existing email message	Admin Assistant
`email_folder`	Move an email to a different folder	Admin Assistant
`email_archive`	Archive an email (move out of inbox)	Admin Assistant
`calendar_list_events`	List Google Calendar events	Admin Assistant
`calendar_create_event`	Create a Google Calendar event	Admin Assistant
`calendar_update_event`	Update a Google Calendar event	Admin Assistant
`calendar_delete_event`	Delete a Google Calendar event	Admin Assistant
`calendar_list_calendars`	List available Google Calendars	Admin Assistant
`schedule_list`	List all scheduled tasks	Admin Assistant
`schedule_create`	Create a new recurring scheduled task	Admin Assistant
`schedule_update`	Update an existing scheduled task	Admin Assistant
`schedule_delete`	Delete a scheduled task	Admin Assistant
`memory_save`	Save an episodic memory for later semantic retrieval	Admin Assistant, COO
`github_get_issue`	Fetch a GitHub issue by number	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, COO, Team Lead
`github_list_issues`	List GitHub issues with filters	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, COO, Team Lead
`github_get_pr`	Fetch a GitHub pull request	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, COO, Team Lead
`github_list_prs`	List GitHub pull requests	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, COO, Team Lead
`github_comment`	Comment on a GitHub issue or PR	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, Team Lead
`github_create_pr`	Create a GitHub pull request	Coder, Researcher, Reviewer, Security Reviewer, Tester, Triage, Team Lead
`ssh_exec`	Execute a command on a remote host via SSH	SSH Administrator
`ssh_list_keys`	List available SSH keys	SSH Administrator
`ssh_list_hosts`	List allowed SSH hosts	SSH Administrator
`ssh_connect`	Start an interactive SSH session	SSH Administrator

Customizing Agents¶

You can create custom agent templates through the Agent Registry. Custom templates let you define specialized agents with tailored system prompts, capabilities, and tool sets.

Registry Operations¶

List -- View all templates (built-in + custom)
Create -- Create a new custom template from scratch
Clone -- Duplicate a built-in or custom template as a starting point
Update -- Modify a custom template (built-in templates are immutable)
Delete -- Remove a custom template (built-in templates cannot be deleted)
Search -- Find templates by capability keyword

RegistryEntry Fields¶

{
  "id": "custom-api-tester",
  "name": "API Tester",
  "description": "Tests REST APIs...",
  "systemPrompt": "You are an API testing specialist...",
  "promptAddendum": null,
  "capabilities": ["testing", "api", "http"],
  "defaultModel": "claude-sonnet-4-5-20250929",
  "defaultProvider": "anthropic",
  "tools": ["shell_exec", "file_read", "web_browse"],
  "builtIn": false,
  "role": "worker",
  "modelPackId": null,
  "gearConfig": null,
  "clonedFromId": "builtin-coder"
}

The gearConfig field lets you assign a 3D model pack to an agent for the Live View visualization.

Extended Thinking¶

OtterBot supports extended thinking (reasoning) for models that support it (e.g., Claude Sonnet). When enabled, agents emit coo:thinking / agent:thinking events with reasoning tokens before the final response.

Thinking tokens are streamed in real-time to the frontend
A coo:thinking-end / agent:thinking-end event signals the end of the reasoning block
The final response follows as normal coo:stream / agent:stream tokens

This allows the UI to show the agent's reasoning process separately from its final output, giving transparency into how decisions are made.