AI Models

Agent HQ supports multiple AI model providers, letting you choose the right model for each job.

Claude models excel at complex reasoning, code generation, and following detailed instructions.

  • Claude Opus 4.6 — Most capable model for complex tasks.
  • Claude Sonnet 4.6 — Strong balance of speed and capability.
  • Claude Sonnet 4.5 — Fast and capable. Great for most code tasks.
  • Claude Haiku 4.5 — Fastest and cheapest. Good for simple tasks and quick iterations.

OpenAI models are also supported:

  • GPT-4o — Strong general-purpose model for code and reasoning.
  • GPT-4o Mini — Faster, lower cost option for lightweight tasks.

Gemini models offer fast responses with strong reasoning capabilities.

  • Gemini 2.5 Pro — Most capable Gemini model.
  • Gemini 2.5 Flash — Fast iteration speed with good code quality.

Cloudflare Workers AI offers free-tier models that run on Cloudflare’s edge network. No API key is required.

  • Llama 3.3 70B — open-source model available at no cost
  • No external API calls — runs entirely on Cloudflare’s infrastructure

When creating a new conversation, you can select a model using the model selector in the chat interface. All tasks created from that conversation will use the selected model by default.

When creating a task directly from the task board, you can choose a model in the New Task Dialog.
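
The defaulting behavior described above can be pictured as a simple fallback: a task uses its own model if one was chosen for it, and otherwise inherits the model of the conversation it came from. The sketch below is illustrative only. `Conversation`, `Task`, and `resolve_model` are hypothetical names, not part of Agent HQ's API, and the per-task override is an assumption inferred from the phrase "by default".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Conversation:
    # Model picked in the chat's model selector (hypothetical field).
    model: str


@dataclass
class Task:
    # Conversation the task was created from, if any.
    conversation: Optional[Conversation] = None
    # Set only when a model was chosen explicitly for this task,
    # for example in the New Task Dialog.
    model: Optional[str] = None


def resolve_model(task: Task) -> str:
    """Return the task's explicit model, else the conversation's model."""
    if task.model is not None:
        return task.model
    if task.conversation is not None:
        return task.conversation.model
    raise ValueError("task has neither an explicit model nor a conversation")
```

For example, `resolve_model(Task(conversation=Conversation("Claude Sonnet 4.5")))` falls back to the conversation's model, while passing `model="Claude Haiku 4.5"` to the task takes precedence.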

Use case                         Recommended model
Complex code generation          Claude Opus 4.6 or Sonnet 4.6
Quick fixes and small changes    Claude Haiku 4.5 or Gemini Flash
General coding tasks             Claude Sonnet 4.5 or GPT-4o
Experimentation / low budget     Cloudflare Workers AI (free)
Large refactoring tasks          Claude Opus 4.6
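
If you want to apply these recommendations in a script, one option is a plain lookup table. The sketch below is hypothetical: `RECOMMENDED_MODELS` and `recommend` are illustrative names, not an Agent HQ feature. It also assumes that "Gemini Flash" refers to Gemini 2.5 Flash and that the free Cloudflare Workers AI option means Llama 3.3 70B, the only Workers AI model listed on this page.

```python
# Use case -> recommended models, in the order given in the table above.
RECOMMENDED_MODELS = {
    "complex code generation": ["Claude Opus 4.6", "Claude Sonnet 4.6"],
    "quick fixes and small changes": ["Claude Haiku 4.5", "Gemini 2.5 Flash"],
    "general coding tasks": ["Claude Sonnet 4.5", "GPT-4o"],
    "experimentation / low budget": ["Llama 3.3 70B"],
    "large refactoring tasks": ["Claude Opus 4.6"],
}


def recommend(use_case: str) -> list[str]:
    """Return recommended models for a use case, or an empty list if unknown."""
    key = use_case.strip().lower()
    # Copy the list so callers cannot mutate the shared table.
    return list(RECOMMENDED_MODELS.get(key, []))
```

Lookups are case-insensitive, and an unrecognized use case returns an empty list rather than raising, so the caller can decide on a fallback.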

Models that support extended thinking (like Claude) show their reasoning process in real time. You’ll see a collapsible “Thinking” section in the chat that reveals Pilot’s chain of thought, including timing information for each reasoning step.

This is useful for understanding why Pilot made certain decisions.

To use models from Anthropic, OpenAI, or Google, your workspace needs the corresponding API keys configured.

Cloudflare Workers AI models work out of the box with no additional configuration.
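
The key requirement can be summarized as a filter: a model is usable if its provider has a key configured, or if the provider needs no key at all. The following is a minimal sketch under that reading. The provider identifiers (`"anthropic"`, `"openai"`, `"google"`, `"cloudflare"`) and the function `available_models` are assumptions made for illustration, not Agent HQ configuration names.

```python
# Model -> provider, using the models listed on this page.
# Provider identifiers are hypothetical labels.
MODEL_PROVIDERS = {
    "Claude Opus 4.6": "anthropic",
    "Claude Sonnet 4.6": "anthropic",
    "Claude Sonnet 4.5": "anthropic",
    "Claude Haiku 4.5": "anthropic",
    "GPT-4o": "openai",
    "GPT-4o Mini": "openai",
    "Gemini 2.5 Pro": "google",
    "Gemini 2.5 Flash": "google",
    "Llama 3.3 70B": "cloudflare",
}

# Providers that work without an API key.
KEYLESS_PROVIDERS = {"cloudflare"}


def available_models(configured_providers: set[str]) -> list[str]:
    """List models usable given the providers that have API keys configured."""
    usable = set(configured_providers) | KEYLESS_PROVIDERS
    return [model for model, provider in MODEL_PROVIDERS.items() if provider in usable]
```

With no keys configured, only the Workers AI model is returned; adding `"google"` to the set makes the two Gemini models available as well.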

Every message and task tracks token usage (input and output tokens). Combined with model pricing, this lets you see the cost of each task. See Budget & Cost Tracking for details.
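
As a rough illustration of how token counts and pricing combine, the sketch below computes a cost from input and output tokens. It assumes prices are quoted per million tokens with separate input and output rates, which is a common convention but not confirmed here for Agent HQ. The function name and the prices in the usage note are placeholders, not real rates.

```python
def task_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_mtok: float,
    output_price_per_mtok: float,
) -> float:
    """Estimate cost in USD from token counts and per-million-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * input_price_per_mtok + output_tokens * output_price_per_mtok
    return total / 1_000_000
```

With placeholder prices of 2.00 USD per million input tokens and 10.00 USD per million output tokens, a task that used 12,000 input tokens and 3,000 output tokens would be estimated by `task_cost_usd(12_000, 3_000, 2.0, 10.0)`.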