The OrchestratorOverviewHow It Connects
The StudioOverviewExtensionsHow It Works
ResourcesBlogFAQAbout
Get in touch
Studio Extension

Goose

Local AI agent — Ollama, multi-provider, full-auto mode

Run a full AI coding agent on your own hardware. Goose connects to Ollama for local inference — no API keys, no cloud dependency, no data leaving your machine. When you need cloud power, switch to Anthropic, OpenAI, or Google with one setting change.

Goose configuration
Goose configuration
Local inference with Llama
Local inference with Llama
Agent slot configuration
Agent slot configuration
Ollama local Qwen chat
Ollama local Qwen chat

Configure provider, model, and runtime settings — one panel for local and cloud inference.

Capabilities

What it does

Local Inference via Ollama

Run open-weight models on your hardware. No API keys, no usage fees, no data leaving your machine. Ollama manages model downloads, quantisation, and GPU acceleration.

Multi-Provider Fallback

Switch between Ollama (local), Anthropic, OpenAI, Google, Groq, Mistral, Bedrock, Azure, and Databricks. Use local models for privacy-sensitive work, cloud models when you need more capability.

Full-Auto Mode

Run Goose in full-auto mode — no approval prompts, no human-in-the-loop pauses. The agent executes its entire plan autonomously. Ideal for bulk operations, migrations, and test generation.

ACP Protocol

Communicates via Agent Communication Protocol (ACP) over JSON-RPC 2.0. Structured message passing with streaming support — not raw stdin/stdout parsing.

Per-Slot Configuration

Each agent slot can use a different Goose provider and model. Run your security reviewer on a local model for privacy, and your coder on Claude for capability — in the same workspace.

Health Monitoring

Setup wizard validates your configuration. Status page shows process state, model loaded, and connection health. Automatic restart on crash with configurable retry policy.

How it works

From install to first use.

1
Choose your providerThe setup wizard guides you through provider selection. Choose Ollama for local inference (downloads and manages models) or configure a cloud provider with API keys.
2
Assign to agent slotsEach specialist agent in your workspace can use Goose with a different configuration. Your privacy-focused agent runs local; your capability-focused agent runs cloud.
3
Agent works autonomouslyIn full-auto mode, Goose executes without approval prompts. It reads files, writes code, runs tests, and iterates — all within the boundaries you've set.
4
Switch providers any timeChange from local to cloud (or back) with one setting change. No workflow changes, no reconfiguration. The agent's identity and history persist.
Why local matters

Your inference, your hardware, your choice.

Local inference via Ollama means zero data leaves your machine. When you need cloud capability, you choose which provider and which model — not the platform.

0data sent to cloud with Ollama
9+providers — local and cloud
Freelocal inference with open-weight models
1 clickto switch between local and cloud

Goose ships with the Studio. No extra install, no extra cost.