Local AI agent — Ollama, multi-provider, full-auto mode
Run a full AI coding agent on your own hardware. Goose connects to Ollama for local inference — no API keys, no cloud dependency, no data leaving your machine. When you need cloud power, switch to Anthropic, OpenAI, or Google with one setting change.
Configure provider, model, and runtime settings — one panel for local and cloud inference.
Run open-weight models on your hardware. No API keys, no usage fees, no data leaving your machine. Ollama manages model downloads, quantisation, and GPU acceleration.
Switch between Ollama (local), Anthropic, OpenAI, Google, Groq, Mistral, Bedrock, Azure, and Databricks. Use local models for privacy-sensitive work, cloud models when you need more capability.
Run Goose in full-auto mode — no approval prompts, no human-in-the-loop pauses. The agent executes its entire plan autonomously. Ideal for bulk operations, migrations, and test generation.
Communicates via Agent Communication Protocol (ACP) over JSON-RPC 2.0. Structured message passing with streaming support — not raw stdin/stdout parsing.
Each agent slot can use a different Goose provider and model. Run your security reviewer on a local model for privacy, and your coder on Claude for capability — in the same workspace.
Setup wizard validates your configuration. Status page shows process state, model loaded, and connection health. Automatic restart on crash with configurable retry policy.
Local inference via Ollama means zero data leaves your machine. When you need cloud capability, you choose which provider and which model — not the platform.
Goose ships with the Studio. No extra install, no extra cost.