avacli is a single native binary that gives you an autonomous coding agent with a local web UI, shell access, file tools, and xAI Grok integration. No accounts, no cloud, no vendor lock-in.
One binary. No daemon, no Docker, no cloud account. Install it, add your xAI key, and start building.
Runs on your machine or VPS. Your code and API keys never leave your environment. No cloud dependency, no telemetry.
Reads, writes, and searches code across your entire project. Runs shell commands and plans multi-step tasks on its own.
Uses Grok models via your own xAI API key. You pay xAI directly at their published rates — zero markup, zero middlemen.
Streaming chat with reasoning display, WebIDE, file browser, and settings — all served from the binary on localhost:8080.
File I/O, regex search, shell execution, web & X search, image generation, persistent memory, todos, sub-agents, app databases, services — across three operating modes.
Deploy and manage Vultr cloud servers directly from the UI. Connect multiple nodes into a private P2P network and control them all from one chat.
Download, install, add your xAI key, and go. Zero registration, zero data collection, zero strings attached. MIT licensed.
Spin up cloud servers and link them into a private node mesh — all from the same chat interface you use for coding.
Pick a plan, region, and config profile — avacli provisions a Vultr instance with your agent pre-installed. Monitor balance, bandwidth, and billing in real time.
Every connected node sees every other node. Use @node mentions in chat to send commands to specific machines, or click a node to work directly in its remote UI.
Open any node’s full web interface from your local browser. It’s like sitting in front of the remote machine — edit files, run the agent, manage services — without SSH.
Three deploy profiles out of the box: Agent Only (minimal avacli install), Full LAMP Stack (Apache + MariaDB + PHP + Redis + avacli), or a blank server.
avacli isn’t just a chat interface. It’s a small self-hosted runtime for long-lived processes, per-app storage, and delegated agent work — all sandboxed, all xAI-native.
Host Discord bots, Python workers, or Node daemons as type:"process" services. Auto venv / npm ci bootstrap, exponential-backoff restart policies, live SSE log streams, and vault-backed env. Per-service workdir under ~/.avacli/services/.
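As an illustration of the restart policy, here is a minimal sketch of an exponential-backoff schedule; the base delay and cap below are assumptions for the example, not avacli's actual values:

```javascript
// Illustrative exponential-backoff restart schedule for a crashed service.
// baseMs and capMs are assumed values, not avacli's internal defaults.
function restartDelayMs(attempt, baseMs = 1000, capMs = 60000) {
  // Double the wait on every consecutive failure, up to a cap.
  return Math.min(baseMs * 2 ** attempt, capMs);
}

// attempts 0..3 → 1s, 2s, 4s, 8s; later attempts are clamped to the 60s cap
const schedule = [0, 1, 2, 3].map(a => restartDelayMs(a));
```

The cap keeps a permanently broken service from hammering the supervisor while still letting transient crashes recover quickly.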
Every agent-generated app gets its own authenticated database at ~/.avacli/app_data/<slug>.db, sandboxed with sqlite3_set_authorizer. Auto-injected _sdk.js exposes window.avacli.db, .main, and .ai — zero plumbing, no CORS, no secrets leaking to the browser.
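A sketch of what calling the injected SDK might look like from inside an app page. Only the window.avacli.db namespace comes from the description above; the method names (exec, query) are assumptions, backed here by an in-memory stub so the sketch runs outside the browser too:

```javascript
// Hypothetical usage of the auto-injected _sdk.js from an agent-generated app.
// exec/query are assumed method names; the stub below stands in for the real
// sandboxed per-app database when no browser window is present.
const win = globalThis.window ?? (() => {
  const stored = [];
  return {
    avacli: {
      db: {
        exec: async (_sql, params) => { stored.push({ body: params[0] }); },
        query: async _sql => stored.slice(),
      },
    },
  };
})();

async function saveNote(text) {
  await win.avacli.db.exec("INSERT INTO notes (body) VALUES (?)", [text]);
}

async function listNotes() {
  const rows = await win.avacli.db.query("SELECT body FROM notes");
  return rows.map(r => r.body);
}
```

The point of the shape: the page talks only to its own sandboxed database through the SDK, so no connection string, key, or secret ever reaches the browser.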
Delegate bounded work with spawn_subagent. Each child runs on its own thread with allowed_paths / allowed_tools whitelists and write leases so siblings can’t collide. Default-deny (max_depth=0) until you raise it in Settings; xAI-only by design.
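A hypothetical argument shape for spawn_subagent — the allowed_paths and allowed_tools field names come from the description above; the task text and tool names are illustrative:

```javascript
// Assumed spawn_subagent argument shape; only allowed_paths / allowed_tools
// are documented field names, the values here are made up for the example.
const childSpec = {
  task: "run the test suite and summarize any failures",
  allowed_paths: ["src/", "tests/"],          // whitelist; everything else denied
  allowed_tools: ["read_file", "shell_exec"], // bounded toolset for the child
};
```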
v2.3 re-engineers every request around the xAI prompt cache and adds a Batch API surface. Interactive chat reaches 70–90% cache hits from turn 2 onward; offline jobs stack a flat 50% batch discount on top.
The dynamic context (notes, todos, memory, edit history) is split into a separate system message after the stable prompt — so the prefix is identical across turns and users. Typical sessions go from ~0% cache hits to 70–90% from turn 2 onward.
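The layout described above can be sketched as follows; the message contents are illustrative, the structure (stable prompt first, mutable context in a second system message) is the documented idea:

```javascript
// Cache-friendly prompt layout: the stable system prompt is byte-identical
// on every turn; all mutable context (notes, todos, memory, edit history)
// lives in a second system message after it.
const STABLE_SYSTEM = "You are avacli, an autonomous coding agent."; // never changes

function buildMessages(dynamicContext, history, userMsg) {
  return [
    { role: "system", content: STABLE_SYSTEM },  // shared cacheable prefix
    { role: "system", content: dynamicContext }, // changes freely per turn
    ...history,
    { role: "user", content: userMsg },
  ];
}

const turn1 = buildMessages("todos: []", [], "hi");
const turn2 = buildMessages("todos: [fix login bug]", [], "continue");
// turn1[0] and turn2[0] are identical, so the provider's prefix cache can
// hit even though the dynamic block changed between turns.
```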
conv_id routing: a new X-Xai-Conv-Id header on chat completions plus a conv_id body field on the Responses API pin cache routing to the same shard for a conversation's lifetime. Conv IDs are minted on session create, persisted, and inherited by sub-agents so tool-heavy sessions stay warm.
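A minimal sketch of pinning cache routing with the persisted conversation ID. The X-Xai-Conv-Id header name is from the notes above; the surrounding request shape is an assumption:

```javascript
// Builds request options for a chat-completions call that pins cache routing.
// Only the X-Xai-Conv-Id header name is documented; the rest is illustrative.
function chatRequestInit(apiKey, convId, body) {
  return {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
      "X-Xai-Conv-Id": convId, // same ID for the session's whole lifetime
    },
    body: JSON.stringify(body),
  };
}
```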
Every response logs [cache] prompt=X cached=Y ratio=Z%. cached_tokens, reasoning_tokens, billable_prompt_tokens, and cache_hit_rate are exposed on SSE token_usage frames, the headless JSON summary, and the /usage dashboard.
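The log line format above is fixed; this small helper just shows how the ratio is derived from the raw usage numbers:

```javascript
// Reproduces the documented "[cache] prompt=X cached=Y ratio=Z%" log line.
// The rounding choice is an assumption for the example.
function cacheLogLine(promptTokens, cachedTokens) {
  const ratio = promptTokens
    ? Math.round((cachedTokens / promptTokens) * 100)
    : 0;
  return `[cache] prompt=${promptTokens} cached=${cachedTokens} ratio=${ratio}%`;
}
```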
The Responses API path now sends only new items (tool results, next user message) plus previous_response_id — xAI rebuilds history server-side from its 30-day retention. Transparent full-resend fallback on stale-ID errors; persisted in session state.
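The delta-sending strategy can be sketched like this; the previous_response_id and input field names follow the common Responses-API shape and are assumptions here, as is the fallback trigger:

```javascript
// Send only new items plus previous_response_id when an ID exists; on the
// first turn, or after the server rejects a stale/expired ID, fall back to
// resending the full history. Field names are assumed, not verified.
function buildPayload(prevResponseId, newItems, fullHistory) {
  if (prevResponseId) {
    return { previous_response_id: prevResponseId, input: newItems };
  }
  return { input: fullHistory }; // transparent full-resend fallback
}
```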
Chat-completions reasoning models stream reasoning_content through a dedicated callback; it's stored per-message and echoed back byte-identically on replay so the thinking stays inside the cached prefix. No more paying twice for chain-of-thought.
Flat 50% discount on /v1/messages/batches that stacks with prompt caching for up to ~20× lower effective cost on offline workloads. New batches table, 30s background poller, REST surface under /api/batches/*, and five agent tools starting with batch_submit.
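Illustrative cost math for stacking the two discounts. The flat 50% batch discount is stated above; the 95% cache-hit rate and the cached-token price ratio (5% of the base rate) are assumptions chosen for the example:

```javascript
// Effective per-token cost multiplier when prompt caching and the batch
// discount stack. hitRate and cachedPriceRatio are assumed example values.
function effectiveCostMultiplier(hitRate, cachedPriceRatio, batchDiscount) {
  const promptCost = (1 - hitRate) + hitRate * cachedPriceRatio;
  return promptCost * (1 - batchDiscount);
}

// 95% hits, cached tokens at 5% price, 50% batch discount:
// (0.05 + 0.95 * 0.05) * 0.5 = 0.04875 → roughly 20× cheaper than list price
const m = effectiveCostMultiplier(0.95, 0.05, 0.5);
```

The exact multiplier depends on the real cache-hit rate and cached-token pricing of the workload; this just shows how the two discounts compound multiplicatively.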
Three steps. No account creation, no onboarding wizard, no credit card.
Grab a binary from the downloads page, or build from source using the docs.
Get a free API key from console.x.ai and add it in Settings once the UI is running.
Launch the web UI, open it in your browser, and start building with your autonomous AI agent.
We believe AI tools should be free, transparent, and self-hosted. avacli gives you the full power of an autonomous agent without sending your code to a third party. You bring your own API key, you keep your data, and you run it wherever you want — laptop, desktop, or cloud VPS. Deploy nodes, link them together, and orchestrate your entire infrastructure from one chat.