The Cloud guide gives you three clean blocks — a rented Linux box, a managed model backend, an hourly GPU. This page is the honest second pass: why each block was framed the way it was, what else you could have done, what we deliberately left out, the trending tools worth a look, how to get ready for Mythos, and how to lock cloud compute down before the bill — or an open port — bites you.
Written May 2026. The AI-tooling world moves weekly — this is a snapshot of the landscape and the reasoning, not gospel.
The guide is three self-contained blocks, not a six-step spine. Each made one choice for clarity. Here's the reasoning and the roads not taken.
We chose: create an always-on Ubuntu VM in a provider's console, then treat it exactly like the Linux guide.
Why: sameness is the whole point — once you SSH in there's no special "cloud" version of these tools, so the Linux guide is your real manual and the cloud is just the host.
Alternatives worth knowingWe chose: the big-three managed backends (Vertex AI, Bedrock, Azure), plus OpenRouter as the easy on-ramp, and showed how to point Claude Code at Bedrock/Vertex.
Why: these are the paths a business uses to route AI billing and data through its own cloud account.
Alternatives & notesCLAUDE_CODE_USE_BEDROCK=1 (plus AWS region + credentials) or CLAUDE_CODE_USE_VERTEX=1 (plus GCP project + region) before launching. That routes the agent's engine through your own cloud account instead of a personal subscription.We chose: rent a GPU by the hour (RunPod / Lambda / Vast), install Ollama + Tailscale, pull a big model.
Why: occasional serious horsepower without buying a $5,000 card you'd rarely use.
Alternatives & trade-offsIn the cloud the editor question splits in a way it doesn't anywhere else: you can bring the editor (your laptop's VS Code talks to the cloud VM over Remote-SSH), or you can buy the editor as a service (GitHub Codespaces, GitPod, etc. — full IDE in a browser tab, billed by the minute). Both work; they trade ownership for convenience and add a new failure mode the other OS guides don't have: the bill. Here's the honest breakdown of every path that's still alive in May 2026.
The four real paths in the cloud~/.ssh/config for one-click connect..devcontainer/devcontainer.json to your repo to pin the toolchain, extensions, and post-create scripts — your next codespace boots into a configured environment, not a blank Linux.curl -fsSL https://code-server.dev/install.sh | sh.127.0.0.1:8080, not 0.0.0.0: edit ~/.config/code-server/config.yaml and set bind-addr: 127.0.0.1:8080.http://your-vm-tailnet-name:8080 from a tailnet device. Never publish 8080 with a public name; never put it behind a cloud reverse proxy with a public address.systemctl --user enable --now code-server); never as root.devcontainer.json) — no more "works on my laptop."us-east from Hot Springs is fine; from Lisbon it's noticeable. code-server in your own VM lets you pick the region.The guide is deliberately three clean blocks. That clarity has a cost: real omissions — and in the cloud, the omissions are mostly the things that protect your wallet and your data. Here they are, honestly, with the reason each was cut.
| Left out | What it is | Why it was cut |
|---|---|---|
| Hermes Agent | Nous Research's model-agnostic coding agent — points at Bedrock, Vertex, OpenRouter, or a direct API | Was an oversight in v1. Now noted on the cloud page as the natural pairing for managed backends. A good reminder the big-three SDKs aren't the whole field. |
| Cost controls / budget alerts | Spending caps and email alerts you set in the provider console | The #1 real cloud mistake is a runaway bill. The guide warns about it in prose; it deserved to be its own step. This is the single most important thing on the whole topic — see Security below. |
| IAM least-privilege roles | Giving each user/tool only the permissions it needs, instead of full owner access | Cut to keep first-login simple. But daily work on root/owner credentials is how one leaked key becomes a total account takeover. |
| Security groups / firewall rules | The cloud's per-VM firewall that decides which ports are reachable from the internet | The provider's defaults are often more open than you'd want. Skipped for clarity, but leaving SSH or Ollama open to the world is a real, common exposure. |
| Cloud secret managers | Secret Manager (GCP), Secrets Manager / Parameter Store (AWS), Key Vault (Azure) | The guide puts keys in plaintext env vars to stay simple. A managed secret store is the grown-up answer — encrypted, audited, rotatable. |
| The privacy / data-residency tradeoff | What it means that your code and prompts leave your premises when you call a managed backend | Glossed over. Routing through Bedrock/Vertex is great for billing — but your data now travels to and is processed in someone else's data center. A real tradeoff, not a free lunch. |
| Remembering to DESTROY GPU instances | Tearing down (not just stopping) the hourly GPU when you're done | Warned in Block 3, but worth repeating as its own line: a "stopped" instance can keep charging. Destroy it to stop paying and shrink your attack surface. |
Scanning the developer conversation on X and GitHub in May 2026, here's what's hot that the guide doesn't yet mention. All run on Windows (most happiest in WSL2).
The open-source CLI agent everyone's talking about — 150K+ stars, ~6.5M monthly devs. LSP integration, multiple parallel sessions, shareable session links. The strongest "free, bring-your-own-model" alternative to Claude Code.
A terminal that's also an agent cockpit — runs Claude Code, Codex, and others in one windowed UI with panes. Nice if the bare terminal feels stark.
Goose (from Block) and OpenHands are open-source autonomous agents that take a goal and run a long multi-step job. The frontier of "set it and walk away."
93K+ stars. A "spec-driven development" workflow that teaches any agent (Claude Code, Copilot, Gemini, etc.) to plan before it codes. Tessl and Kiro play in the same space.
The plug-ins that matter: chrome-devtools-mcp (let an agent drive Chrome), filesystem, GitHub, database connectors. This is the fastest-moving, highest-leverage area right now.
An agentic open model with a 1M-token context and MCP-native tool use — a serious local option for Ollama if your hardware can handle it.
Mythos is Anthropic's first model specialized for one domain: defensive cybersecurity. Announced April 7 2026 as the engine of Project Glasswing, it has already found a 27-year-old vulnerability in OpenBSD and bugs in FFmpeg. It is invitation-only ($25 / $125 per million tokens), shipped to 12 founding orgs and 40+ critical-infrastructure partners — not a download. Full briefing →
So "getting ready" isn't an install — it's preparing your environment so that when domain-specialized models (Mythos and the wave behind it) open up, you can point them at something useful:
In the cloud you're renting machines that bill by the hour, holding credentials that can spend money, and exposing services to the public internet by default. That's a lot of ways to get hurt. Here's how to keep it from biting you — cloud specifics first, then the universal rules that apply everywhere.
ANTHROPIC_BASE_URL setting and exfiltrate your API key in plaintext. Anthropic patched it before disclosure — the lesson stands: keep Claude Code updated, install only from official sources, and be suspicious of any config that reroutes where a tool "phones home." This matters double in the cloud, where that key may be a cloud credential that can spin up paid resources.0.0.0.0 — that's the whole internet. The provider's default firewall is often more open than you'd want; tighten it.export … env vars are fine to learn with, but a key sitting in plaintext on the box leaks the moment that box is compromised or the shell history is read..env files, SSH keys, .secrets, and certificates — and not to read its own config. Start in ask-before-acting mode on a new box; don't run "auto/yolo" modes in a folder full of irreplaceable files. Work inside a project folder, not the whole disk.npm install -g and curl … | sh run other people's code. Use only the exact official sources; don't paste install one-liners from random blogs or X replies. Keep everything updated.OLLAMA_HOST=0.0.0.0:11434 exposes your model server to the network — only do it behind Tailscale, never a public IP.