Three-node k3s HA control plane with embedded etcd quorum #31
Originally filed by @coilysiren on 2026-05-22T07:17:21Z - https://github.com/coilysiren/infrastructure/issues/237
Goal - Move the k3s homelab onto a dedicated, highly-available control plane: 3 Linux server nodes running embedded etcd, tolerant of any single node going down.
Surfaced by - 2026-05-21 night planning session. Kai wants kai-server, the desktop tower, and a "third thing" to each validate the other two. The concept is correct (etcd Raft quorum). One correction below changes the topology.
Load-bearing constraint - the Windows tower cannot be a control-plane node
K3s server (control-plane) nodes are Linux-only. There is no Windows k3s server. Windows can join a cluster only as an agent (worker) node. So the tower is a GPU agent, not an etcd member. This is also the right call on its own merits: the tower is a part-time gaming box, and an etcd member that drops out every time Kai boots into a game would churn quorum.
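The quorum arithmetic behind the goal and the constraint above can be sketched in a few lines. This is only an illustration of the standard Raft majority rule (quorum is a strict majority of voting members); the function names `quorum` and `failures_tolerated` are made up for this sketch and are not part of k3s or etcd.

```python
def quorum(members: int) -> int:
    """Smallest strict majority of a cluster with `members` voting members."""
    return members // 2 + 1


def failures_tolerated(members: int, offline: int = 0) -> int:
    """How many more members can fail before quorum is lost.

    `offline` counts members that are already down (for example a
    part-time box that has been rebooted into another OS).
    A negative result means quorum is already lost.
    """
    return (members - offline) - quorum(members)


if __name__ == "__main__":
    for n in (1, 2, 3, 5):
        print(f"{n} members: quorum={quorum(n)}, tolerates={failures_tolerated(n)}")
    # Three members, one of them currently offline: quorum still holds,
    # but there is no remaining margin for a second failure.
    print("3 members, 1 offline: tolerates", failures_tolerated(3, offline=1))
```

With three members the quorum is two, so any single node can go down. If one of the three were a part-time machine, the cluster would spend every one of its absences with zero margin, which is the practical cost of making the tower an etcd member even if it were possible.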
Corrected topology
--cluster-init, the other two join with --server.
Cost-benefit
Proposed work
--cluster-init.
coilysiren/agentic-os, infra-specific to coilysiren/infrastructure. No ad-hoc one-off setup.
Related - blocks/enables the OpenClaw-on-tower deploy. Sibling of infrastructure #73 (self-hosted LLM stack).
Iceboxed in the 2026-05-29 backlog burn-down: three-node HA control plane, far-future hardware play. Reopen anytime if it becomes real.