Clustered Compute Resources

Goal: Incorporate general-purpose compute hosts into a Kubernetes cluster that will serve as the foundation for highly available services.

Justification: With highly available compute resources, we can take individual hosts offline for hardware or OS maintenance without service downtime, which improves system reliability.

Components:

  • Initial k3s control plane (3x Wyse 5070 thin clients). This is already built, but we are still working on getting the networking configured.

  • Fighter. This will require significant downtime, as Fighter must be migrated from the legacy Debian platform to a new NixOS configuration. Fighter is also awaiting installation of an AMD Instinct MI60 for AI workloads.

  • Cloud workers. Adding elastic compute resources from a public cloud is the prerequisite for auto-scaling. Networking is expected to be a significant challenge.

No due date
0% Completed