Clustered Compute Resources
Goal: Incorporate general purpose compute hosts into a Kubernetes cluster. This will serve as a foundation for highly-available services.
Justification: High availability of compute resources means we can perform zero-downtime hardware or OS maintenance, which improves system reliability.
Components:
-
Initial k3s control plane (3x Wyse 5070 thin clients). We've already built this, but still working on getting the networking configured.
-
Fighter. This will require significant downtime as Fighter must be migrated from the legacy Debian platform to a new NixOS configuration. Additionally, Fighter is awaiting installation of an AMD Instinct MI60 for AI workloads.
-
Cloud workers. Adding elastic compute resources via public cloud is the baseline for auto-scaling. Networking will be a significant challenge.