Infrastructure Support (Kubernetes) up to 60k
5 days ago
Cambridge
We're looking for a motivated Infrastructure Support Engineer. You'll look after a compact but unusually broad infrastructure footprint that underpins hardware design, security verification, FPGA-based testing, CI/CD for silicon projects, and collaborative open-source workflows.

This is a fantastic opportunity for a junior to mid-level engineer who thrives on learning new technologies quickly and enjoys making incremental improvements in a mission-focused setting. You'll gain hands-on experience across declarative infrastructure, zero-trust networking, hybrid cloud/on-prem Kubernetes, and custom tooling, while contributing directly to the productivity of engineers building foundational open-source silicon.

Key Responsibilities
• Help maintain and troubleshoot our on-prem Kubernetes cluster, which powers hardware simulation, verification, and test workloads
• Support our Nebula overlay network as we build toward a robust zero-trust model across distributed engineering teams
• Operate and enhance self-hosted GitHub Actions runners, including an FPGA test rig that handles real hardware requests from CI workflows
• Maintain and gradually improve our Python-based internal tooling (used daily by hardware and software engineers): refactoring for maintainability, adding features, and applying better software practices where they add value
• Manage NixOS servers through declarative configurations for reproducible, reliable deployments
• Provision and manage Google Cloud resources (VMs, GKE-based runners, storage, networking) using Terraform / OpenTofu
• Monitor system health and respond to alerts
• Collaborate with hardware engineers, security specialists, and open-source contributors to debug issues spanning CI pipelines, FPGA rigs, cloud environments, and internal scripts
• Document setups, workflows, and lessons learned to help onboard new team members and partners

Required Skills & Experience
• Strong Linux fundamentals and comfort administering production-like systems
• Practical Python experience for scripting, automation, and maintaining existing codebases (you'll read and improve real-world Python tools regularly)
• Hands-on work with Kubernetes (basic operations, troubleshooting, YAML manifests)
• Experience with Infrastructure as Code tools like Terraform / OpenTofu (or similar)
• Solid Git/GitHub knowledge, ideally with Actions or CI/CD pipelines
• Curiosity and the ability to ramp up on new tools and concepts quickly
• Clear communication skills for working across hardware, software, and infrastructure domains

Highly Desirable
• Any exposure to Nix or NixOS (personal projects welcome)
• Interest in or experience with Rust for performance-sensitive tooling
• Familiarity with Google Cloud (GCE, GKE, IAM)
• Knowledge of overlay/VPN technologies (Nebula, Tailscale, WireGuard) or zero-trust principles
• Previous involvement in CI for hardware/FPGA testing or open-source projects