Kubernetes Linux AIOps Engineer – Elite Quant Hedge Fund
4 days ago
City of London
A market-leading Quantitative Hedge Fund is seeking an Infrastructure DevOps Engineer / SRE with expertise in Kubernetes, Linux, Observability, IaC and AIOps to support further business growth. Our client is one of the world's elite Quant Hedge Fund managers, running large-scale, massively distributed systems, with ample opportunity for engineering at scale on big-ticket projects in an environment of nearly 100,000 Kubernetes cores. The business is lean, flat and non-hierarchical, with very few silos compared with many of its industry peers, so engineers take ownership of projects and relationships end-to-end across the firm. This operating model relies on strong collaboration and communication skills.

This is a newly created Platform Infrastructure team, separated out of a Core Infrastructure function, with a focus on Core Platform Engineering, DevOps / Developer Tooling, and AI Tooling / AIOps, for example building an automated CI/CD / GitLab deployment platform for AI applications. The team collaborates heavily with diverse stakeholders: typically peer Technology teams (Data, Software, Infrastructure...), Quantitative Development, and business users in Quantitative Research and elsewhere across the firm.
Responsibilities
• Collaboratively architect a rock-solid and secure Kubernetes platform that can handle the huge volumes of data and load of our diverse technology estate.
• Accelerate the migration strategy to more cloud-native, distributed applications.
• Enhance and simplify the on-prem stack and its integrations with the hybrid Kubernetes setup.
• Create, implement, and evangelize the "Infrastructure as Code" mindset and best practices across the environment.
• Eliminate the toil that emerges with large, distributed systems by automating where possible.
• Work both as an individual contributor and collaboratively to find new ways of improving the reliability, availability, and performance of the infrastructure.

Requirements
• Infrastructure Engineering: a minimum of 8 years' professional experience.
• Kubernetes & Docker: expertise including knowledge of internals and experience of planning around production environments and deployments. Deep knowledge of Kubernetes is essential, as the platform is a growing presence and is critical to many parts of the business.
• Linux: including debugging and command-line skills. RHEL preferred. Broad knowledge across network technologies, server virtualisation and storage.
• Programming / IaC: proficiency in at least one language (Python, Go, Bash, Terraform, C...). Must be able to write high-quality automation / scripts from scratch.
• Configuration Management Tools (Ansible / Puppet / Kapitan / Terraform...).
• Observability: experience within the modern open-source ecosystem (ELK, OpenTelemetry, LGTM stack, Prometheus, Grafana, Loki...).
• CI/CD and GitLab / GitOps: working with Development teams. A track record in Engineering for Developer Experience / Developer Tooling would be highly desirable.
• Communication: articulate, empathetic, curious and positive; enjoys working on a variety of problems in a fast-paced environment with diverse teams, with drive and passion for Engineering and Technology.

This is an outstanding opportunity to join a highly successful and elite Quantitative Hedge Fund Manager, working with large-scale, bleeding-edge technology and collaborating with diverse, world-class colleagues. Compensation is market-leading, as is the firm's record of employee retention and tenure. Our client requires a four-day onsite commitment in London SW1.