GCP Platform & SRE Engineer (Apigee) - Manchester OR Leeds OR Halifax (Hybrid) - IR35
4 days ago
Leeds
GCP Platform Engineer/Site Reliability Engineer (SRE)

Location: Manchester OR Leeds OR Halifax (2-3 Days a week onsite is mandatory)
Duration: 6 months + Extension
Budget: £350 - £400 per day, all inclusive

THIS PROJECT IS INSIDE IR35

We are seeking a hands-on GCP Platform Engineer/SRE to design, implement, and operate a secure, automated cloud platform supporting API-first workloads and enterprise integration services. This role focuses on GCP-native infrastructure, API platform engineering, and SRE reliability practices.

Core Responsibilities
• Engineer and operate secure GCP platform infrastructure using Infrastructure as Code
• Build and maintain GCP API Management (Apigee) infrastructure and API Gateway capabilities
• Design and implement networking, load balancing, and edge security (Cloud Armor)
• Deploy and operate GKE clusters and container platforms in production
• Develop reusable Terraform modules and modular IaC patterns
• Implement and maintain CI/CD pipelines for infrastructure and platform components
• Embed secure-by-design controls across the platform life cycle
• Define and manage SLOs, SLIs, and error budgets
• Implement observability as code and actionable monitoring
• Improve reliability and reduce operational toil through automation and performance optimisation

• Strong hands-on experience with Google Cloud Platform (GCP)
• Deep experience with Apigee API Management infrastructure
• Proven knowledge of GCP Networking, VPC design, Cloud Armor, and Load Balancers
• Production experience operating Google Kubernetes Engine (GKE)
• Strong Terraform expertise with modular and maintainable IaC design
• Experience building and operating CI/CD pipelines (Jenkins, GitHub Actions, Harness, etc.)
• Strong understanding of cloud security, IAM, and API security standards (REST/OpenAPI, AuthN/AuthZ, mTLS, certificate life cycle)
• Hands-on experience with observability tooling and monitoring platforms
• Experience defining and operating to SLO/SLI-based reliability models
• Experience with HashiCorp Vault
• Familiarity with service mesh technologies (Istio or similar)
• Experience with Dynatrace SLO-based monitoring or observability-as-code approaches
• Exposure to Backstage or internal developer platforms
• Strong platform engineering and automation mindset
• Experience operating large-scale production cloud environments
• Passion for reducing toil and improving reliability
• Excellent debugging and root cause analysis skills
• Comfortable working in cross-functional engineering environments