Associate Technical Architect - Platform Engineering
2 days ago
Leeds
About Quantiphi: Quantiphi is an award-winning Applied AI and Big Data software and services company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine-learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed. Quantiphi has seen 2.5x growth YoY since its inception in 2013, we don’t just innovate - we lead. Headquartered in Boston, with 4,000+ professionals across the globe. Quantiphi leverages Applied AI technologies across multiple a. Industry Verticals (Telco, BFSI, HCLS etc.) and is an established Elite/Premier Partner of NVIDIA, Google Cloud, AWS, Snowflake, and others. We have been recognized with: • 17x Google Cloud Partner of the Year awards in the last 8 years, • 3x AWS AI/ML award wins, • 3x NVIDIA Partner of the Year titles, • 2x Snowflake Partner of the Year awards, • Recognized Leaders by Gartner, Forrester, IDC, ISG, Everest Group and other leading analyst and independent research firms, • We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators, • We have been certified as a Great Place to Work for the third year in a row- 2021, 2022, 2023 Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation. Your next big opportunity starts here! For more details, visit: Website or LinkedIn Page. Role: Associate Technical Architect - Platform Engineering Experience Level: 7+ years Employment type: Full Time Location: Remote (UK) Description: We are seeking a highly skilled Platform Architect to design, optimize, and scale infrastructure for GenAI workloads. The ideal candidate will have deep hands-on experience in GPU profiling, parallelization strategies, and scheduling compute-intensive jobs using Slurm and Red Hat OpenShift. This role also includes supporting the build-out of GenAI platform foundations and contributing to customer-facing projects in production environments. Key Responsibilities: • Design and implement infrastructure architectures for LLM and GenAI workloads on multi-GPU systems., • Perform GPU profiling, benchmarking, and performance optimization across distributed training workloads., • Manage and schedule jobs on Slurm-based clusters and containerized environments like Red Hat OpenShift/Kubernetes., • Enable and optimize NVIDIA GPU stack components (CUDA, cuDNN, NCCL, Triton Inference Server, RAPIDS, etc.) for GenAI and DL workloads., • Collaborate with data scientists, MLOps, and application teams to deploy large-scale models in both research and production settings., • Build secure and scalable GenAI pipelines supporting fine-tuning, RAG, multi-modal inferencing, and LLMOps., • Create reusable infrastructure templates (e.g., Terraform, Helm charts) for deploying GPU-ready environments., • Contribute to internal capability development (e.g., workshops, PoCs) and translate solutions to client delivery engagements. Required Skills: • Strong expertise in Slurm job scheduler and distributed training environments., • Experience with Red Hat OpenShift and/or Kubernetes-based orchestration., • Knowledge of NVIDIA GPU ecosystem – CUDA, cuDNN, NCCL, Nsight Systems, and NVIDIA Triton or TensorRT., • Proficiency in Linux systems, performance tuning, and resource optimization in multi-GPU clusters., • Exposure deploying GenAI workloads including LLM fine-tuning, RAG pipelines, and multi-modal systems., • Familiarity with infrastructure-as-code tools (e.g., Terraform, Ansible)., • Exposure to cloud GPU environments (GCP, Azure, AWS, OCI) and on-premise GPU cluster setups. Preferred Skills: • Experience with NVIDIA NIMs, DGX systems, and/or GPU-accelerated containers., • Knowledge of LLMOps frameworks and MLOps integration for GenAI pipelines., • Familiarity with vector databases and retrieval systems used in RAG architectures., • Ability to collaborate with AI solution teams and participate in client-facing technical engagements. What is in it for you: • Make an impact at one of the world’s fastest-growing AI-first digital engineering companies., • Upskill and discover your potential as you solve complex challenges in cutting-edge areas of technology alongside passionate, talented colleagues., • Work where innovation happens - work with disruptive innovators in a research-focused organization with 60+ patents filed across various disciplines., • Stay ahead of the curve—immerse yourself in breakthrough AI, ML, data, and cloud technologies and gain exposure working with Fortune 500 companies.