Artificial Intelligence Engineer - Distributed Inference
14 days ago
Birmingham
Experience with distributed inference engines: Ray Serve, Triton Inference Server, vLLM, SLURM 🌐. Design and implement high-performance distributed inference systems for running large language models and multimodal AI models at scale 👷. Collaborate with the team to build and improve our distri