AI Inference Engineer (London)
hace 16 días
London
Our current stack includes Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference. - Develop APIs for AI inference used by internal and external customers - Benchmark and add...