Lead AI Engineer (Gen AI Platform Services, Python, Kubernetes)
3 months ago
San Francisco
LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang* Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost* Passion for stayin