Site Reliability Engineer
4 days ago
Implement and maintain workflows and tools (CI/CD, containerization, orchestration, monitoring, logging and alerting systems) for both our client-facing APIs and large training runs. Design, build, and maintain scalable, highly available and fault-tolerant infrastructures to support our web services