Site Reliability Engineering (SRE) Manager
7 hours ago
Key Responsibilities
- Lead major incidents, mitigation, RCA, and preventative improvements
- Own and refine SLIs, SLOs, and error budgets
- Reduce operational toil through automation
- Deep-dive Linux debugging, performance tuning, and systems analysis
- Strengthen observability, monitoring, and alerting
- Pro