Job Summary
A company is looking for a Sr. Manager, SRE to lead a global team responsible for the performance, availability, and reliability of distributed services and infrastructure.
Key Responsibilities:
- Build, lead, and mentor a team of SREs across multiple regions and time zones
- Own the end-to-end reliability of critical customer-facing services and establish SLOs and SLIs
- Expand automation in deployment, monitoring, and incident response to reduce toil
Qualifications:
- Bachelor's degree in Computer Science, Engineering, or related field (Master's preferred)
- 10+ years in infrastructure, reliability, or operations engineering roles
- 5+ years in people leadership with experience managing managers and global teams
- Deep expertise in Linux operating systems and strong knowledge of distributed systems and cloud platforms
- Proficiency with automation tools and familiarity with containers and microservices architectures
Comments