Job Summary
A company is looking for a Site Reliability Engineering Manager to lead their site reliability engineering team.
Key Responsibilities
- Build, lead, and mentor a high-performing team of SREs while establishing technical vision and strategy
- Develop and execute long-term infrastructure and reliability strategies, including establishing reliability standards and driving architectural decisions
- Collaborate with cross-functional teams to embed reliability practices and drive incident response processes
Required Qualifications
- 3+ years of experience managing and leading technical teams of 5 or more
- 8+ years of experience in site reliability, platform engineering, or infrastructure roles
- Deep expertise with cloud platforms, particularly Google Cloud Platform (GCP)
- Strong proficiency in multiple programming languages (Python, Go, Java, etc.) and extensive experience with containerization and orchestration
- Expert-level knowledge of Infrastructure as Code and advanced understanding of monitoring and distributed systems architecture
Comments