Job Summary
A company is looking for a Head of Site Reliability Engineering.
Key Responsibilities
- Lead and mentor a high-performing Site Reliability Engineering (SRE) team, driving initiatives to enhance service reliability and performance
- Oversee the entire lifecycle of services, ensuring high availability and scalability while supporting system design and deployment
- Conduct postmortems to analyze incidents, implement preventive measures, and collaborate with engineering teams on automation and performance testing
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field
- At least 8 years of experience in Reliability Engineering, DevOps, or infrastructure-focused roles
- Proven track record of leading and managing a high-performing SRE team
- Experience with coding in Python, Rust/C++, or JavaScript, and familiarity with cloud architecture
- Knowledge of SRE principles and experience with Docker containers and orchestration technologies
Comments