Job Summary
A company is looking for a Director, Production Operations.
Key Responsibilities
- Lead and scale Site Reliability Engineering (SRE) and Cloud Engineering functions
- Define and execute strategic roadmaps that align technology with business needs
- Foster a culture of operational excellence and continuous learning from incidents
Required Qualifications
- Proven leadership in mentoring SRE and Cloud Engineering teams
- Deep expertise in SRE best practices, including SLIs/SLOs and error budgets
- Hands-on experience with AWS and cloud infrastructure architecture
- Strong incident management skills and experience leading triage efforts
- Passion for automation and improving system reliability through tooling
Comments