Job Summary
A company is looking for a Manager, Site Reliability Engineering.
Key Responsibilities
- Support high uptime and reliability of SaaS offerings 24x7
- Lead a remote US team and coordinate efforts to reduce manual work and define SLOs
- Manage incident response processes and maintain FEDRAMP production instances
Required Qualifications
- Bachelor's degree or equivalent work experience
- 5+ years of software development and operational leadership experience
- 3+ years of experience with public cloud platforms, preferably AWS and Kubernetes
- Strong understanding of microservice architecture and distributed software systems
- Experience with cloud native infrastructure components in a shared environment
Comments