Job Summary
A company is looking for a Senior/Staff Site Reliability Engineer (SRE).
Key Responsibilities:
- Administer and maintain container orchestration platforms and workloads
- Monitor and troubleshoot production systems, participating in on-call rotations
- Drive observability improvements by enhancing monitoring, logging, and alerting capabilities
Required Qualifications:
- 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles
- Proven success leading the operation of large-scale production systems in cloud environments
- Advanced proficiency in Kubernetes administration and troubleshooting
- Bachelor's degree in Computer Science, Engineering, or a related technical field
- Strong programming and automation skills in Python and Bash