Job Summary
A company is looking for a Staff Engineer, Production Operations.
Key Responsibilities
- Lead the design and implementation of cloud infrastructure and SRE practices for high availability and performance
- Act as a technical authority for complex SRE and cloud engineering challenges, providing expert guidance
- Develop and optimize cloud infrastructure using Infrastructure as Code and automation tools
Required Qualifications
- Expert-level knowledge of AWS services (e.g., EC2, S3, EKS)
- Mastery of Infrastructure as Code tools, primarily Terraform
- Strong experience with Docker and Kubernetes in production environments
- Ability to apply SRE principles (SLIs, SLOs, error budgets) in production
- Experience in designing and optimizing monitoring, logging, and alerting systems
Comments