Job Summary
A company is looking for a Staff Site Reliability Engineer (SRE).
Key Responsibilities
- Drive system design discussions and software development within engineering teams
- Manage monitoring, observability, and alerting for containerized applications to ensure performance and reliability
- Collaborate with cloud and platform engineers to build CI/CD pipelines and standardize deployment patterns
Required Qualifications
- Strong understanding of cloud platforms (AWS and GCP), Kubernetes, and containerization
- Knowledge of distributed systems, system design, and capacity planning
- Experience with metrics gathering, monitoring, and automation in CI/CD
- Familiarity with risk management and change processes
- Strong understanding of distributed systems architecture and testing strategies