Job Summary
A company is looking for a Site Reliability Engineer 4.
Key Responsibilities
- Manage system availability, health, and service levels of large-scale cloud infrastructure
- Proactively monitor and diagnose production issues across microservices and distributed platforms
- Participate in on-call rotation and manage incident lifecycle, including reporting and resolution
Required Qualifications
- Bachelor's degree in Computer Science or Computer Engineering or equivalent
- Minimum 5 years of DevOps/SRE experience
- 3 years' experience with AWS and/or GCP
- Technical experience with EC2, IAM, S3, Kubernetes, Jenkins, and CloudWatch
- General understanding of distributed systems and data management technologies
Comments