Job Summary
A company is looking for a Site Reliability Engineer.
Key Responsibilities:
- Manage AWS infrastructure using Terraform/Terragrunt, optimizing cost, reliability, and scalability
- Deploy and maintain Kubernetes clusters, ensuring high availability and zero downtime
- Implement monitoring and alerting systems, define SLAs, and lead incident response efforts
Required Qualifications:
- 3+ years of experience in Site Reliability Engineering or related roles
- Strong AWS experience (EC2, S3, EKS, RDS, Lambda, etc.)
- Proficiency in Kubernetes, Helm, Kustomize, Docker, and Terraform
- Experience with observability tools such as Prometheus, Grafana, and New Relic
- Optional: AWS Solutions Architect or Kubernetes Administrator/Developer certification