Job Summary
A company is looking for a Senior Site Reliability Engineer to support its digital learning platforms.
Key Responsibilities
- Design, develop, and troubleshoot large-scale, distributed, event-driven cloud systems for high availability and performance
- Coordinate and implement infrastructure and software improvements for resiliency and scalability
- Maintain and enhance infrastructure-as-code and monitoring-as-code to keep automation transparent
Required Qualifications
- 5+ years of experience in SRE, DevOps, or Software Engineering roles
- Deep expertise in the AWS ecosystem, particularly with ECS, RDS, EKS, IAM, and CloudWatch
- Expertise with Terraform for managing scalable cloud infrastructure
- Proficiency with CI/CD pipelines and with managing software delivery lifecycles
- Strong familiarity with telemetry and observability tools