Job Summary
A company is looking for a Site Reliability Engineer to join their Infrastructure team.
Key Responsibilities
- Design and maintain AWS infrastructure for over 1M daily users while ensuring high availability and cost optimization
- Build monitoring systems and lead incident response, including root cause analysis
- Research new technologies and implement practices to support scaling and resilience
Required Qualifications
- 1+ years in SRE/DevOps roles with production responsibilities
- Strong expertise in AWS services such as EC2, RDS, and Lambda
- Proficiency in Infrastructure as Code using Terraform for 2+ years
- Experience with monitoring tools and CI/CD pipeline design
- Proficient in scripting languages like Python, Bash, or Go
Comments